Open Nissley Dan Phd Thesis Final.Pdf

The Pennsylvania State University The Graduate School Department of Chemistry MODELING AND PREDICTING CO-TRANSLATIONAL PROTEIN FOLDING WITH CHEMICAL KINETIC AND MOLECULAR DYNAMIC SIMULATIONS A Dissertation in Chemistry by Daniel A. Nissley © 2019 Daniel A. Nissley Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy May 2019 The dissertation of Daniel A. Nissley was reviewed and approved* by the following: Edward P. O’Brien Assistant Professor of Chemistry and of the Institute for CyberScience Dissertation Adviser Chair of Committee Phillip C. Bevilacqua Head of Chemistry Department Distinguished Professor of Chemistry and Biochemistry and Molecular Biology William G. Noid Associate Professor of Chemistry Reka Albert Distinguished Professor of Physics and Biology *Signatures are on file in the Graduate School ii ABSTRACT Proteins are linear polymers of amino acids that perform myriad functions within living cells. Most proteins must form a specific three-dimensional structure, often referred to as the native state, in order to perform their biological function. The process of reaching the native state is termed protein folding. It is often assumed that thermodynamics alone determines the specific conformation of the native state of a protein, meaning that its structure is determined by the amino-acid sequence alone. Within living cells, however, proteins are translated based on mRNA templates by the ribosome in a distinctly out-of- equilibrium fashion. The non-equilibrium nature of translation means that kinetics can trump thermodynamics in determining the conformation of a protein, and recent experiments indicate that seemingly small changes to the kinetics of translation can radically alter protein structure and function and even lead to disease. One way in which the speed of translation can be perturbed is through synonymous codon mutations, which change the rate of translation but not the primary sequence of the nascent protein that is produced. Understanding how the non-equilibrium nature of protein synthesis influences the likelihood that a given protein will correctly fold and function is therefore critical to understanding protein biogenesis. This thesis contains four theoretical and computational studies of co-translational protein folding and its influence on protein conformations after synthesis is complete. The state of the experimental and computational literature is summarized in Chapter 1. In Chapter 2 I describe a chemical kinetic model that is able to accurately predict experimental co-translational folding probabilities for the first time. This chemical kinetic model is used to make the novel prediction that some yeast proteins which typically fold post- translationally can be made to fold co-translationally by recoding their mRNA sequences to contain the slowest-translating synonymous codon at each codon position. This chemical kinetic model is general, in principle, for all proteins and all organisms. The fluorescent technique FRET has recently been used to monitor protein folding on the ribosome both in vitro and in vivo. One disadvantage of such fluorescent techniques is that they produce a single value per time point, providing minimal structural information. In Chapter 3, I present results from low-friction, coarse-grain Langevin dynamics simulations that elucidate the structural origins of FRET measurements on the ribosome. These simulations are in strong agreement with experimental time series and reveal the underlying co-translational folding trajectory at a spatial resolution of 3.8 Å. I also show that the alternative hypothesis that nascent chain compaction occurs due to collapse as is expected for a polymer in poor solvent is not consistent with the experimental data. Dimensional collapse, however, could not be ruled out. I therefore suggest alternative dye positions that could be used to differentiate domain folding and domain dimensional collapse in future experiments. Chapter 4 of this thesis presents the hypothesis that the pathogenesis of Huntington’s Disease is, at least in part, due to the dysfunction of a co-translational process. The genetic cause of Huntington’s Disease is the expansion of a poly-glutamine region in Exon 1 of the HTT gene. Individuals with 35 or more CAG codons (which encode the iii amino acid glutamine) in their HTT gene will suffer disease symptom onset within a typical human lifespan. The age of symptom onset decreases linearly as the number CAG codons increases beyond 35. Based on strong circumstantial experimental evidence and a simple kinetic model, I argue that the expansion of the CAG codon region in the transcript leads to an increase in the speed of translation at a key time during the synthesis of huntingtin protein that leads to its misprocessing and the onset of disease symptoms. I go on to propose experiments to test this hypothesis. In Chapter 5 I describe high-throughput simulations of the translation elongation, translation termination, and post-translational dynamics of a representative subset of the E. coli cytosolic proteome. I find that roughly one in four proteins is kinetically trapped for as long as 3 minutes after the completion of protein synthesis. Finally, in Chapter 6 I summarize the conclusions that can be drawn from, and the future directions related to, my work. One obvious future goal is the extension of my simulations of multi-domain E. coli proteins to the study of how domain interfaces influence protein folding. That is, do domains fold independently or do they rely upon one another? My E. coli proteome data set also includes diverse information about translation termination kinetics, and preliminary simulations reveal that electrostatic effects seem to largely determine its timescales. In summary, the results presented in this thesis advance understanding of co-translational protein folding and the influence translation kinetics can have on protein conformations. iv TABLE OF CONTENTS LIST OF FIGURES ............................................................................................................x LIST OF TABLES .......................................................................................................... xiii ABBREVIATIONS ........................................................................................................ xiv ACKNOWLEDGEMENTS ..............................................................................................xv Chapter 1: INTRODUCTION .............................................................................................1 1.1 Kinetics can be more important than thermodynamics for protein folding ..............1 1.2 Synonymous codon substitutions can alter the rate of translation ............................2 1.3 Co-translational folding can be influenced by codon translation rates .....................3 1.4 Codon translation rates modulate protein function, misfolding, and aggregation ....4 1.5 Translocation of proteins across cell membranes is modulated by codon translation rates .................................................................................................................................5 1.6 Other co-translational processes can also depend on codon translation rates ...........6 1.7 Several human diseases have been linked to codon translation rates ........................8 1.8 Recent approaches to modeling codon translation rate effects on co-translational protein folding and translocation .....................................................................................8 1.9 Thesis objectives .....................................................................................................11 Chapter 2: ACCURATE PREDICTION OF CELLULAR CO-TRANSLATIONAL FOLDING INDICATES PROTEINS CAN SWITCH FROM POST- TO CO- TRANSLATIONAL FOLDING .......................................................................................13 2.1 Abstract ...................................................................................................................13 2.2 Introduction .............................................................................................................13 2.3 Results .....................................................................................................................15 2.3.1 Derivation of the model ....................................................................................15 2.3.2 Constructing a fully constrained model ..........................................................18 2.3.3 Prediction of pulse-chase co-translational folding curves .............................20 2.3.4 Prediction of FactSeq co-translational folding curves ...................................20 2.3.5 Sensitivity of predictions to parameter variation ...........................................22 2.3.6 Model sensitivity to variable codon translation rates ....................................24 2.3.7 Domains can switch from post- to co-translational folding ..........................25 v 2.4 Discussion ...............................................................................................................26 2.5 Acknowledgements ..................................................................................................29 Chapter 3: STRUCTURAL ORIGINS OF FRET-OBSERVED NASCENT CHAIN COMPACTION ON THE RIBOSOME ...........................................................................30 3.1 Abstract ...................................................................................................................30

Open Nissley Dan Phd Thesis Final.Pdf

Analysis of Gene Expression Data for Gene Ontology

A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus

Proteomic Analysis of Thioredoxin-Targeted Proteins in Escherichia Coli

Supporting Information

Supplementary Tables

(Ubl-Ptms): Small Peptides with Huge Impact in Liver Fibrosis

A Systematic Analysis of Nuclear Heat Shock Protein 90 (Hsp90) Reveals A

Exploring Intrinsically Disordered Proteins in Chlamydomonas

Regulation of Human 69-Kda Choline Acetyltransferase Protein Stability and Function by Molecular Chaperones and the Ubiquitin- Proteasome System

Mrvr, a Group B Streptococcus Transcription Factor That Controls Multiple Virulence Traits

The Effect of the URM1 Pathway on Translation by Thiolation of Specific Trnas

Insights Into the Role and Mechanism of the AAA+ Adaptor Clps