Université Grenoble 1 — Joseph Fourier Sciences et Géographie
N attribué par la bibliothèque
Thèse de Tim GRÜNE Doctorat: Chimie et Sciences du Vivant — Biologie Discipline: Aspect Moleculaires et Cellulaires de la Biologie
Structural studies on ISWI, an ATP dependent nucleosome remodelling factor
Thèse dirigée par: Christoph W. MÜLLER
Laboratoire Européenne de Biologie Moleculaire, Grenoble
Soutenance publique le 3 Octobre 2003
Jury:
Christoph W. MÜLLER EMBL Grenoble Directeur de thèse Elena CONTI EMBL Heidelberg Rapportrice Félix REY CNRS Gif sur Yvette Rapporteur Hans GEISELMANN UJF Grenoble Président du Jury Saadi KHOCHBIN UJF Grenoble membre du Jury 2 3
Abstract
The Imitation Switch protein, or ISWI, from D. melanogaster is an essential enzyme that uses the energy from ATP hydrolysis in order to rearrange nucleosomes in chromatin. It plays an important role in gene expression because access to DNA and especially to promoter sites is altered by nucleosome positioning. ISWI can thus act both as an enhancer and repressor of transcription. In all eukaryotes one can find a large number of complexes involved in chromatin remodelling. Despite the diversity and variety of functioning, all these complexes contain an ATPase with a homologous so called SNF2 domain that is conserved through all eukaryotes. Four groups of remodelling ATPases can be distinguished, SNF2, SNF2L, CHD1, and INO80 of which only the first three have been further characterised according to conserved domains they contain besides the SNF2 domain. For more than 15 years these complexes have been known and a large pool of data is available to characterise a process that, together with covalent histone modifications, alters the chromatin structure and has important influence on processes like transcription, DNA repair, and replication. But to date detailed information that might shed light on the mechanism of how remodelling is carried out has been missing and only coarse hypotheses have been proposed. The work summarised in this document began in January 2000 with the aim to find structural information about ISWI. The starting point was a clone and a protocol that allowed to produce the enzyme recombinantly, but the path was certainly not straight. A manifold of attempts was undertaken, many of which are not described here. Acf1 and ISWI build one of the complexes that can be found in vivo. The possibility of ISWI being stabilised by Acf1 was considered, but neither could Acf1 expression be detected in bacteria nor could the complex be produced in insect cells in sufficient amounts for structural studies. Binding to substrates like the N terminal tail of histone H4, ATP and non hydrolisable analogues of ATP, or cruciform DNA was investigated but did not produce useful results. Given that ATPases often undergo large conformational changes, it was not unexpected that crystallisation trials with the full length protein (Mr = 120 kDa) failed. One of the first experiments was therefore to find subdomains of the protein by restricted proteolysis. It showed that the enzyme consists of two flexibly linked main parts, the N terminus that makes two thirds of the protein, and the C terminus, about one third. These two parts were subcloned, but only the C terminal part proved to be stable and gave crystals. Many N terminal clones could be purified but were difficult to concentrate, even worse than the full length protein. Crystals of the C terminal fragment of ISWI were obtained in July 2001. Since then, most of the effort went into improving the crystals‘ data collection, but the structure could only be solved with a new crystal form that suddenly appeared in May 2002. The introduction of this thesis provides a short overview on chromatin remodelling complexes but emphasises the ISWI family. Chromatin remodelling covers a wide network of intertwined actions that include histone tail modifications, transcription, silencing and the condensation and decondensation of chromatin. No attempt was made to fully cover the literature. The in vitro assays used to characterise the functioning and interaction with substrate are explained in a general way and only the results specific to ISWI are described in further detail. The part following the introduction is dedicated to the techniques that were used to gain insight into the struc ture of ISWI. These obviously include protein crystallisation and crystallography (with emphasis on the phase problem), but also a short section about circular dichroism. The main result of this work is the crystal structure of ISWI [691:991] to 1.9 Å resolution. The elongated structure consists of three domains which are described separately in detail. The very N terminal domain presents a new fold and has been named the “Hand” domain. It is in direct contact with the following SANT domain. The last domain, SLIDE, is separated from the rest of the molecule by a straight 50 Å long spacer helix. The fragment has an asymmetric charge distribution of acidic and basic residues between the N terminal domains and the C terminal domain. Most of the molecule has a negative charged distribution on the surface, but especially the SLIDE domain contains some positive patches. The following analysis showed that this patch is probably the main area where the C terminal part of ISWI contacts the DNA of the nucleosome by a classical helix turn helix motif. The water, glucose, and glycerol molecules, that were found in the structure, are described in detail because they are important for crystal contacts. The description includes a beautiful composition with a water molecule sitting right on one of the symmetry axes that forms a pentagonal water ring with two more water molecules from the asymmetric unit and their symmetry mates. Most of the water molecules of the structure concentrate at the C terminus of the protein where they build part of the interface between two protein molecules. Functional interpretation of the structure was based on the homology of both the SANT and the SLIDE domain with DNA binding proteins like the oncogene product c Myb and the homeodomain protein Pax 6. The SLIDE domain is only distantly related to SANT domains which had been proposed to be DNA binding. However, we now found strong evidence that in fact the more remotely related SLIDE domain contacts the DNA directly while the helix of the SANT domain that, according to the structural homology, should be in contact with the DNA is too negatively charged to bind DNA. The thesis finishes with a suggestion of how the C terminal fragment of ISWI might bind the nucleosome and hence act as a substrate recognition module for full length ISWI.
Note for reading Like during this introduction, I sometimes refer to the names of the domains in the structure of the C terminal fragment of ISWI. In order to better understand the text I recommend the reader to first look at figure 13.1 in order to get an overview of the model and the domain names and their borders. Furthermore, 4
I sometimes use the term ISWI C. With this term I refer to any of the C terminal clones of ISWI I prepared during this work where a distinction is not necessary. These are mainly the ones that crystallised: ISWI [691:991], ISWI [701:991] and ISWI [713:991]. 5
Résumé1
La protéine Imitation Switch de Drosophila melanogaster est une enzyme essentielle qui utilise l’énergie de l’hydrolyse de l’adénosine triphosphate pour réarranger des nucléosomes dans la chromatine. Elle joue un rôle important dans l’expression des gènes parce que l’accès à l’ADN et aux sites promoteurs est modifié par le po sitionnement des nucléosomes. ISWI peut donc agir comme activateur mais aussi comme répresseur de la trans cription. Dans tous les Eucaryotes, on peut trouver un grand nombre de complexes multiprotéiques impliqués dans le réarrangement de la chromatine. En dépit de leur diversité et de leur varieté de fonctionnement, tous ces com plexes contiennent une ATPase présentant un domaine homologue très conservé nommé SNF2. Une caractérisation plus approfondie de ces ATPases impliquées dans le remodelage de la chromatine a permis de distinguer quatre groups : SNF2, SNF2L, CHD1 et INO80. En quinze ans, depuis la découverte de ces complexes, un grand nombre d’informations ont été collectées pour caractériser le processus de remodelage de la chromatine qui, avec les mo difications covalentes des histones, affecte la structure de la chromatine et a donc une influence importante sur des processus comme la réplication, la réparation de l’ADN et la transcription. Mais jusqu’ici le manque d’informa tions détaillées qui pourraient éclairer le mécanisme de remodelage du nucléosome a seulement conduit à proposer des hypothèses. Le travail récapitulé dans ce document a commencé en janvier 2000 avec le but de l’obtention d’informa tions structurales sur ISWI. Le point de départ était un clone et un protocole permettant de produire l’enzyme ISWI recombinante chez E.coli. Mais le chemin conduisant à des résultats a été ramifié : une multitude d’expé riences différentes a été effectuées, dont beaucoup ne sont pas décrites ici. Par exemple, ISWI établit avec Acf1 un complexe qui peut être trouvé in vivo. La possibilité de stabiliser ISWI par association à Acf1 a été conside rée. Malheureusement aucune expression d’Acf1 n’a été détectée dans les bactéries et les quantités de complexe produites en cellules d’insecte étaient insuffisantes pour des études structurles. L’association d’ISWI avec des sub strats dont l’extrémité N terminale de l’histone H4, l’ATP, des analogues non hydrolysables de l’ATP ou de l’ADN cruciforme a été étudiée mais n’a pas produit de résultats utiles. Sachant que les ATPases subissent de grands changements de conformation durant leur activité enzymatique, il n’est pas surprenant que les essais de cristallisation avec la protéine entière (Mr = 120 kDa) aient échoué. L’une des premières expériences a donc été une protéolyse restreinte de la protéine entière afin d’identifier des sous domaines. Elle a montré que ISWI se compose de deux parties principales liées par une partie flexible : le domaine N terminal constitue deux tiers de la protéine, le domaine C terminal environ un tiers. Ces deux parties ont été subclonées mais seulement la partie C terminale s’avérait stable et résultait en cristaux. De nombreux clones de la partie N terminale ont été purifiés mais leur concentration était difficile, encore plus mauvaise que celle de la protéine entière. Des cristaux de la partie C terminale d’ISWI ont été obtenus en juillet 2001. Depuis lors, la majeurs partie de l’effort a été consacré à l’amélioration des cristaux et à la collecte de données ; la structure a pû être résolue avec une nouvelle forme de cristal soudainement apparue en mai 2002. L’introduction de cette thèse fournit une courte vue d’ensemble sur les complexes de remodelage de la chroma tine en soulignant la famille d’ISWI. Le remodelage de la chromatine couvre un large réseau d’actions entrelacées impliquant modifications des extrémités des histones, activation et répression de la transcription, condensation et décondensation de la chromatine. Aucune tentative de couvrir totalement la littérature n’a été faite. Les analyses in vitro caractérisant le fonctionnement et l’intéraction avec le substrat sont expliquées de manière générale et seuls les résultats spécifiques à ISWI sont décrits en détail. La partie suivant l’introduction est consacrée aux techniques qui ont été employées pour aboutir à la structure d’ISWI. Cela inclut bien sûr la cristallisation de la protéine et la cristallographie (avec un rapport détaillé sur le problème de phase) mais également une courte section au sujet du dichroisme circulaire. Le principale résultat de ce travail est l’obtention de la structure de ISWI [691 :991] à 1.9 Å de résolution. La structure allongée consiste en trois domaines décrits séparément en détails. Le domaine N terminal présente un nouveau repliement et a été nommé domaine HAND. Il est en contact direct avec le domaine suivant SANT. Le dernier domaine, SLIDE, est séparé du reste de la molécule par une hélice droite, longue de 50 Å. Le frag ment a une distribution asymétrique des charges des résidues acides et basiques entre les domaines N terminal et C terminal. La majeure partie de la molécule a une distribution de charges négatives sur la surfaces, mais le domaine SLIDE en particulier contient quelques parties positives. L’analyse suivante a montré que cette région est probablement la partie principale par laquelle le C terminale d’ISWI entre en contact avec l’ADN du nucléosome par un motif classique “hélice coude hélice”. Des molécules d’eau, de glucose et de glycérol trouvées dans la structure sont décrites en détail car elles sont importantes pour les contacts entre cristaux. La déscription inclut une belle composition avec une molécule d’eau se positionnant exactemant sur un des axes de symétrie et formant une boucle pentagonale avec deux autres molécules d’eau de l’unité asymétrique et de leurs compagnons de symétrie. La plupart des molcules´ d’eau de la structure sont concentrées au niveau du domaine C terminale de la protéine où ils constituent une partie de l’interface entre deux molécules de protéine. La interprétation fonctionnelle de la structure a été basée sur l’homologie des domaines SANT et SLIDE avec des protéines liant l’ADN comme le produit de l’oncogènes c Myb ou la protéine Pax 6. Le domaine SLIDE est lié au domaine SANT qui avait été proposé comme domaine se liant à l’ADN. Cependant, nous avons maintenant
1I should thank Cédric for correcting my initial translation of the abstract which resulted in a complete re write . . . 6 de forts indices montrant qu’en fait c’est le domaine SLIDE qui entre en contact avec l’ADN alors que l’hélice du domaine SANT qui devrait se lier à l’ADN est trop négativement chargée pour assurer cette fonction. La thèse se termine sur un modèle suggerant la manière dont le domaine C terminale d’ISWI pourrait se lier au nucléosome, agissant ainsi comme module d’identification du substrat par ISWI entier.
Remarque afin de faciliter la lecture : Je me refère parfois aux noms de domaines dans la structure du fragment C terminal d’ISWI. Afin de mieux comprendre le texte, je recommande au lecteur de regarder tout d’abord la figure 13.1 afin d’avoir une vision d’ensemble du modèle, des noms de domaines et de leurs limites. En outre, j’utilise parfois le terme de ISWI C qui se refère alors a n’importe quel clone du domaine C terminal d’ISWI préparé pendant ce travail et dont la distinction précise n’est pas nécessaire. Il s’agit principalement des clone suivants qui ont été cristallisé : ISWI [691 :991], ISWI [701 :991] et ISWI 713 :991]. 7
Acknowledgments
Since I often ask before I think, I am bound to forget many people in this list who helped me during my nearly four years stay at the EMBL. Yet, I want to try to make it as complete as my memory admits. Andreas for being a living encyclopaedia and patiently answering all my chemical questions. Andreas, Carlo, Raimond & Serge for the lessons in crystallography. Annie for lookin after and buying the chemicals and other things I needed. Annie, Monique for tidying all the mess I left behind in the wet lab. Christoph for guiding me well through the work of my thesis. At several decisive steps he advised me well and still allowed me a lot of freedom to develop my own ideas. Denis, Jean-Marie, and Jean-Pierre they had answers for all technical and less technical questions about broken and unbroken lab equipment L’équipe de natation du GUC et SAM et Phillippe
Fabrice, Jan & Mark for their help with cloning et al.. Fabrice strengthened my understanding of oligo design and PCR. Guy Schoehn for the time he spent at the electron microscopy with the ACF complex and ISWI (unfortunately, I could not reproduce the preparations to confirm what he saw) Jean-Pierre he had particular importance to the lab work for he built a 25 cm extension to both desk and bench without which my back would have severly suffered. Kreischi weil er als erster mich auf die Idee brachte, daß es hilfreich ist, mit Denken Probleme anzugehen.
Mark & Serge for interesting and entertaining discourses about computing and programming. Martine for all the small and big questions that arose in the lab. Raimond for introducing me to his beamline and letting me play a lot with it. He spent several hours of is spare time explaining to me and measuring (like non existing platinum signals). Stephen Curry for he is responsible for the basics of what this work is based on — I profited from his great skills of teaching crystallography and molecular biology The members of my thesis advisory committee for having listened to all of my progress reports and useful dis cussions. After my first progress report, Saadi Khochbin suggested to carry out binding studies with cruci form DNA. Mila weil sie mir zeigte, wie man mit Denken Probleme lösen kann. Und für unzähliges mehr. Sie fand auch die beiden Paare, die die “Hand” Domäne stabilisieren und die wunderschöne Figur der Wassermoleküle, die in dieser Arbeit in Bild 13.6 auf Seite 75 beschrieben ist. 8 Contents
I Introduction 15
1 Structural features of the nucleosome 17 1.1 Structure of the nucleosome core particle ...... 17
2 Chromatin remodelling complexes 21 2.1 The unifying SNF2 domain ...... 21 2.2 The three subgroups of SNF2 remodelling enzymes ...... 21 2.3 ISWI containing complexes ...... 22 2.4 In vitro characterisation of chromatin remodelling enzymes ...... 23 2.5 Substrate dependence and nucleosome sliding ...... 24 2.6 Importance of ISWI in vivo ...... 24
3 Scope of this thesis 27
II Methods — theoretical background 29
4 Circular dichroism 31 4.1 Theory ...... 31 4.2 Data evaluation — wavelength scan and melting curve ...... 31
5 Protein crystallisation 33 5.1 Crystallisation techniques ...... 33
6 Protein crystallography 35 6.1 The phase problem ...... 35 6.1.1 Molecular replacement ...... 36 6.1.2 Experimental phasing ...... 36 6.2 Phase improvement — density modification ...... 37 6.2.1 Automatic model building ...... 38 6.3 TLS refinement ...... 38
III Materials and Methods 39
7 Purification and crystallisation 41 7.1 Subcloning ...... 41 7.2 Expression and purification of recombinant proteins in E. coli ...... 41 7.2.1 Protein expression ...... 41 7.2.2 Production of seleno methionine substituted protein ...... 42 7.2.3 Purification of 6xHis tagged Proteins ...... 42 7.2.4 Purification with the IMPACT T7 system ...... 43 7.2.5 Concentration measurement ...... 43 7.3 Crystallisation of the C terminal sublcones of ISWI ...... 43 7.4 Harvesting and freezing of crystals ...... 44
8 Additional experiments for protein characterisation 47 8.1 Bandshift assays with ISWI [691:991] and cruciform DNA ...... 47 8.2 Restricted proteolysis ...... 47 8.3 Circular dichroism ...... 48
9 10 CONTENTS
9 Data collection, processing and refinement 49 9.1 Data collection ...... 49 9.2 Data processing ...... 49 9.2.1 Generation of tagged reflections for free R calculation ...... 49 9.3 Phasing ...... 50 9.3.1 MAD location of Selenium sites ...... 50 9.4 Model building and refinement ...... 50 9.5 Other useful programs ...... 50 9.6 Superposition of the nucleosome and DNA ...... 50
IV Results 53
10 Protein characterisation 55 10.1 Restricted proteolysis ...... 55 10.1.1 Subcloning and protein expression...... 56 10.2 Circular dichroism ...... 57 10.2.1 Measurements — wavelength scans ...... 57 10.2.2 Measurements — melting curve ...... 58 10.3 Binding of ISWI [691:991] to cruciform DNA ...... 59
11 Crystallogenesis of ISWI [691:991] 61 11.1 Hexagonal space group ...... 61 11.2 Monoclinic space group ...... 61 11.3 Production of crystals for phasing ...... 62 11.3.1 Hexagonal space group ...... 62 11.3.2 Monoclinic space group ...... 63
12 Data collection and processing 65 12.1 Data statistics ...... 65 12.2 Low resolution data ...... 66 12.3 Density modification and automated building — resolve ...... 66 12.4 Molecular replacement with data from the hexagonal crystal form ...... 68
13 Description of the structure of ISWI-C 69 13.1 Overall structure ...... 69 13.2 “Hand” domain — a new fold ...... 71 13.3 SANT domain ...... 71 13.4 SLIDE domain ...... 72 13.5 Solvent molecules in the structure ...... 72
14 Comparison with known structures 77 14.1 Interpretation of structural homology ...... 79 14.1.1 Consequences for nucleosome recognition by ISWI ...... 80
15 Discussion and perspective 85
V Appendix 87
A Secondary structure prediction of full-length ISWI 89
B List of clones 91
C Calculating the slope of CD data 93
D Article 97 List of Tables
2.1 Subfamilies of SNF2 ...... 21
9.1 Settings for collection of data sets ...... 49
10.1 Secondary structure prediction for ISWI [691:991] from CD spectra ...... 58
11.1 Tests with heavy metal derivatives on the rotating anode ...... 63
12.1 Data sets statistics ...... 65 12.2 Summary of solve refinement ...... 66
14.1 Results of DALI search for structural homologues ...... 77
A.1 Domain prediction by SMART ...... 90
B.1 N terminal subclones of ISWI (pProEx Htb) ...... 91 B.2 C terminal subclones of ISWI (pProEx Htb) ...... 92
11 12 LIST OF TABLES List of Figures
1.1 Model of the nucleosome core particle ...... 18
4.1 CD spectra of poly L Lysine in three conformations ...... 32
5.1 Crystallisation methods — vapour diffusion ...... 33 5.2 Crystallisation methods — liquid phase diffusion ...... 34
6.1 Experimental phasing ...... 37
7.1 Purification effect of second Ni column ...... 43
10.1 Domain composition of ISWI ...... 55 10.2 Trypsin digestion of full length ISWI ...... 56 10.3 Secondary structure prediction of ISWI[691:991] ...... 57 10.4 Temperature dependent expression of ISWI C ...... 58 10.5 Circular dichroism — ISWI [691:991] and ISWI ...... 58 10.6 Melting curve and first derivative of ISWI [691:991] ...... 59 10.7 EMSA of ISWI [691:991] with Holliday junction DNA ...... 60
11.1 Crystal growth by reversed salting in. Phase diagram and example pictures...... 62 11.2 Examples for micro–seeding ...... 63
12.1 Sample diffraction images of high and low resolution pass ...... 67 12.2 Comparison of completeness for high and low resolution pass ...... 68
13.1 Structure of ISWI [691:991] ...... 70 13.2 Fold and stabilisation of the “Hand” domain ...... 71 13.3 Interaction interface between “Hand” and SANT domain ...... 72 13.4 The SLIDE domain ...... 73 13.5 Binding of the glucose molecule ...... 73 13.6 A special water configuration at the interface between two molecules ...... 75 13.7 Location of the special water molecule within the structure ...... 75
14.1 Superposition of SANT and SLIDE domains with their closest structural neighbours ...... 78 14.2 Residues of SANT and SLIDE contacting DNA ...... 79 14.3 Binding possibilities of ISWI [691:991] at the nucleosome ...... 80 14.4 Two possibilities how ISWI [691:991] contacts the tails of H3 and H4 ...... 82 14.5 Basic residues on the spacer helix support the model of how ISWI [691:991] contacts the DNA . . 83
C.1 Effect of window size in perl script to determine the inflection point of melting curve data . . . 95
13 14 LIST OF FIGURES Part I
Introduction
15
Chapter 1
Transcription regulation — structural features of the nucleosome
Eukaryotic organisms maintain their genome as chromatin, a dynamic assembly of DNA, RNA, and proteins. Its diversity and dynamics — people speak of chromatin fluidity — allows for high control over nuclear processes such as transcription, replication and repair, that are required to maintain the cell’s viability. Chromatin has various faces. During cell division, it condenses to what is called heterochromatin and chromosomes can be distinguished under a light microscope; condensation into the highly compacted heterochromatin makes the DNA mostly in accessible and suppresses transcription, a process called silencing (Voet and Voet, 1995; Kornberg and Lorch, 1999). During the remaining time of the cell cycle, however, the chromosomes cannot be distinguished any more; chromatin appears as a mixture of heterochromatin and the loosely packed euchromatin. At this stage, DNA can be accessed by a vast machinery present in the nucleus. Transcription, for example, involves more than hundred proteins that regulate proper functioning of RNA polymerases, which themselves are complexes in the mega Dal ton range. These proteins need access to the DNA, notably the promoter sites in order to initiate and carry out transcription. The fact that DNA can be compacted into higher order structures finally leading to the chromosomes (as shown e.g. by electron microscopy (Voet and Voet, 1995)) indicates that the genome is not stored as bare DNA but by the help of repetitive elements that direct folding and unfolding. There are several degrees of compaction between free DNA and chromatin; first comes the nucleosome, a protein octamer built up of four different histones, two each, with the double helix wrapped around. With the aid of linker histones and trans acting proteins, nucleosomes form nucleosome fibres of 10–30 nm diameter (depending on the ionic strength). These are assembled into even higher order structures that contain additional proteins like HP1 (heterochromatin protein 1) and SIR proteins (Hayes and Hansen, 2001). The existence of the nucleosome was first suggested by R. Kornberg in 1974 based on experimental results available at the time (Kornberg and Thomas, 1974; Kornberg and Lorch, 1999; Voet and Voet, 1995). Amongst all chromosomal proteins, five are the most abundant ones, with a mass level comparable to that of the DNA. These proteins have been named histones H1, H2A, H2B, H3, and H4. Histone H1 is present at only half the mass of each of the other four histones. It seems to be important for inter nucleosomal interaction. The nucleosome itself consists of a stretch of double stranded DNA wrapped around a histone octamer consisting of the (H3 H4)2 tetramer flanked by two H2A H2B dimers. These four histones that build the histone octamer are amongst the most conserved proteins, especially the histones H3 and H4 (Kornberg and Lorch, 1999). Micrococcal nuclease (which cuts free double stranded DNA) digestion of chromatin showed that 146 bp of DNA are wrapped around the histone octamer, building the nucleosome core particle; the sequence linking two nucleosomes varies about a mean of 50 bp.
1.1 Structure of the nucleosome core particle
An important contribution to the understanding of the nucleosome was the X ray structure at 2.8 Å resolution (Luger et al., 1997), now amended by several other structures at higher resolution, e.g. (Davey et al., 2002). The structure revealed the shape of a flat cylinder with a diameter of 100 Å and a height of 60 Å. Many hydrogen bonds and salt bridges result in strong interactions between the proteins and the phosphate backbones but also make the DNA deviate from its canonical straight double helical form and make it turn 1 3/4 times around the histone octamer. These contacts render the nucleosome a rather stable complex at physiological conditions. Its globular structure minimises the surface and protects the DNA; therefore, the histone tails are important signals to the “outside world”, see Figure 1.1. With up to 40 residues they make up to one third of the histone mass and reach far from the nucleosome globule. The distance of the outermost histone tail can be at least 45 Å from the DNA surface, which is nearly half the diameter of the nucleosome. Even though the contacts between proteins and DNA are not sequence specific, the curvature of the DNA does depend on its sequence. This influences the position of
17 18 CHAPTER 1. STRUCTURAL FEATURES OF THE NUCLEOSOME
(a) view of half the nucleosome perpendicular to the dyad (b) view along the dyad axis, rotated by 90◦ around the axis x-axis
FIGURE 1.1: Model of the nucleosome core particle at 1.9 Å resolution (PDB-code 1kx5; atoms with zero occupancy removed). Figure 1.1(a) shows only one copy of each histone and half the DNA for better clarity. The “handshake” motif as dimerisation interface between the H3–H4 (blue–green) and H2A–H2B (red–yellow) respectively can be seen well. The side view of Figure 1.1(b) illustrates the compactness of the nucleosome. The nucleosome occludes major parts of the DNA double helix. the nucleosome and they appear to be close to promoters, regulatory elements, or other special sites (Kornberg and Lorch, 1999). Protection of the DNA and providing an important cornerstone in chromatin compaction are rather passive roles of the nucleosome. It is also —directly and indirectly — involved in transcription regulation. Directly because, as mentioned above, the nucleosome can hide promoter sites and thus inhibit expression of a particular gene. Indirectly because of the histone tails that function as signals and binding anchors for many factors that further direct the steps to be carried out. There are two basic and important processes of nucleosome modifications in the foreground of discussion: 1. Covalent histone tail modifications. As mentioned above, the histone N termini are very long and flexible, reaching out from the nucleosome core. Their residues are subject to various modifications that regulate several processes, and many DNA dependent pathways depend on the state of modifications of the histone tails (Strahl and Allis, 2000; Iizuka and Smith, 2003, for reviews). The following ones seem especially important: (De-)Acetylation of lysines mostly on the amino terminal ends of H3 and H4 are associated with transcrip tion activation. Histone acetyltransferases (HAT’s) and histone deacetylases (HDAC’s) carry out these modifications. Acetylation neutralises the charge of the lysine and thereby alters its binding behaviour to other proteins. Phosphorylation of serine 10 of histone H3 is often (but not always) associated with chromatin condensa tion; however, other sites in other histones including H1 can also be phosphorylated. Methylation of lysines or arginines, mostly of histones H3 and H4; mono , di and tri methylation can be observed. Methylation of histone tails is related to DNA methylation and and both events are usually associated with transcription repression even though the opposite has been reported, too (Bernstein et al., 2002). 2. ATP dependent nucleosome remodelling. A large number of complexes is capable of changing the positions of nucleosomes on DNA. Important for these complexes is a central ATP dependent subunit. These subunits share a common ATPase domain with strong homology across all eukaryotes that classifies them as members of the SNF2 family of helicases. This subgroup of the DEAD/H helicases is unified by a stretch of several hundred amino acids that contains characteristic, highly conserved motifs. Remodelling is important both for repression and activation of transcription. Histone tail modifications and nucle osome remodelling are not independent processes but often occur together. Remodelling that suppresses transcrip 1.1. STRUCTURE OF THE NUCLEOSOME CORE PARTICLE 19 tion has been observed to be accompanied by methylation and de acetylation, and by acetylation when it enhances transcription (Santoro et al., 2002; Tariq et al., 2003). 20 CHAPTER 1. STRUCTURAL FEATURES OF THE NUCLEOSOME Chapter 2
Chromatin remodelling complexes
2.1 The unifying SNF2 domain
The first chromatin remodelling activity was found in yeast with the SWI/SNF complex (mating type switch / sugar non fermenting). This 2 MDa complex contains about eleven subunits including the ATP dependent enzyme SWI2/SNF2. Homologs of SWI2/SNF2 were soon found in many eukaryotes. Database searches in the Drosophila melanogaster genome revealed Brahma as close relative and, more distantly related, ISWI, imitation switch.A comprehensive work classified a large range of these proteins and found a central SNF2 domain characterised by several conserved motifs. Seven of these motifs make them belong to the DEAD/H class of helicases (Eisen et al., 1995; Bork and Koonin, 1993). The second of these motifs is the well known Walker motif A, GXnGK[TS], that is used by many proteins for magnesium mediated ATP binding. The authors’ phylogenetic analyses divided the SNF2 family into 15 subfamilies, but only three of them are chromatin remodelling enzymes, SNF2, SNF2L (now often named ISWI), and CHD1 (also called Mi 2). This subdivision is based on differences in the sequences adjacent to the helicase domain which will be described in the following section. One difference between these groups is the size of the complexes they form. The first one, SNF2 builds the largest complexes with more than ten subunits. The SWI2L / ISWI like proteins are found in the smallest complexes with two to four subunits. The sizes of Mi 2 containing complexes lie between these ones. To date no eukaryote is known that does not contain a member of the family spanned by these three chromatin remodelling enzymes. There is a paralogue of SNF2/SWI2 that does not follow this scheme and cannot be classified in one of the three aforementioned groups. Ino80 (ORF YGL150C) from yeast, a 170 kDa protein, shares the homology of the ATPase domain (expectation value for the SNF2_N domain (PFAM) = 10−109), but neither PFAM nor SMART indicate any known domains other than that. It is member of a 12 unit complex and seems to be directly involved in DNA repair since deletion mutants are more sensitive to UV and radioactive radiation than wild type cells (Shen et al., 2000; Ebbert et al., 1999; Steger et al., 2003).
2.2 The three subgroups of SNF2 remodelling enzymes
Currently one distinguishes three classes of chromatin remodelling enzymes: SNF2, SNF2L or ISWI, and CHD1 or Mi 2. Databases like PFAM (Bateman et al., 2002) and SMART (Schultz et al., 1998) split the SNF2 domain into two sub domains, named DEXDc and HELICc in the case of SMART, SNF2_N and helicase_C for PFAM. HELICc and helicase_C differ only slightly in their consensus, DEXDc is a more general definition than SNF2_N. The first family, SNF2, contains a bromodomain C terminal to the SNF2 domain, the second one, SNF2L / ISWI, a SANT domain, and the third group, CHD1 / Mi 2, an N terminal chromo domain, often accompanied by a PHD finger. The domain substructure is sketched in Table 2.1.
SNF2 SNF2L / ISWI CHD1 / Mi-2
DEXDc and HELICc encompass the ATPase domain common to all three groups
C-terminal bromodomain C-terminal SANT domain N-terminal chromo domain 10–20 subunits / complex 2–4 subunits / complex 5–8 subunits / complex nucleosome transfer in cis and trans nucleosome transfer in cis acetyltransferase activity
TABLE 2.1: Subfamilies of SNF2 are distinguished according to the domain structure adjacent to the SNF2-domain (here split into DEXDc and HELICc). SNF2-like proteins have a C-terminal bromodomain, SNF2L-like proteins a C-terminal SANT domain, and CHD1-like proteins an N-terminal chromo domain that is generally preceded by a PHD finger.
21 22 CHAPTER 2. CHROMATIN REMODELLING COMPLEXES
The bromodomain of the SNF2 group binds histone tails and is associated with transcription activation. The name is derived from brahma and the analogy to chromo domains. It can be found not only in SNF2 like pro teins but also in acetyltransferases. The bromodomain specifically binds acetylated histone tails. Details of the interaction with residues 16–19 from histone H4 were demonstrated by the crystal structure of the 110 residues bromodomain of Gcn5p complexed with residues 15–29 of histone H4 acetylated at Lys16 (Owen et al., 2000). However, as in the case of the chromo domain, homology must not be conferred to function without great caution, and bromodomains in different proteins may exhibit different functions.
The chromo domain of the CHD1 group occurs also in HP1, a protein associated with heterochromatin, where the domain binds methylated H3 tails. This is not the case for Mi 2, and it has been suggested that the chromo domain in Mi 2 interacts with DNA rather than histones (Bouazoune et al., 2002).
The SANT domain is part of the fragment presented in this work and will be described in greater detail. The name is derived from the four proteins that initially defined it, Swi3 (ISWI family), the Ada2 (a subunit of the acetyltransferase complex SAGA), the co repressor N CoR, and the general transcription factor TFIIIB. All these proteins interact with nucleosomal DNA. Because of their strong homology to the DNA binding repeats of the oncogene product c Myb, the SANT domain has been proposed to also bind DNA (Aasland et al., 1996). In the known structure of c Myb, the three homologous repeats each consist of a globular three helix bundle. DNA is bound through the helix turn helix motif of the second and third helix. Many proteins make use of the helix turn helix motif to bind the major groove of double stranded DNA. Recent results indicate, that SANT domains are very important as a histone tail presenting module. The effect of mutations in the SANT domains of several members of yeast chromatin modifying complexes were inves tigated (Boyer et al., 2002): Swi3 is one of 17 subunits of the SWI/SNF complex; Ada2 is present in several HAT complexes containing the enzyme Gcn5, including the SAGA complex; Rsc8 belongs to RSC whose ATPase subunit belongs to the SNF2 family, and so does the SWI/SNF complex. In this study all deletions seriously im peded the viability of the yeast cells. Importantly, the SANT domain of Ada2 has a large impact on the interaction of the SAGA complex with the H3 tail. Pull down assays against GST H3 tails with the intact SAGA complex kept both Gcn5 and Ada2 attached to the glutathionine beads. A deletion mutant of Ada2 where ten residues were removed from the putative third helix of the SANT domain transferred a major part of the complex to the unbound fraction. The authors also point out that they could not detect any in vivo influence of three independent point mutations of residues in Swi3. The corresponding residues in Myb on the other hand play critical roles for its DNA binding. A second study (Sterner et al., 2002) tested the effects of various mutations of the SANT domain in Ada2 on the activity of the SAGA complex and they found that the second half of the SANT domain is important for binding to Gcn5. The corresponding part in Myb is in contact with DNA and therefore less likely to be involved in protein protein contacts. Our analysis of the C terminus of ISWI and binding studies confirm that the ISWI SANT domain is probably not involved in direct DNA binding (Grüne et al., 2003).
2.3 ISWI containing complexes
ISWI was found in Drosophila melanogaster in the following complexes:
1. ACF, the ATP utilising chromatin assembly and remodelling factor, or CHRAC, the chromatin accessibility complex. From current assays, ACF and CHRAC cannot be distinguished. They both contain ISWI and Acf1, 270 kDa. CHRAC was purified with two additional peptides, Chrac 14 and Chrac 16 (14 and 16 kDa respectively), but apart from their composition no difference between CHRAC and ACF has yet been re ported (Corona et al., 2000; Ito et al., 1999).
2. NURF, the nucleosome remodelling factor. Apart from ISWI, NURF contains Nurf 301, which is similar, but not identical, to Acf1; Nurf 55, also found in the chromatin accessibility complex CAF 1; and Nurf 38, an inorganic pyrophosphatase.
Following the literature, defining these complexes has not been easy. They were identified and purified on the basis of different biochemical assays. NURF was found to be required for chromatin remodelling that is induced by the GAGA factor (Tsukiyama and Wu, 1995). CHRAC was identified by its ability to mobilise nucleosomes in a manner that allows enhanced access of restriction enzymes to DNA packed into chromatin. ACF was identified by a NAP I assisted chromatin assembly assay (Kadonaga, 1998). Acf1, a 270 kDa protein, was initially reported not to be part of CHRAC while topoisomerase II was co purified with CHRAC. This is probably why most people still refer to ACF and CHRAC as distinct complexes even though they cannot be distinguished based on there activities. NURF, however, is clearly distinct. As discussed in Section 2.6, depletion of NURF301 is not recovered 2.4. IN VITRO CHARACTERISATION OF CHROMATIN REMODELLING ENZYMES 23 by CHRAC/ACF, and unlike NURF, which renders nucleosomes stochastically distributed on a multi nucleosomal array, CHRAC/ACF have the opposite effect and cause regular spacing. Considering the complexity of interactions involved in nuclear processes, it might not always be possible to define a complex as an isolated stable entity. Different stages of functioning may require different manifestations. Studies by Memedula and Belmont (Memedula and Belmont, 2003) for examples provide evidence that BAF 155 and BAF 170, both members of SWI/SNF like complexes in yeast, are recruited more than an hour after Brg1 or Brm, the active enzymes of those complexes. Something similar might be true for Chrac 14 and Chrac 16, and that these two proteins are only required for some but not all modes of function of ACF/CHRAC. In that case, ACF and CHRAC would of cause have to be referred to as two distinct complexes.
2.4 In vitro characterisation of chromatin remodelling enzymes
ATP hydrolysis and remodelling activity can be monitored separately in vitro. Hydrolysis is often measured by dissociation of γ32P from ATP. Sequencing gel electrophoresis after enzymatic digestion of multi nucleosomal arrays can be used to check remodelling activity. The protection of DNA by the nucleosome alters the digestion pattern by DNase I or MNase (micrococcal nuclease) that can be observed. Electrophoretic mobility shift assays (EMSA) are used to detect interaction between substrate and complexes. These techniques have widely been used to characterise the function of chromatin remodelling proteins and complexes (Corona et al., 1999; Clapier et al., 2001; Havas et al., 2000; Whitehouse et al., 2003). In vivo all chro matin remodelling factors act as complexes, for ISWI it has been shown that it remodels chromatin without the aid of co factors. The rate of hydrolysis is substrate dependent. ISWI by itself as well as ISWI containing complexes require nucleosomal substrate with an intact H4 tail and an overhang of DNA extending from the nucleosome core for full activity. The activity of ISWI is further stimulated in the presence of Acf1. The hydrolysis of other com plexes is already stimulated by free DNA (SWI/SNF) or the nucleosome core particle without histone tails (Mi 2). The following list summarises the most frequently used assays for remodelling complexes and enzymes (Kingston and Narlikar, 1999).
ATP hydrolysis can be monitored qualitatively by thin layer chromatography by using radioactively labelled ATP and quantitatively with a scintillation counter or a phosphor imager. In the case of ISWI, unfortunately, the rate of hydrolysis is very low and impedes kinematic studies (Whitehouse et al., 2003).
Nucleosome assembly and remodelling by NAP 1 supported by a chromatin remodelling enzyme results in the protection of DNA as shown by agarose gel electrophoresis upon restricted digestion by MNase. MNase cannot cut DNA protected by the histone core. Hence changes in the protection pattern can be used to show nucleosome movement. For example a multi nucleosomal array with randomly placed nucleosomes will result in a smear whereas an evenly spaced array produces distinct bands as the enzyme cleaves only at the locations that are unoccupied by the nucleosome. DNase prefers sites where the minor groove faces away from the nucleosome. This allows for higher resolution mapping then the MNase assay.
Transcription enhancement. This can be measured with labelled nucleotides that are being incorporated into the transcribed RNA. NURF, for example, enhances binding of the transcription factor GAGA and hence the amount of RNA produced during the assay.
Nucleosome sliding. Mono nucleosomes with overhanging DNA, i.e., more than 150 bp, can be separated by EMSA depending on the location of the histone octamer. If they are at the edge of the DNA they migrate faster than if they are at a central position.
Assays testing these functions were originally used to discover many remodelling complexes. An important result towards understanding the remodelling mechanism was the finding that ISWI works without support of other complex subunits (Corona et al., 1999). ISWI hydrolysis ATP in the presence of nucleosomes, remodels nucleosomal arrays and facilitates chromatin assembly. Unfortunately, quantification and comparison between different experiments, especially with respect to the ISWI group, is impeded by the low activity and difficult reproducibility (Martens and Winston, 2003; Whitehouse et al., 2003, and private communication J. Brzeski). The following observations illustrate the influence of the other subunits within the complexes. ISWI and CHRAC/ACF render a nucleosomal array evenly spaced while NURF causes a stochastic distribution (the same is true for the SWI/SNF complex with SNF2 from the SNF2 subgroup). On the other hand, ISWI incubated with a nucleosome with an overhang of DNA makes it move to the edge while CHRAC/ACF centres the nucleosome. Only the large complexes RSC or SWI/SNF from the SNF2 group have been reported to displace nucleosomes in trans (Panigrahi et al., 2003; Längst et al., 1999). 24 CHAPTER 2. CHROMATIN REMODELLING COMPLEXES
2.5 Substrate dependence and nucleosome sliding
ISWI by itself hydrolyses ATP only weakly. Critical for its full activity is only the H4 N terminal tail and some linker DNA extending beyond the nucleosome core. The activity is further enhanced when ISWI acts in complex with Acf1. Nucleosomes from recombinant histones with all but the histone H4 tail removed do not reduce the activity of ISWI while it becomes inactive with nucleosomes containing all but the H4 N terminal tail. Further analysis assigned an important role to a small epitope in the region between Gly10 and Arg19: With the first ten residues removed from histone H4 (∆10 H4) ISWI maintains a normal hydrolysis level while the level is reduced to background with a ∆19 H4 mutant. As opposed to ATP hydrolysis, nucleosome sliding is nearly abolished with any of the four tails missing. It is only reduced but still clearly detectable if ISWI acts as the CHRAC complex with any of the three histone tails of H2A, H2B, or H3 removed (yet again, a missing H4 tail abolishes sliding even for CHRAC) (Brehm et al., 2000; Clapier et al., 2001; Clapier et al., 2002). Both double and single stranded DNA induce ATPase activity of ISWI but at a much lower level compared to nucleosomal substrate (Boyer et al., 2000; Corona et al., 1999; Whitehouse et al., 2003). This exhibits further differences between the subgroups of the SNF2 family: The SNF2 group can be equally stimulated by free DNA and nucleosomes, with or without tails. Mi 2 requires nucleosomes but not tails, and with nucleosome free DNA the activity of Mi 2 is reduced to about 30%. Current models of how the nucleosome sliding or remodelling process works are based on two ideas:
Twisting. Chromatin remodelling complexes and enzymes have been reported to alter super helical torsion (Havas et al., 2000). This raised the idea that the DNA could be “screwed” around the histone core. This would increase the torsional stress and the requirement for means of release, especially in multi nucleosomal arrays. Torsional stress can be released by topoisomerase I, but in vitro assays work without this enzyme.
Bulging or looping. Remodelling is not inhibited by nicked DNA which is supposed to interrupt torsion propaga tion (Aoyagi and Hayes, 2002). Therefore a mechanism purely based on twisting of the DNA can probably be excluded. A “bulging” or “looping” mechanism has been proposed instead where the remodelling com plex forms a loop of DNA that propagates around the histone octamer (Längst et al., 1999).
A mixture of both ideas is also being considered, but no detailed enough experiments are available for a more precise model. Several recent publications support the second mechanism. The loss of approximately 40 bp worth of histone DNA interactions upon remodelling by SWI/SNF has been reported (Bazett Jones et al., 1999), as well as the formation of an up to 50 bp large loop by SWI/SNF during nucleosome remodelling (Kassabov et al., 2003). Finally, the translocase abilities of ISWI were tested by the displacement of triplex DNA 0–60 bp away from an edge positioned nucleosome. Between 40 and 50 bp no more triplex displacement was detected (Whitehouse et al., 2003). This work also found the first link between proteins from the SNF2 family and the superfamily of DEAD/H helicases they belong to. For none of the chromatin remodelling enzyme strand separating activity has been reported. The triplex displacement assay first showed a 3′ 5′ preference for ISWI: the triplex was not removed with a five or ten base pair gap in the 3′ 5′ strand;→ a gap in the opposite strand had no effect and triplex removal was as strong as with no gap in the double→ stranded DNA. The authors state that strand activity is a property of DNA helicases and that many members of the DEAD/H family show 3′ 5′ specificity. → 2.6 Importance of ISWI in vivo
Many experiments have been published trying to elucidate the remodelling mechanisms and their links to other gene regulators. Only little is known about their role in vivo. Two very important, complementary contributions were published recently (Badenhorst et al., 2002; Deuring et al., 2000). In the latter publication Drosophila mutants were examined with C terminally truncated forms of ISWI: ISWI[1:800] and ISWI[1:953]. Neither could be detected in embryos1 and the mutants die in late larval or in early pupal state even though no phenotypical anomalies were found. Preliminary survival was attributed to the presence of paternal ISWI — in adults, ISWI is mainly expressed in oocytes and testicles, maybe only in order to provide a stock for their offspring. Local over expression in the eye disc of ISWIK159R, a point mutation that lacks the capability to hydrolyse ATP (Corona et al., 1999), results in heavy mutilation of the eye of the adult fly. On the molecular level, the males’ X chromosome of the mutant fly embryos was heavily deformed; an effect on autosomes was detected but more subtle. In their studies full length ISWI does not co localise significantly with the transcription activator GAGA. This seems to be in contrast to the activation enhancement of GAGA by ISWI in vitro (Tsukiyama et al., 1995) as well as the strong reduction of expression of heat shock protein 70 (hsp70), ultrabithorax (ubx), and engrailed (en), all three targets of GAGA. However, since Drosophila has more than one complex which contains ISWI as ATPase (CHRAC/ACF and NURF), it was unclear whether or not their findings could be contributed directly to the malfunctioning of ISWI.
1The authors suggest that the C terminus of ISWI is essential for its proper folding; this is supported by the fact that recombinant N terminal subclones of full length ISWI are very unstable and difficult to purify as reported later in this work. 2.6. IMPORTANCE OF ISWI IN VIVO 25
Therefore, the experiments were repeated with Nurf 301 (Badenhorst et al., 2002), the large subunit in NURF. which led to very similar results. Since the functionality of the other two ISWI containing complexes present in Drosophila should not be affected by the Nurf 301 mutation, these results highlight basic differences between the NURF complex and ACF of CHRAC. To my knowledge, similar investigations using Acf1, the common large subunit of ACF and CHRAC, are still missing. 26 CHAPTER 2. CHROMATIN REMODELLING COMPLEXES Chapter 3
Scope of this thesis
Despite the amount of information available about chromatin remodelling complexes, enzymes and their function, structural information that might help explain the remodelling mechanism has been missing. Therefore a collabo ration between the crystallography group of Christoph Müller, who is interested in DNA protein interactions, and the group of Peter Becker, whose work has contributed a lot to the understanding of chromatin remodelling and especially ISWI and CHRAC, was initiated. The goal of this work was to supply structural information about ISWI in order to help understanding the mechanism of remodelling. To crystallise one of the ISWI containing complexes to would have been too ambitious for a four years project. The results from D. Corona from P. Becker’s group (Corona et al., 1999) had shown that the remodelling activity is intrinsic to ISWI and not the fully assembled complexes. This made this enzyme by itself a particularly suitable target for the task to shed some light on the mechanism of chromatin remodelling.
27 28 CHAPTER 3. SCOPE OF THIS THESIS Part II
Methods — theoretical background
29
Chapter 4
Circular dichroism
4.1 Theory
The secondary structure elements of proteins (α helix, β strand and random coil regions) have different absorption coefficients for light with negative and positive helicity. Therefore, analysis of the change of polarisation of light can exhibit information about conformation and stability. The most general expression of a homogeneous plane electromagnetic wave is given by (Jackson, 1998)
ikx−iωt E(x t) = (ε1E1 + ε2E2)e (4.1) with ε1 2 being two linearly independent unit vectors in the plane perpendicular to the direction of propagation k and E1 and E2 two complex amplitudes. A wave is said to have positive helicity (or to be left circularly polarised in optics) if the total amplitude sweeps counterclockwise around k, negative helicity in the opposite case. Since proteins modify the state of circular polarisation, a base transformation is convenient by introducing 1 ε± = (ε1 iε2) √2 ± ikx−iωt E(x t) = (ε+E+ + ε−E−)e (4.2)
The (complex) amplitude can be separated into magnitude and phase,
iδ± E± = a±e
The four Stokes parameters are hence defined and expressed as
ε∗ E 2 ε∗ E 2 2 2 s0 = + + − = a+ + a− (4.3) ∗ ∗ s1 = 2Re (ε E) (ε−E ) = 2a+a− cos (δ− δ+) (4.4) + − ∗ ∗ s = 2Im (ε E) (ε−E) = 2a a− sin(δ− δ ) (4.5) 2 + + − + ε∗E 2 ε∗ E 2 2 2 s3 = + − = a+ a− (4.6) − − The Stokes parameters can be measured experimentally and fully determine the state of the wave after interaction with the protein solution. Because of their handedness, proteins have different and – more importantly – complex refraction indices for ε+ and ε− respectively, i.e., the two compounds of the wave are absorbed differently. This causes what is called "circular dichroism". The ellipticity of the sample is defined via the difference in absorbance ∆I for the negative and positive component of the wave, the sample concentration c, and the path length ∆ through the sample. This quantity is wavelength dependent. Using Beer’s law the ellipticity can be expressed depending only on the measurable values s0 and s3:
2 I I− 1 (s + s ) Θ(λ) = + − = log 0 3 (4.7) c ∆ c ∆ 10 (s s )2 × × 0 − 3 4.2 Data evaluation — wavelength scan and melting curve
Figure 4.1 shows the spectra for poly L lysine in three different conformations, i.e., purely α helical, purely β turns, and as random coil (Greenfield and Fasman, 1969). Ideally, the signal of any protein ought to be a linear superposition of these three spectra, weighted by the amount of each conformation present in the structure of the protein. The data could be analysed by fitting to a theoretical curve; however, to make up for possible deviations due to tertiary structure elements, recorded data are normally compared to data bases assembled from proteins
31 32 CHAPTER 4. CIRCULAR DICHROISM
8 × 104 α−lysine β−lysine coiled lysine 6 × 104
4 × 104
2 × 104 /dmol] 2
0 × 100 [°cm Θ
−2 × 104
−4 × 104
−6 × 104 190 200 210 220 230 240 250 wavelength [nm]
FIGURE 4.1: CD-spectra of three different conformations of poly-L-lysine: α-helical, β-turn, and coiled-coil (data from Greenfield and Fasman, (Greenfield and Fasman, 1969)). A general spectrum can be considered a linear superposition of these three curves. The data between 190 nm and 200 nm contain information important for evaluation. with known structure. This method requires accurate knowledge about protein concentration and it is not easy to tell a priori how accurate the prediction is. CD data from a wavelength scan may be more useful to detect conformational changes induced by ligand binding or in different solvent condition. A second, and more reliable deduction about protein properties can be drawn from a “melting curve”: The CD signal is measured at a fixed wavelength while the temperature of the sample is increased. As the protein denatures (“melts”), the signal is reduced. Normally the melting curve contains one or more inflection points. The temperature of the ith inflection i point is defined as melting temperature Tm. To maximise the signal difference, a wavelength should be chosen where the signal is extremal at the starting temperature. The melting temperature can also be affected by ligand binding; in fact an interesting experiment would have been CD measurements of ISWI with and without cruciform DNA since the fragment ISWI[691:991] and full length ISWI both weakly bind to it. Chapter 5
Protein crystallisation
5.1 Crystallisation techniques
Like salts, proteins can form crystals if their concentration in solution exceeds their solubility. The interaction between protein molecules, however, is much weaker than for small (especially ionic) molecules, and crystallisa tion is a rare event compared to disordered aggregation. Finding appropriate conditions is one of the bottlenecks towards obtaining a crystal structure.
Vapour diffusion Because of its simple set up, crystallisation trials for proteins are often carried out by the vapour diffusion method. Thereby a small volume of protein solution is brought into a closed system with a large reservoir solution. A concentration difference between the reservoir and the sample causes vapour diffusion between the two solutions until the vapour pressure in the system is at equilibrium. Hence the change of conditions in the protein solution can bring about the precipitation of the protein. Under the right conditions this happens by the formation of crystals; in most cases, however, by amorphous aggregation. In practice, 1–10 l of the purified protein is mixed with the reservoir solution which contains a precipitant at concentration cP . If the ratio is 1:1, both protein and precipitant concentration are halved upon mixing and will return to the initial concentration at equilibrium. But due to the presence of the precipitant the solubility of the protein can now be lower so that it precipitates. One can vary the mixing ratio and even have different constituents for the reservoir solution and the solution the protein is mixed with (Grüne, 1999). The two most common vapour diffusion techniques are the sitting drop and the hanging drop method. In the sitting drop method the protein precipitant mixture is placed onto a small depression or bridge on top of the reservoir. In the hanging drop method, the protein sample is prepared on a cover slip that is turned upside down before sealing the well, as illustrated in Figure 5.1.
protein bridge
reservoir
FIGURE 5.1: Crystallisation by vapour diffusion is based on equilibration via the gas phase so that only volatile compounds can interchange. The two most common methods are the sitting (left) and hanging drop (right).
Liquid phase diffusion A very different approach is the liquid phase diffusion method. Here, protein solution and reservoir are not mixed directly but separated by either a membrane or e.g. a layer of agarose so that the exchange is also diffusion driven but concerns all constituents in the two liquids that are small enough to pass through the filter, not only volatile ones. Therefore, non volatile compounds are also exchanged which can lead to very different, often improved results (Hansen et al., 2002).
Screening Whether or not crystals form depends on a large manifold of parameters and there is no or very limited a prior information about which ones to choose. The following parameters (and more) can influence crystallisation: