HIGHLIGHT www.rsc.org/molecularbiosystems | Molecular BioSystems Towards quantitative predictions in cell biology using chemical properties of proteins Michele Vendruscolo and Gian Gaetano Tartaglia*

DOI: 10.1039/b805710a

It has recently been suggested that the concentrations of proteins in the cell are tuned towards their critical values, and that the alteration of this balance often results in misfolding diseases. This concept is intriguing because the in vivo concentrations of proteins are closely regulated by complex cellular processes, while their critical concentrations are primarily determined by the chemical characters of their amino acid sequences. We discuss here how the presence of a link between the upper levels of in vivo concentrations and critical concentrations offers an opportunity to make quantitative predictions in cell biology based on the chemical properties of proteins.

Introduction: protein aggregation rates of the corresponding Solubility and chemical 7 and cell biology proteins measured in vitro (Fig. 2). properties of proteins In this highlight we explore the oppor- Cellular functions are orchestrated by tunities provided by the relationships be- The solubility of proteins can be defined complex regulatory networks involving tween the upper levels of in vivo in terms of their critical concentration, primarily nucleic acids and proteins.1,2 concentrations and the critical concentra- which is the concentration above which These same cellular functions are also, tions of proteins (Fig. 1) and between they form a condensed phase.9 The solu- however, dependent on the basic chem- mRNA expression levels and aggregation bility depends strongly on the thermo- istry of the molecules that carry them rates of the corresponding proteins (Fig. 2) dynamic conditions, including temperature, out.3–6 Therefore, the ‘‘chemical’’ and to develop strategies to make quantitative pH and ionic strength and on the compo- the ‘‘cellular’’ views of cell biology are predictions in cell biology through the sition of the solvent. Therefore the solu- complementary and closely related. A analysis of the chemical properties of pro- bility under standard in vitro conditions specific link between them emerges from teins. We first describe the advances that might differ from the solubility in the the suggestion that proteins remain so- have been recently made for predicting the highly complex cellular environment.9 luble in the cellular environment only up solubility of proteins8–12 and their propen- Moreover, as the most insoluble regions to the concentrations required for their sity for aggregation.13–20 We then explain of protein sequences are secluded from functions (Fig. 1). This notion follows how the relationship between protein ag- the solvent through the folding process, from the strong correlation that has been gregation rates and mRNA expression the solubility of proteins depends also on recently reported between expression le- levels (Fig. 2) provides a way to use the the stability of their native states.9 vels of human genes in vivo and the chemical properties of proteins to estimate Despite these complications, Wilkinson the expression levels themselves (Fig. 3), and Harrison demonstrated that the solu- and to assess the degree of neuronal dys- bility of proteins in Escherichia coli can be Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, function and degeneration in a Drosophila predicted from their chemical properties, CB2 1EW, UK. E-mail: [email protected] model of Alzheimer’s disease (Fig. 4).21 including charge, propensity for forming

Michele Vendruscolo obtained a Gian Gaetano Tartaglia ob- PhD in condensed matter phy- tained an MPhil in theoretical sics at SISSA in Trieste. He is at La Sapienza in now a reader in theoretical che- Rome. He then received his mical biology at the University PhD in biochemistry at the of Cambridge and an EMBO University of Zurich, and he is Young Investigator. His re- currently working as a postdoc- search interests are primarily toral researcher in Cambridge focused on the investigation of in the field of protein misfolding the structures and properties of and aggregation. proteins, and their relationship to biological evolution and mis- folding diseases. Michele Vendruscolo Gian Gaetano Tartaglia

1170 | Mol. BioSyst., 2008, 4, 1170–1175 This journal is c The Royal Society of Chemistry 2008 proteins whose deposition is associated with specific diseases, but is a generic property of polypeptide chains.24 In 2003, Chiti, Dobson and co-workers reported the existence of a significant correlation between the changes in the aggregation rates resulting from single amino acid substitutions and their effects on the chemical properties of poly- peptide chains:13

ln(nwt/nmut)=aDH + b(DDGcoil-a + DDGb-coil)+gDC (2) Fig. 1 Schematic illustration of the expected relationship between the in vivo concentrations and the critical concentrations of proteins under physiological conditions. The amino acid In this equation, ln(nmut/nwt) is the loga- sequences of proteins have evolved to remain soluble at the upper levels of abundance required rithm in base 10 of the ratio between nwt, 25–27 by the cell during their cycles. As random mutations tend to destabilize their native folds the aggregation rate of the wild type and increase their propensities to aggregate,28 proteins slowly drift towards the toxic region of protein, and nmut, the aggregation rate low solubility (shown in red). Evolutionary selection prevents them from entering this region and of its mutational variant. DH, DDG stops the drift at the boundary between the toxic and the non-toxic regions (the latter shown in coil-a + DDG , and DC represent the green). This boundary is rather sharp (yellow band) and therefore causes the high correlation b-coil between aggregation rates and expression levels reported in Fig. 2. change in hydrophobicity, secondary structure propensity and charge upon mutation, respectively. Eqn (2) repro- duces to a remarkable extent the changes in the aggregation rates observed experi- mentally in vitro for single amino acid substitutions for series of peptide and proteins, including those associated with disease.13 This initial observation was then de- veloped into an approach in which en- vironmental conditions are also taken into account in order to predict the over- all aggregation propensities of peptides and proteins from the knowledge of their amino acid sequences and of the condi- tions used experimentally to monitor the aggregation process, including the pH, Fig. 2 Relationship between mRNA expression levels and aggregation rates for 11 human the ionic strength and the protein con- proteins for which the latter have been measured in vitro under near-physiological conditions.7 centration of the solution in which ag- The coefficient of correlation between these two quantities is 0.97. gregation occurs.16 The parameters corresponding to all these terms were turns, hydrophilicity and length of the dataset of 81 proteins, the simple mathe- estimated by fitting the elongation rates sequence.8 The degree of solubility of a matical model given by eqn (1) was pro- of a dataset of proteins compiled protein, cv, was estimated as: posed for the identification of soluble through an extensive literature search.16  eukaryotic targets in E. coli.22 After The same approach was also shown to N þ G þ P þ S cv ¼ a this initial work, several other studies have lead to accurate predictions of the n increased our understanding of the regions of the sequence of a peptide or R þ K D E þ b þ g ð1Þ relationship between the chemical proper- a protein that are most important n ties of amino acid sequences and their in determining the aggregation solubility.10–12 process.16,17,19,20 In this equation, n is the number of amino acids in the protein, a and b are two parameters describing, respectively, the Aggregation and chemical A link between protein electrostatic charge and the propensity to properties of proteins chemistry and cell biology form turns, and g is a constant; further- more N, G, P,andS are the number of It is well known that proteins have a By considering together the results Asn, Gly, Pro, and Ser residues and R, K, strong tendency to aggregate into inso- described in the previous two sections, D,andE the number of Arg, Lys, Asp, luble assemblies,23 and that the ability to we observe that the same chemical para- and Glu residues, respectively. Using a form aggregates is not restricted to the meters used for predicting the solubility

This journal is c The Royal Society of Chemistry 2008 Mol. BioSyst., 2008, 4, 1170–1175 | 1171 between aggregation rates measured in vitro and concentrations detected in vivo. By investigating the nature of this link, we have reported a strong relationship between in vivo mRNA expression levels and in vitro protein aggregation rates7 (Fig. 2). We suggested that this correla- tion is the consequence of an evolution- ary pressure acting to decrease the risk of aggregation in the cell.7 This evolution- ary pressure is exerted since failure of proteins to fold correctly can give rise to cellular malfunctions and diseases. Ran- dom amino acid mutations have a strong tendency to destabilize the native fold of a protein25–27 and enhance the propen- Fig. 3 Relationship between gene expression levels and the chemical properties of the sity to aggregate.28 Since protein concen- corresponding protein sequences. High expression levels are associated with protein sequences trations and mRNA expression levels are with large numbers of polar (D, E, N, R, Q, and H) and hydrophilic (T, K, C, G, A, and R) correlated in E. coli,29 the relationship amino acids. Low expression levels are observed for protein sequences with large numbers of that we identified indicates a close corre- amino acids with high hydrophobicity (F, M, I, L, V, Y, and P), and high b-sheet propensity lation between concentrations and solu- (V, I, F, W, L, Y, and T). These results were obtained by sorting the expression levels of about bility of proteins (Fig. 1). 2000 E. coli genes (http://redpoll.pharmacy.ualberta.ca/CCDB/) and by selecting the 50 top- Since the aggregation propensities of ranked and 50 bottom-ranked genes. The horizontal line represents the average value for the two datasets taken together. Chemical propensities are defined following a consensus of experimental proteins can be predicted from their che- 8–20 scales (http://expasy.org/tools/protscale.html). mical properties. and the aggregation propensities themselves are correlated to of proteins in E. coli8–12,22 are also em- pensities of proteins in vitro.13–20 These the in vivo concentrations,7 it should also ployed for predicting aggregation pro- results hint at the existence of a link be possible to make predictions about a

Fig. 4 The toxicity of mutational variants of the Ab peptide can be better predicted using the propensities to form protofibrillar assemblies rather than the propensities to form fibrillar aggregates.21 (a) Correlation between aggregation propensity and neurotoxicity in Drosophila of Ab mutational variants. The Zyggregator algorithm19,20 (http://www-vendruscolo.ch.cam.ac.uk/zyggregator.php) was used to rationally design 17 variants of the Ab peptide on the basis of their aggregation propensities. The toxicity of the variants, measured as deficiencies in the locomotor activity, shows a correlation of 75% with the predicted aggregation propensities.21 Mutants of the E22G ’’Arctic’’ variant (red circles) diverge from the trend defined by the mutants of the wild-type Ab (black circles). Notably, different fibril morphologies are associated with the E22G variants, which aggregate into low molecular weight protofibrils. Taken together, these results show that the algorithm used for predicting aggregation propensities, while capable of predicting the in vitro aggregation rates of all the mutants considered, including those on the E22G background,21 is unable to fully capture their toxic effects. (b) Correlation between the predictions of the propensities to form protofibrillar aggregates and toxicities measured in vivo in transgenic Drosophila.21 The toxicity of the variants, measured as deficiencies in the locomotor activity, shows a correlation of 83% with the predicted protofibrillar propensities, which is significantly higher than the one obtained using the aggregation propensities. In particular, mutations on the E22G background (red circles) do not diverge significantly from the trend given by the wild type mutants (black circles).

1172 | Mol. BioSyst., 2008, 4, 1170–1175 This journal is c The Royal Society of Chemistry 2008 variety of cellular processes by using the of all the possible 798 single-point muta- I31E/E22G variants suggests that non- chemical properties of the proteins in- tions of the Ab peptide were predicted. toxic proteins can be expressed at higher volved in them. In the next two sections Thirteen non-naturally occurring var- levels without impairing cellular pro- we provide two examples that show how iants were then selected according to cesses, which strongly supports our ob- this idea can be applied. such propensities for expression in the servation of a link between aggregation central nervous system of Drosophila.In rates and expression levels. addition, four mutational variants of the In order to rationalize the deviations Relationship between gene E22G (‘‘Arctic’’) form of the Ab peptide, of some of the mutants that we observed, expression levels and amino which leads to early onset of Alzheimer’s we considered the series of experimental acid composition of the disease in humans,34 were designed to studies that have shown for a range of corresponding proteins modulate its aggregation propensity. It peptides and proteins that fibril has been shown by in vitro experiments formation is preceded by the appearance We first illustrate the close connection that the E22G mutant forms aggregates of organized molecular assemblies usual- between protein chemistry and cell bio- and fibrils at much lower concentrations ly termed protofibrils.37,38 Interest in logy by showing that gene expression (o2.5 mM) than the wild type (12.5 mM).35 these low molecular weight oligomers levels and the amino acid compositions At the same concentration, the aggrega- has increased since these species have of the corresponding protein sequences tion process occurs more rapidly for been detected in the brains of patients are linked in E. coli (Fig. 3). Highly- the E22G mutant compared with the suffering from Alzheimer’s disease.39,40 expressed genes are associated with wild type protein and amyloid fibrils In fact, at different stages of Ab assem- protein sequences with high charge and hy- appear about four days earlier after the bly, Ab oligomers and fibrils are ob- drophilicity, while genes with low levels onset of sample incubation without served that differ in structure as well as of expression are associated with protein seeding.36 toxic function. In particular, earlier oli- sequences with high hydrophobicity and Using the Drosophila longevity to gomeric assemblies have been found to propensity to form b-sheets (Fig. 3). quantify the effect of Ab variants, a be toxic to cells in cell cultures and in a Consistent with these results, proteins correlation of 75% was found between transgenic mouse model.41,42 Hence, we with many polar amino acids represent the predicted propensities to aggregate investigated the relationship between an optimal target for heterologous ex- and the severity of neurodegeneration neurodegenerative effects of the Ab var- pression in E. coli.30 In addition, mem- (Fig. 4a). Although the transcription le- iants and their associated protofibrillar brane proteins, which are characterized vels of the wild type Ab and of the E22G propensities.21 In order to build a pre- by the presence of several extended hydro- variant were found to be practically the dictor for protofibrillar propensities, we phobic regions, show very low levels of same, the lifespan of flies expressing the used the aggregation rates of the mutants expression in E. coli.31 Furthermore, wild-type Ab peptide was observed to of the protein AcP43 to parametrize an natively unfolded proteins tend to have be about 45 days, while the lifespan of equation that has the same form as the more charged and fewer hydrophobic the flies expressing the E22G variant was one used for prediction of aggregation amino acids32 in order to increase their reduced to about 10 days.21 We also propensities. In the new predictions, we solubility in the absence of the protection found that flies expressing the F20E mu- assumed that regions characterized by against aggregation provided by a folded tant lived 25% longer than flies expres- either positive or negative propensities structure,19 but despite these features sing the wild-type Ab peptide, and that are important to predict the proto- their expression levels are in some cases the F20E mutant does not form in vivo fibrillar propensity. We made this lower than those of globular proteins,33 deposits, even when its transcription le- hypothesis because regions associated with thus indicating the key role played by the vels are three times higher than those of negative propensities are usually charged or native structure in maintaining the solu- the wild-type Ab peptide. High correla- have a high propensity for being unstruc- bility of polypeptide chains.19 tion was also found between predicted tured, which may play a key role at the aggregation propensities and locomotor early stages of the oligomerization pro- deficits. Some deviations from this trend cess.44–46 In the approach that we have Protein concentrations and were, however, observed. The I31E mu- described, the chemical properties that we neurotoxicity of Ab peptide tation on the E22G background use for predicting the aggregation rates into aggregates in Drosophila (I31E/E22G) exhibited neuronal effects that fibrillar or protofibrillar assemblies are si- melanogaster did not match the predicted aggregation milar. As we, however, determined the propensity. Intriguingly, the I31E/E22G parameters through separate fitting proce- The possibility of using protein chemis- peptide was found to aggregate in vitro at dures in which the aggregation rates into try to make quantitative predictions a rate that matches the predictions but its specific types of assemblies were measured, about cellular processes is further illu- deposits were not accompanied by cav- the relative weights of the chemical proper- strated here by reviewing a study in ities in brain tissue. Moreover, even ties are different, thus resulting in specific which the aggregation propensities of a when the transcription levels of the predictions for each type of assembly series of mutational variants of the Ab I31E/E22G peptide are twice as much process.20 peptide were related to the lifespan of as the levels of the E22G peptide, the The predicted protofibrillar propensi- transgenic Drosophila melanogaster.21 In protein was not observed to be toxic. ties showed a correlation of 83% with that study the aggregation propensities Importantly, the case of the F20E and the fly longevity, revealing a stronger

This journal is c The Royal Society of Chemistry 2008 Mol. BioSyst., 2008, 4, 1170–1175 | 1173 relationship with the in vivo toxicity than 3 R. L. Strausberg and S. L. Schreiber, Escherichia coli, Biotechnol. Bioeng., 1999, the aggregation propensity (Fig. 4b). From knowing to controlling: a path from 65, 382–388. genomics to drugs using small molecule 19 F. Chiti and C. M. Dobson, Protein mis- These results show not only that predic- probes, Science, 2003, 300, 294–295. folding, functional amyloid, and human tions made on the basis of the chemical 4 B. R. Stockwell, Chemical genetics: ligand- disease, Annu. Rev. Biochem., 2006, 75, properties of Ab peptide variants can based discovery of gene function, Nat. 333–366. account quantitatively for the observed Rev. Genet., 2000, 1, 116–125. 20 C. M. Dobson, Protein misfolding, evolu- 5 C. M. Dobson, Chemical space and bio- tion and disease, Trends Biochem. Sci., changes in the lifespan of transgenic flies logy, Nature, 2004, 432, 824–828. 1999, 24, 329–332. expressing such peptides in their central 6 S. Ding and P. G. Schultz, A role for 21 A. A. Pakula and R. T. Sauer, Genetic nervous system, but also that this type of chemistry in stem cell biology, Nat. analysis of protein stability and function, prediction can provide information Biotechnol., 2004, 22, 833–840. Annu. Rev. Genet., 1989, 23, 289–310. 7 G. G. Tartaglia, S. Pechmann, 22 K. A. Bava, M. M. Gromiha, H. Uedaira, about the nature of the molecular species C. M. Dobson and M. Vendruscolo, Life K. Kitajima and A. Sarai, ProTherm, ver- having the most serious neurotoxic on the edge: a link between gene expres- sion 4.0: thermodynamic database for pro- effects. sion levels and aggregation rates of human teins and mutants, Nucleic Acids Res., proteins, Trends Biochem. Sci., 2007, 32, 2004, 32, D120–D121. 204–206. 23 R. Godoy-Ruiz, R. Perez-Jiminez, 8 D. L. Wilkinson and R. G. Harrison, Pre- B. Ibarra-Molero and J. M. Sanchez-Ruiz, Conclusions dicting the solubility of recombinant pro- Relation between protein stability, evolu- teins in Escherichia coli, Bio/Technology, tion and structure, as probed by carboxylic We have described close relationships 1991, 9, 443–448. acid mutations, J. Mol. Biol., 2004, 336, between the chemical properties of pro- 9 C. H. Schein, Solubility as a function of 313–318. teins and their solubility, expression le- protein structure and solvent components, 24 Z. Ignatova, Monitoring protein stability Bio/Technology, 1990, 8, 308–315. in vivo, Microb. Cell Fact., 2005, 4, 23. vels and aggregation propensities. These 10 C. S. Goh, N. Lan, S. M. Douglas, B. Wu, 25 P. Lu, C. Vogel, R. Wang, X. Yao and results suggest that strategies for predict- N.Echols,A.Smith,D.Milburn, E. M. Marcotte, Absolute protein expres- ing the outcome of cellular processes G. T. Montelione, H. Zhao and sion profiling estimates the relative contri- based on chemical properties of proteins M. Gersterin, Mining the structural genomics butions of transcriptional and pipeline: identification of protein properties translational regulation, Nat. Biotechnol., can be used as an alternative to those that affect high-throughput experimental 2007, 25, 117–124. based on the analysis of the cellular analysis, J. Mol. Biol., 2004, 336, 115–130. 26 K. Bu¨ssow, C. Scheich, V. Sievert, processes themselves.47–51 11 S. Idicula-Thomas, A. J. Kulkarni, U. Harttig, J. Schultz, B. Simon, P. Bork, B. D. Kulkarni, V. K. Jayaraman and H. Lehrach and U. Heinemann, Structural We have illustrated this approach by P. V. Balaji, A support vector machine-based genomics of human proteins—target selec- showing that neurotoxic effects of mis- method for predicting the propensity of a tion and generation of a public catalogue folded protein assemblies in a living or- protein to be soluble or to form inclusion of expression clones, Microb. Cell Fact., ganism can be predicted on the basis of body on overexpression in Escherichia coli, 2005, 4, 21. Bioinformatics, 2006, 22, 278–284. 27 S. Sundararaj, A. Guo, B. Habibi-Nazhad, the analysis of the chemical properties of 12 K. F. DuBay, A. P. Pawar, F. Chiti, J. Zurdo, M. Rouani, P. Stothard, M. Ellison and proteins that determine their propensities C. M. Dobson and M. Vendruscolo, Predic- D. S. Wishart, The CyberCell Database to form protofibrillar aggregates. Thus, tion of the absolute aggregation rates of (CCDB): a comprehensive, self-updating, although cells have evolved complex reg- amyloidogenic polypeptide chains, J. Mol. relational database to coordinate and Biol., 2004, 341, 1317–1326. facilitate in silico modeling of Escherichia ulatory mechanisms to supervise folding 13 A. P. Pawar, K. F. Dubay, J. Zurdo, coli, Nucleic Acids Res., 2004, 32, and combat misfolding, the intrinsic ten- F. Chiti, M. Vendruscolo and D293–D295. dency of polypeptide chains to aggregate C. M. Dobson, Prediction of ‘‘aggregation- 28 V. N. Uversky, revisited. prone’’ and ‘‘aggregation-susceptible’’ A polypeptide chain at the folding- plays a key role in determining the regions in proteins associated with neuro- misfolding-nonfolding cross-roads: which pathological consequences of their degenerative diseases, J. Mol. Biol., 2005, way to go?, Cell. Mol. Life Sci., 2003, 60, aberrant assemblies. 350, 379–392. 1852–1871. 14 A. Trovato, F. Chiti, A. Maritan and 29 A. Mohan, W. J. Sullivan, Jr, F. Seno, Insight into the structure of amy- P. Radivojac, A. K. Dunker and loid fibrils from the analysis of globular V. N. Uversky, Intrinsic disorder in patho- Acknowledgements proteins, PLoS Comput. Biol., 2007, 2, genic and non-pathogenic microbes: dis- 1608–1618. covering and analyzing the unfoldomes of We thank Dr Sebastian Pechmann, Dr 15 G. G. Tartaglia, A. Pawar, S. Campioni, early-branching eukaryotes, Mol. BioSyst., D. Crowther, Dr L. Luheshi, and Prof. F. Chiti, C. M. Dobson and 2008, 4, 328–340. C. M. Dobson for stimulating discus- M. Vendruscolo, Prediction of aggregation- 30 K. Sennvik, C. Nilsberth, C. Stenh, sions. This work was supported by the prone regions in structured proteins, L. Lannfelt and E. Bendedikz, The Arctic J. Mol. Biol.,2008,380, 425–436. Alzheimer mutation enhances sensitivity Medical Research Council, the Well- 16 G. G. Tartaglia and M. Vendruscolo, The to toxic stress in human neuroblastoma come Trust, and Leverhulme Trust. Zyggregator method for predicting protein cells, Neurosci. Lett., 2002, 326, 51–55. aggregation propensities, Chem. Soc. Rev., 31 O. N. Antzutkin, Amyloidosis of Alzheimer’s 2008, 37, 1395–1401. Abeta peptides: solid-state nuclear magnetic References 17 L. M. Luheshi, G. G. Tartaglia, resonance, electron paramagnetic resonance, A. C. Brorsson, A. P. Pawar, I. E. Watson, transmission electron microscopy, scanning 1 M. Levine and R. Tjian, Transcription F. Chiti, M. Vendruscolo, D. A. Lomas, transmission electron microscopy and atomic regulation and animal diversity, Nature, C. M. Dobson and D. C. Crowther, Systema- force microscopy studies, Magn. Reson. 2003, 424, 147–151. tic in vivo analysis of the intrinsic determi- Chem., 2004, 42, 231–246. 2 E. H. Davidson, D. R. McClay and nants of amyloid Beta pathogenicity, PLoS 32 J. Meinhardt, G. G. Tartaglia, A. L. Hood, Regulatory gene networks and Biol., 2007, 5, e290. Pawar, T. Christopeit, P. Hortschansky, the properties of the developmental pro- 18 G. D. Davis, C. Elisee, D. M. Newham and V. Schroeckh, C. M. Dobson, cess, Proc. Natl. Acad. Sci. U. S. A., 2003, R. G. Harrison, New fusion protein sys- M. Vendruscolo and M. Fa¨ndrich, Simila- 100, 1475–1480. tems designed to give soluble expression in rities in the thermodynamics and kinetics

1174 | Mol. BioSyst., 2008, 4, 1170–1175 This journal is c The Royal Society of Chemistry 2008 of aggregation of disease-related 37 M. D. Kirkitadze, G. Bitan and fibrinogen, Proc. Natl. Acad. Sci. U. S. A., Abeta(1-40) peptides, Protein Sci., 2007, D. B. Teplow, Paradigm shifts in 2000, 97, 85–90. 16, 1214–1222. Alzheimer’s disease and other neurodegen- 42 F. Massi, D. Kilmov, D. Thirumalai and 33 J. D. Harper, S. S. Wong, C. M. Lieber erative disorders: the emerging role of J. E. Straub, Charge states rather than and P. T. Lansbury, Observation of meta- oligomeric assemblies, J. Neurosci. Res., propensity for beta-structure determine stable Abeta amyloid protofibrils by atom- 2002, 69, 567–577. enhanced fibrillogenesis in wild-type ic force microscopy, Chem. Biol., 1997, 4, 38 M. P. Lambert, A. K. Barlow, Alzheimer’s beta-amyloid peptide com- 119–125. B. A. Chromy, C. Edwards, C. Freed, pared to E22Q Dutch mutant, Protein 34 R. Bader, R. Bamford, J. Zurdo, M. Liosatos, T. E. Morgan, I. Rozovsky, Sci., 2002, 11, 1639–1647. B. F. Luisi and C. M. Dobson, Probing B. Trommer, K. L. Viola, P. Wals, 43 D. H. Lackner, T. H. Beilharz, the mechanism of amyloidogenesis C. Zhang, C. E. Finch, G. A. Krafft and S. Marguerat, J. Mata, S. Watt, through a tandem repeat of the W. L. Klein, Diffusible, nonfibrillar li- F. Shubert, T. Preiss and J. Ba¨ler, A net- PI3-SH3 domain suggests a generic model gands derived from Abeta1-42 are potent work of multiple regulatory layers shapes for protein aggregation and fibril central nervous system neurotoxins, Proc. gene expression in fission yeast, Mol. Cell, formation, J. Mol. Biol., 2006, 356, Natl. Acad. Sci. U. S. A., 1998, 95, 2007, 26, 145–155. 189–208. 6448–6453. 44 D. H. Nguyen and P. D’haeseleer, Deci- 35 A. E. Roher, M. O. Chaney, Y. M. Kuo, 39 F. Chiti, M. Calamai, N. Taddei, phering principles of transcription regula- S. D. Webster, W. B. Stine, M. Stefani, G. Ramponi and tion in eukaryotic genomes, Mol. Syst. L. J. Haverskamp, A. S. Woods, C. M. Dobson, Studies of the aggregation Biol., 2006, 2, 1–9. R. J. Cotter, J. M. Tuohy, G. A. KRafft, of mutant proteins in vitro provide insights 45 E. Segal, T. Ravek-Sadka, M. Schroeder, B. S. Bonnel and M. R. Emmerling, Mor- into the genetics of amyloid diseases, Proc. U. Unnerstall and U. Gaul, Predicting phology and toxicity of Abeta-(1-42) di- Natl. Acad. Sci. U. S. A., 2002, 99, expression patterns from regulatory se- mer derived from neuritic and vascular 16419–16426. quence in Drosophila segmentation, amyloid deposits of Alzheimer’s disease, 40 T. Lu¨hrs, C. Ritter, M. Adrian, D. Riek- Nature, 2008, 451, 535–540. J. Biol. Chem., 1996, 271, 20631–20635. Loher, B. Bohrmann, H. Do¨beli, 46 H. J. Bussemaker, B. C. Foat and 36 L. F. Lue, Y. M. Kuo, A. E. Roher, D. Schubert and R. Riek, 3D structure of L. D. Ward, Predictive modeling of gen- L. Brachova, Y. Shen, L. Sue, T. Beach, Alzheimer’s amyloid-beta(1-42) fibrils, ome-wide mRNA expression: from mod- J. H. Kurth, R. E. Rydel and J. Rogers, Proc. Natl. Acad. Sci. U. S. A., 2005, ules to molecules, Annu. Rev. Biophys. Soluble amyloid beta peptide concentra- 102, 17342–17347. Biomol. Struct., 2007, 36, 329–347. tion as a predictor of synaptic change in 41 J. H. Brown, N. Volkmann, G. Jun, 47 M. A. Beer and S. Tavazoie, Predicting Alzheimer’s disease, Am. J. Pathol., 1999, A. H. Henschen-Edman and C. Cohen, gene expression from sequence, Cell, 2004, 155, 853–862. The crystal structure of modified bovine 117, 185–198.

This journal is c The Royal Society of Chemistry 2008 Mol. BioSyst., 2008, 4, 1170–1175 | 1175