Navigating the Extremes of Biological Datasets for Reliable Structural Inference and Design
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Design and Screening of Degenerate-Codon-Based Protein Ensembles with M13 Bacteriophage
Design and Screening of Degenerate-Codon-Based Protein Ensembles with M13 Bacteriophage by Griffin James Clausen B.S. Biomedical Engineering University of Minnesota – Twin Cities, 2012 Submitted to the Department of Biological Engineering in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy in Biological Engineering at the Massachusetts Institute of Technology February 2019 © 2019 Massachusetts Institute of Technology. All rights reserved. Signature of Author: ____________________________________________________________ Griffin Clausen Department of Biological Engineering January 14, 2019 Certified by: ___________________________________________________________________ Angela Belcher James Mason Crafts Professor of Biological Engineering and Materials Science Thesis Supervisor Accepted by: ___________________________________________________________________ Forest White Professor of Biological Engineering Graduate Program Committee Chair This doctoral thesis has been examined by the following committee: Amy Keating Thesis Committee Chair Professor of Biology and Biological Engineering Massachusetts Institute of Technology Angela Belcher Thesis Supervisor James Mason Crafts Professor of Biological Engineering and Materials Science Massachusetts Institute of Technology Paul Blainey Core Member, Broad Institute Associate Professor of Biological Engineering Massachusetts Institute of Technology 2 Design and Screening of Degenerate-Codon-Based Protein Ensembles with M13 Bacteriophage by Griffin James Clausen Submitted -
Modern Methods for Laboratory Diversification of Biomolecules Bratulic and Badran 51
Available online at www.sciencedirect.com ScienceDirect Modern methods for laboratory diversification of biomolecules Sinisa Bratulic and Ahmed H Badran Genetic variation fuels Darwinian evolution, yet spontaneous to the success of a directed evolution campaign, as the mutation rates are maintained at low levels to ensure cellular sequence landscape of a standard protein or biopolymer is viability. Low mutation rates preclude the exhaustive typically too vast to be exhaustively searched [1]. Where a exploration of sequence space for protein evolution and priori information is limited, unbiased and random in vitro genome engineering applications, prompting scientists to [2,3] or in vivo [4–7] mutagenesis methods have success- develop methods for efficient and targeted diversification of fully generated libraries of variants with improved or nucleic acid sequences. Directed evolution of biomolecules novel functionalities. Alternatively, bioinformatics, struc- relies upon the generation of unbiased genetic diversity to tural, or biochemical information can be leveraged to discover variants with desirable properties, whereas genome- comprehensively explore a portion of the variant land- engineering applications require selective modifications on a scape by focusing mutagenesis on functionally relevant genomic scale with minimal off-targets. Here, we review the sites [8,9]. Finally, newly identified beneficial mutations current toolkit of mutagenesis strategies employed in directed can be isolated or integrated into single sequences using evolution and genome engineering. These state-of-the-art in vitro [10,11] or in vivo [12 ,13] recombination methods. methods enable facile modifications and improvements of Recently, laboratory evolution has shifted to techniques single genes, multicomponent pathways, and whole genomes that directly couple the diversification and assessment for basic and applied research, while simultaneously paving the steps, providing the basis for continuous in vivo evolution way for genome editing therapeutic interventions. -
Combinatorial Reshaping of the Candida Antarctica Lipase a Substrate Pocket for Enantioselectivity Using an Extremely Condensed Library
Combinatorial reshaping of the Candida antarctica lipase A substrate pocket for enantioselectivity using an extremely condensed library Anders G. Sandström1, Ylva Wikmark1, Karin Engström, Jonas Nyhlén, and Jan-E. Bäckvall2 Department of Organic Chemistry, Arrhenius Laboratory, Stockholm University, SE-106 91 Stockholm, Sweden Edited by Chi-Huey Wong, Academia Sinica, Taipei, Taiwan, and approved October 14, 2011 (received for review July 15, 2011) A highly combinatorial structure-based protein engineering meth- ful in improving properties such as thermostability, substrate od for obtaining enantioselectivity is reported that results in a acceptance or enantioselectivity (17, 18). These methods gener- thorough modification of the substrate binding pocket of Candida ally create small libraries, as only a few amino acid residues are antarctica lipase A (CALA). Nine amino acid residues surrounding targeted. Protein engineering methods that focus on the active the entire pocket were simultaneously mutated, contributing to site has a higher chance of influencing catalytic properties (19). a reshaping of the substrate pocket to give increased enantio- Such active-site-focused libraries have been used with good re- selectivity and activity for a sterically demanding substrate. This sults for the improvement of activity and enantioselectivity (3). approach seems to be powerful for developing enantioselectivity Two or three amino acid positions are generally subjected simul- when a complete reshaping of the active site is required. Screening taneously to mutagenesis to account for the potential synergistic toward ibuprofen ester 1, a substrate for which previously used conformational and electrostatic effects that may appear. This ap- methods had failed, gave variants with a significantly increased proach is called combinatorial active-site saturation test (CAST) enantioselectivity and activity. -
Analyses of All Possible Point Mutations Within a Protein Reveals Relationships Between Function and Experimental Fitness: a Dissertation
University of Massachusetts Medical School eScholarship@UMMS GSBS Dissertations and Theses Graduate School of Biomedical Sciences 2014-03-25 Analyses of All Possible Point Mutations within a Protein Reveals Relationships between Function and Experimental Fitness: A Dissertation Benjamin P. Roscoe University of Massachusetts Medical School Let us know how access to this document benefits ou.y Follow this and additional works at: https://escholarship.umassmed.edu/gsbs_diss Part of the Biochemistry Commons, Cellular and Molecular Physiology Commons, Molecular Biology Commons, and the Molecular Genetics Commons Repository Citation Roscoe BP. (2014). Analyses of All Possible Point Mutations within a Protein Reveals Relationships between Function and Experimental Fitness: A Dissertation. GSBS Dissertations and Theses. https://doi.org/10.13028/M2G027. Retrieved from https://escholarship.umassmed.edu/gsbs_diss/716 This material is brought to you by eScholarship@UMMS. It has been accepted for inclusion in GSBS Dissertations and Theses by an authorized administrator of eScholarship@UMMS. For more information, please contact [email protected]. ANALYSES OF ALL POSSIBLE POINT MUTATIONS WITHIN A PROTEIN REVEALS RELATIONSHIPS BETWEEN FUNCTION AND EXPERIMENTAL FITNESS A Dissertation Presented By BENJAMIN PETER ROSCOE Submitted to the Faculty of the University of Massachusetts Graduate School of Biomedical Sciences, Worcester in partial fulfillment for the degree of DOCTOR OF PHILOSOPHY (March 25, 2014) BIOCHEMISTRY AND MOLECULAR PHARMACOLOGY ANALYSES OF ALL POSSIBLE POINT MUTATIONS WITHIN A PROTEIN REVEALS RELATIONSHIPS BETWEEN FUNCTION AND EXPERIMENTAL FITNESS A Dissertation Presented By BENJAMIN PETER ROSCOE The signatures of the Dissertation Defense Committee signify completion and approval as to style and content of the Dissertation Daniel N. -
Methods for the Directed Evolution of Proteins
REVIEWS STUDY DESIGNS Methods for the directed evolution of proteins Michael S. Packer and David R. Liu Abstract | Directed evolution has proved to be an effective strategy for improving or altering the activity of biomolecules for industrial, research and therapeutic applications. The evolution of proteins in the laboratory requires methods for generating genetic diversity and for identifying protein variants with desired properties. This Review describes some of the tools used to diversify genes, as well as informative examples of screening and selection methods that identify or isolate evolved proteins. We highlight recent cases in which directed evolution generated enzymatic activities and substrate specificities not known to exist in nature. Natural selection Over many generations, iterated mutation and natural In this Review, we summarize techniques that gener- A process by which individuals selection during biological evolution provide solutions ate single-gene libraries, including standard methods as with the highest reproductive for challenges that organisms face in the natural world. well as novel approaches that can generate superior diver- fitness pass on their genetic material to their offspring, thus However, the traits that result from natural selection sity containing a larger proportion of functional mutants. maintaining and enriching only occasionally overlap with features of organ- We also review screening and selection methods that heritable traits that are isms and biomolecules that are sought by humans. identify or isolate improved variants within these librar- adaptive to the natural To guide evolution to access useful phenotypes more ies. Although these strategies can be applied to multi environment. frequently, humans for centuries have used artificial gene pathways3,4 and gene networks5–7, the examples in selection Artificial selection , beginning with the selective breeding of this Review will focus exclusively on the laboratory evolu- 1 2 (Also known as selective crops and domestication of animals . -
Nonproteinogenic Deep Mutational Scanning of Linear and Cyclic Peptides
Nonproteinogenic deep mutational scanning of linear and cyclic peptides Joseph M. Rogersa, Toby Passiouraa, and Hiroaki Sugaa,b,1 aDepartment of Chemistry, Graduate School of Science, The University of Tokyo, Tokyo 113-0033, Japan; and bCore Research for Evolutionary Science and Technology, Japan Science and Technology Agency, Saitama 332-0012, Japan Edited by David Baker, University of Washington, Seattle, WA, and approved September 18, 2018 (received for review June 10, 2018) High-resolution structure–activity analysis of polypeptides re- mutants that can be constructed (18). Moreover, it is possible to quires amino acid structures that are not present in the universal combine parallel peptide synthesis with measures of function genetic code. Examination of peptide and protein interactions (19). However, these approaches cannot construct peptide li- with this resolution has been limited by the need to individually braries with the sequence length and numbers that deep muta- synthesize and test peptides containing nonproteinogenic amino tional scanning can, which, at its core, uses high-fidelity nucleic acids. We describe a method to scan entire peptide sequences with acid-directed synthesis of polypeptides by the ribosome. multiple nonproteinogenic amino acids and, in parallel, determine Ribosomal synthesis (i.e., translation) can be manipulated to the thermodynamics of binding to a partner protein. By coupling include nonproteinogenic amino acids (20). In vitro genetic code genetic code reprogramming to deep mutational scanning, any reprogramming is particularly versatile, allowing for the in- number of amino acids can be exhaustively substituted into pep- corporation of amino acids with diverse chemical structures (21). tides, and single experiments can return all free energy changes of Flexizymes, flexible tRNA-acylation ribozymes, can load almost binding. -
Codongenie: Optimised Ambiguous Codon Design Tools
A peer-reviewed version of this preprint was published in PeerJ on 10 July 2017. View the peer-reviewed version (peerj.com/articles/cs-120), which is the preferred citable publication unless you specifically need to cite this preprint. Swainston N, Currin A, Green L, Breitling R, Day PJ, Kell DB. 2017. CodonGenie: optimised ambiguous codon design tools. PeerJ Computer Science 3:e120 https://doi.org/10.7717/peerj-cs.120 CodonGenie: optimised ambiguous codon design tools Neil Swainston Corresp., 1 , Andrew Currin 1 , Lucy Green 1 , Rainer Breitling 1, 2 , Philip J Day 3 , Douglas B Kell 1, 2 1 Manchester Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM), University of Manchester, Manchester, United Kingdom 2 School of Chemistry, University of Manchester, Manchester, United Kingdom 3 Faculty of Biology, Medicine and Health, University of Manchester, Manchester, United Kingdom Corresponding Author: Neil Swainston Email address: [email protected] CodonGenie, freely available from http://codon.synbiochem.co.uk , is a simple web application for designing ambiguous codons to support protein mutagenesis applications. Ambiguous codons are derived from specific heterogeneous nucleotide mixtures, which create sequence degeneracy when synthesised in a DNA library. In directed evolution studies, such codons are carefully selected to encode multiple amino acids. For example, the codon NTN, where the code N denotes a mixture of all four nucleotides, will encode a mixture of phenylalanine, leucine, isoleucine, methionine and valine. Given a user-defined target collection of amino acids matched to an intended host organism, CodonGenie designs and analyses all ambiguous codons that encode the required amino acids. -
Validation of Correction of Macromolecular Structural Models
Validation of Correction of Macromolecular Structural Models William Rochira MSc by Research University of York Chemistry December 2020 Abstract Recent advances in automation in the field of computational structural biology have created a void to be filled by novel validation software. In this project, the problems and inadequacies of currently available validation tools are identified, and the requirements of a novel validation tool are both ascertained and addressed. The development of a new validation software package is described in detail, starting with the development of the front-end interface and the back-end calculations, followed by the integration of these two components to produce an all-in-one validation package, which can calculate its own comprehensive per-residue validation metrics and present them in a compact, interactive, graphical interface, so as to allow the intuitive and thorough analysis of a protein model’s quality that is understandable at a glance. This interface features a novel graphical representation of validation, which plots multiple validation metrics along concentric axes such that correlations between those metrics are immediately apparent, and poorly-modelled regions are emphasised to the user. The software can be run standalone, or plugged into new or existing validation pipelines, and can incorporate calculated metrics from other validation services such as MolProbity (1). It supports multi-model comparison in its single view, and runs with negligible time penalty, making it especially suitable for evaluating incremental changes that result from automated or manual iterative model building. To showcase its extensibility and pluggable design, the integration of this package into the existing CCP4i2 (2) software suite is described. -
Ccp4 Newsletter on Protein Crystallography
CCP4 NEWSLETTER ON PROTEIN CRYSTALLOGRAPHY An informal Newsletter associated with the BBSRC Collaborative Computational Project No. 4 on Protein Crystallography. Number 42 Summer 2005 Contents News 1. CCP4 General news html Peter Briggs(1), Charles Ballard(1), Martyn Winn(1), Norman Stein(1), Daniel Rolfe(1), Francois Remacle(1), Graeme Winter(1), Ronan Keegan(1), Paul Emsley(2) 1CCP4, Daresbury Laboratory, Warrington WA4 4AD, UK, 2Structural Biology department, York University, York, UK 2. Developments in CCP4i html Peter Briggs, Charles Ballard, Martyn Winn, Francois Remacle CCP4, Daresbury Laboratory, Warrington WA4 4AD, UK 3. Report on the CCP4 Workshop at ACA 2005, Orlando, Florida html Peter Briggs CCP4, Daresbury Laboratory, Warrington WA4 4AD, UK 4. Diamond: Status Report on MX Beamlines and Computing pdf doc Alun Ashton, Katherine McAuley, Sara Fletcher, Jose Brandao-Neto, Liz Duke, Gwyndaf Evans, Ralf Flaig, Bill Pulford, Thomas Sorensen, Richard Woolliscroft Diamond Light Source, Diamond House, Chilton, OX11 0DE Software 5. Crank, Crunch2 and Bp3: A platform for rapid automated structure determination html Steven R. Ness, Irakli Sikharulidze, R.A.G de Graaff, Navraj S. Pannu Biophysical Structural Chemistry, Leiden Institute of Chemistry, P.O. Box 9502, 2300 RA Leiden, The Netherlands 6. CHOOCH – automatic analysis of fluorescence scans and determination of optimal X-ray wavelengths for MAD and SAD doc Gwyndaf Evans Diamond Light Source, Diamond House, Chilton, OX11 0DE 7. Coot news pdf Paul Emsley(1), Kevin Cowtan(1), Bernhard Lohkamp(2) 1Structural Biology department, York University, York, UK, 2Department of Medical biochemistry and biophysics, Karolinska institutet, Stockholm, Sweden 8. The Phenix refinement framework doc Afonine P.V., Grosse-Kunstleve R.W, Adams P.D. -
Saturation Mutagenesis of a Major Histocompatibility Complex
Proc. Natl. Acad. Sci. USA Vol. 85, pp. 3535-3539, May 1988 Genetics Saturation mutagenesis of a major histocompatibility complex protein domain: Identification of a single conserved amino acid important for allorecognition (H-2/cytotoxic T cells) RICHARD MURRAY, CLYDE A. HUTCHISON III, AND JEFFREY A. FRELINGER* Department of Microbiology and Immunology and the Lineberger Cancer Research Center, University of North Carolina, Chapel Hill, NC 27599 Communicated by Ray D. Owen, February 19, 1988 (received for review November 23, 1987) ABSTRACT The interactive association between T lym- class II molecules and antigen (7, 8), and form the basis of phocytes and their target cells is an important system of recognition by CTLs. CTLs also recognize foreign class I cell-cell interactions. Major histocompatibility complex class I molecules directly (9, 10). molecules are the cell surface structures recognized by cytolytic Our laboratory has identified the class I H-2DP gene from T lymphocytes. To define the molecular structures recognized a genomic library by transfection experiments (11). Sequence by cytotoxic T lymphocytes, we have saturated the 270-base- analysis revealed a typical murine class I gene (12). Also, we pair al exon of the H-2DP gene with point mutations, rapidly have shown that most of the H-2DP epitopes recognized by producing a "library" of 2.5 x 103 independent mutants. The CTLs and B cells reside in the al and a2 protein domains library contains enough recombinant clones (each clone encod- (refs. 13 and 14; T. Kanda and J.A.F., unpublished obser- ing approximately one amino acid replacement mutation) to vations), similar to other murine class I molecules (15-17). -
The Coot User Manual
The Coot User Manual 1st April 2004 2 Paul Emsley [email protected] Contents 1 Introduction 7 1.1 This document . ............................ 7 1.2 What is Coot? . ............................ 7 1.3 What Coot is Not . ............................ 7 1.4 Hardware Requirements . ........................ 7 1.4.1 Mouse . ............................ 8 1.5 Environment Variables . ........................ 8 1.6 Command Line Arguments ........................ 8 1.7 Web Page ................................... 9 1.8 Crash . ................................... 9 2 Mousing and Keyboarding 11 2.1 Next Residue . ............................ 11 2.2 Keyboard Contouring . ........................ 11 2.3 Keyboard Rotation . ............................ 11 2.4 Keyboard Translation ............................ 12 2.5 Keyboard Zoom and Clip . ........................ 12 2.6 Scrollwheel . ............................ 12 2.7 Selecting Atoms . ............................ 12 2.8 Virtual Trackball . ............................ 12 2.9 More on Zooming . ............................ 12 3 General Features 13 3.1 Version number . ............................ 13 3.2 Antialiasing . ............................ 13 3.3 Molecule Number . ............................ 13 3.4 Display Issues . ............................ 13 3.4.1 Origin Marker ............................ 14 3.4.2 Raster3D output . ........................ 14 3.5 Display Manager . ............................ 14 3.6 The file selector . ............................ 15 3.6.1 File-name Filtering . ....................... -
Patch Cloning Method for Multiple Site-Directed and Saturation Mutagenesis Naohiro Taniguchi, Sayumi Nakayama, Takashi Kawakami and Hiroshi Murakami*
Taniguchi et al. BMC Biotechnology 2013, 13:91 http://www.biomedcentral.com/1472-6750/13/91 METHODOLOGY ARTICLE Open Access Patch cloning method for multiple site-directed and saturation mutagenesis Naohiro Taniguchi, Sayumi Nakayama, Takashi Kawakami and Hiroshi Murakami* Abstract Background: Various DNA manipulation methods have been developed to prepare mutant genes for protein engineering. However, development of more efficient and convenient method is still demanded. Homologous DNA assembly methods, which do not depend on restriction enzymes, have been used as convenient tools for cloning and have been applied to site-directed mutagenesis recently. This study describes an optimized homologous DNA assembly method, termed as multiple patch cloning (MUPAC), for multiple site-directed and saturation mutagenesis. Results: To demonstrate MUPAC, we introduced five back mutations to a mutant green fluorescent protein (GFPuv) with five deleterious mutations at specific sites and transformed Escherichia coli (E. coli) with the plasmids obtained. We observed that the over 90% of resulting colonies possessed the plasmids containing the reverted GFPuv gene and exhibited fluorescence. We extended the test to introduce up to nine mutations in Moloney Murine Leukemia Virus reverse transcriptase (M-MLV RT) by assembling 11 DNA fragments using MUPAC. Analysis of the cloned plasmid by electrophoresis and DNA sequencing revealed that approximately 30% of colonies had the objective mutant M-MLV RT gene. Furthermore, we also utilized this method to prepare a library of mutant GFPuv genes containing saturation mutations at five specific sites, and we found that MUPAC successfully introduced NNK codons at all five sites, whereas other site remained intact. Conclusions: MUPAC could efficiently introduce various mutations at multiple specific sites within a gene.