Homology Modeling Lausanne, February 22, 2007

Total Page:16

File Type:pdf, Size:1020Kb

Load more

Swiss Institute of Bioinformatics EMBnet course: Introduction to Protein Structure Bioinformatics Homology Modeling Lausanne, February 22, 2007 Torsten Schwede Biozentrum - Universität Basel Swiss Institute of Bioinformatics Klingelbergstr 50-70 CH - 4056 Basel, Switzerland Tel: +41-61 267 15 81 How many structures do we know? http://www.wwpdb.org/ How many structures do we know? Growth of the Protein Data Bank PDB Total Yearly [ PDB: http://www.pdb.org ] [ PDB: http://www.pdb.org ] How many structures do we know? 10,000,000 1,000,000 Î No experimental structure for most protein sequences 100,000 10,000 TrEMBL 1,000 SwissProt PDB 100 1986 1988 1990 1992 1994 1996 1998 2000 2002 2004 2006 (Sources: PDB, EBI, SIB) In the near future for most of the known protein sequences no experimental structure will be available. Can we predict protein structures from genome sequences? MNIFEMLRID EGLRLKIYKD TEGYYTIGIG HLLTKSPSLN AAKSELDKAI GRNCNGVITK DEAEKLFNQD VDAAVRGILR NAKLKPVYDS LDAVRRCALI NMVFQMGETG VAGFTNSLRM LQQKRWDEAA VNLAKSRWYN QTPNRAKRVI TTFRTGTWDA YKNL MNIFEMLRID EGLRLKIYKD TEGYYTIGIG HLLTKSPSLN AAKSELDKAI GRNCNGVITK DEAEKLFNQD VDAAVRGILR NAKLKPVYDS LDAVRRCALI NMVFQMGETG VAGFTNSLRM LQQKRWDEAA VNLAKSRWYN QTPNRAKRVI TTFRTGTWDA YKNL Many proteins fold spontaneously to their native structure Protein folding is relatively fast (nsec – sec) Chaperones speed up folding, but do not alter the structure The protein sequence contains all information needed to create a correctly folded protein. Can we predict the folding process of a protein structure from their sequences (ab initio)? Molecular Dynamics 2 ki ν = ∑ ()li − li,0 bonds 2 2 ki + ∑ ()θi −θi,0 angles 2 V + ∑ N ()1+ cos()nω −γ torsions 2 12 6 N N ⎛ ⎡ ⎤ ⎞ ⎜ ⎛σ ij ⎞ ⎛σ ij ⎞ qiq j ⎟ + 4πε ⎢⎜ ⎟ − ⎜ ⎟ ⎥ + ∑∑⎜ ij ⎜ ⎟ ⎜ ⎟ ⎟ i=+11j=i ⎜ ⎢ r r ⎥ 4πε r ⎟ ⎝ ⎣⎝ ij ⎠ ⎝ ij ⎠ ⎦ 0 ij ⎠ Ab initio protein folding simulation Physical time for simulation 10–4 seconds Typical time-step size 10–15 seconds Number of MD time steps 1011 Atoms in a typical protein and water simulation 32’000 Approximate number of interactions in force calculation 109 Machine instructions per force calculation 1000 Total number of machine instructions 1023 Petaflop capacity computer (floating point operations per second) 1 petaflop (1015) Î Blue Gene will need 1-3 years to simulate 100 μsec. [ http://www.research.ibm.com/bluegene/ ] Growth of the Protein Data Bank PDB “Old” folds per year New folds per year [ PDB: http://www.pdb.org ] CATH - Protein Structure Classification Class(C) derived from secondary structure content is assigned automatically Architecture(A) describes the gross orientation of secondary structures, independent of connectivity. Topology(T) clusters structures according to their topological connections and numbers of secondary structures Homologous Superfamily (H) This level groups together protein domains which are thought to share a common ancestor and can therefore be described as homologous. [ http://www.biochem.ucl.ac.uk/bsm/cath_new/ ] Sequence similarity implies structural similarity? 100 . Sequence identity 75 implies structural similarity ! 50 Don't 25 know Pairwise sequence identity region 0 Number of residues aligned (B.Rost, Columbia, NewYork) Sequence similarity implies structural similarity? . 100 identity simil arity 80 Sequence identity 60 implies structural similarity 40 identity/similarity Don’t Percentage sequence 20 know region ..... 0 0 50 100 150 200 250 Number of residues aligned (B.Rost, Columbia, NewYork) Fold recognition / Threading Find a compatible fold for a given sequence .... >Protein XY MSTLYEKLGGTTAVDLAV DKFYERVLQDDRIKHFFA ? DVDMAKQRAHQKAFLTYA FGGTDKYDGRYMREAHKE ≈ LVENHGLNGEHFDAVAED LLATLKEMGVPEDLIAEV AAVAGAPAHKRDVLNQ Number of protein folds that occurs in nature is limited. Fold Recognition can be used to: ¾ Identify templates for comparative modeling ¾ Assign Protein Function Fold recognition / Threading The "biological" perspective: Homologous proteins have evolved by molecular evolution from a common ancestor. If we can establish homology, we can predict aspects of structure and function of a new protein by analogy. The "physical" perspective: The native conformation of a protein corresponds to a global free energy minimum of the protein / solvent system. To identify a compatible fold, the protein sequence is "threaded" through a library of folds, and empirical energy calculations are used to evaluate compatibility. No single method is perfect. Consensus methods often perform better: ¾ MetaPP: http://cubic.bioc.columbia.edu/predictprotein/ ¾ http://bioinfo.pl/meta/ Further reading: Adam Godzik, "Fold Recognition Methods", in: "Structural Bioinformatics", Bourne & Weissig, Eds. Protein Structure / Fold Databases PDB: http://www.pdb.org EBI-MSD http://www.ebi.ac.uk/msd/ SCOP http://scop.mrc-lmb.cam.ac.uk/scop/ CATH http://www.biochem.ucl.ac.uk/bsm/cath_new/ Fold Recognition Servers Meta server ¾ http://bioinfo.pl/meta/ 3DPSSM / Phyre ¾ http://www.sbg.bio.ic.ac.uk/servers/3dpssm/ ¾ http://www.sbg.bio.ic.ac.uk/~phyre/ GenTHREADER ¾ http://bioinf.cs.ucl.ac.uk/psipred/ FUGUE2 ¾ http://www-cryst.bioc.cam.ac.uk/~fugue/prfsearch.html SAM ¾ http://www.cse.ucsc.edu/research/compbio/HMM-apps/T99- query.html FOLD ¾ http://fold.doe-mbi.ucla.edu/ FFAS/PDBBLAST ¾ http://bioinformatics.burnham-inst.org/ Evolution of the globin family: Evolution of protein structure families 2.5 2.0 1.5 1.0 0.5 Rmsd of backbone atoms in core 0.0 100 50 0 Percent identical residues in core [ Chothia & Lesk (1986) ] Common core = all residues that can be superposed in 3D For proteins > 60% identical residues, the core contains > 90 % of all residues deviating less than 1.0 Å. Similar Sequence Î Similar Structure Homology modeling = Comparative protein modeling = Knowledge-based modeling Idea: Using experimental 3D-structures of related family members (templates) to calculate a model for a new sequence (target). Comparative Modeling Known Structures (Templates) Target Sequence Template Selection Alignment Structure Evaluation & Template - Target Assessment Structure modeling Homology Model(s) Comparative Modeling Known Structures (Templates) Target Sequence Template Selection Alignment Structure Evaluation & Template - Target Assessment • Protein Data Bank PDB http://www.pdb.org Structure modeling Homology Model(s) Î Database of templates • Separate into single chains • Remove bad structures (models) • Create BLASTable database or fold library (profiles, HMMs) Comparative Modeling Known Structures (Templates) Target Sequence Template Selection Template selection: Alignment Structure Evaluation & Template - Target Assessment 1. Sequence Similarity / Fold Structure modeling recognition Homology Model(s) 2. Structure quality (resolution, experimental method) 3. Experimental conditions (ligands and cofactors) Comparative Modeling Known Structures (Templates) Target Sequence Template Selection Alignment Structure Evaluation & Template - Target Assessment • Multiple sequence alignment for pairs > 40% identity Structure modeling or Homology Model(s) • Use structural alignment of templates to guide sequence alignment of target or • Use separate profiles for template and targets Comparative Modeling Known Structures (Templates) Target Sequence Template Selection Alignment Structure Evaluation & Template - Target Assessment • Errors in template selection or Structure modeling alignment result in bad Homology models Model(s) Î iterative cycles of alignment, modeling and evaluation Î Built many models, choose best. Comparative Modeling Known Structures (Templates) Target Sequence Template Selection Alignment Structure Evaluation & Template - Target Assessment I. Manual Model building Structure modeling Homology II. Template based fragment Model(s) assembly – Composer (Sybyl, Tripos) – SWISS-MODEL III. Satisfaction of spatial restraints – Modeller (Insight II, MSI) –CPH-Models I. Manual Modeling [ http://www.expasy.org/spdbv/ ] II. Template based fragment assembly Find structurally conserved core regions II. Template based fragment assembly Build model core ¾ … by averaging core template backbone atoms (weighted by local sequence similarity with the target sequence). Leave non-conserved regions (loops) for later …. II. Template based fragment assembly Loop (insertion) modeling ¾ Use the “spare part” algorithm to find compatible fragments in a Loop- Database, or “ab-initio” rebuilding (e.g. Monte Carlo, MD, GA, etc.) to build missing loops. II. Template based fragment assembly Side Chain placement ¾ Find the most probable side chain conformation, using • homologues structure information • back-bone dependent rotamer libraries • energetic and packing criteria II. Template based fragment assembly Rotamer Libraries ¾ Only a small fraction of all possible side chain conformations is observed in experimental structures ¾ Rotamer libraries provide an ensemble of likely conformations ¾ The propensity of rotamers depends on the backbone geometry: II. Template based fragment assembly Energy minimization ¾ modeling method will produce unfavorable contacts and bonds ¾ Energy minimization is used to • regularize local bond and angle geometry • Relax close contacts and geometric strain ¾ extensive energy minimization will move coordinates away from real structure ⇒ keep it to a minimum ¾ SWISS-MODEL is using GROMOS 96 force field for a steepest descent Homology Modeling III. Satisfaction of Spatial restraints M Q T S A F G T A E III. Satisfaction of Spatial restraints Alignment of target sequence
Recommended publications
  • NIH Public Access Author Manuscript Proteins

    NIH Public Access Author Manuscript Proteins

    NIH Public Access Author Manuscript Proteins. Author manuscript; available in PMC 2015 February 01. NIH-PA Author ManuscriptPublished NIH-PA Author Manuscript in final edited NIH-PA Author Manuscript form as: Proteins. 2014 February ; 82(0 2): 208–218. doi:10.1002/prot.24374. One contact for every twelve residues allows robust and accurate topology-level protein structure modeling David E. Kim, Frank DiMaio, Ray Yu-Ruei Wang, Yifan Song, and David Baker* Department of Biochemistry, University of Washington, Seattle 98195, Washington Abstract A number of methods have been described for identifying pairs of contacting residues in protein three-dimensional structures, but it is unclear how many contacts are required for accurate structure modeling. The CASP10 assisted contact experiment provided a blind test of contact guided protein structure modeling. We describe the models generated for these contact guided prediction challenges using the Rosetta structure modeling methodology. For nearly all cases, the submitted models had the correct overall topology, and in some cases, they had near atomic-level accuracy; for example the model of the 384 residue homo-oligomeric tetramer (Tc680o) had only 2.9 Å root-mean-square deviation (RMSD) from the crystal structure. Our results suggest that experimental and bioinformatic methods for obtaining contact information may need to generate only one correct contact for every 12 residues in the protein to allow accurate topology level modeling. Keywords protein structure prediction; rosetta; comparative modeling; homology modeling; ab initio prediction; contact prediction INTRODUCTION Predicting the three-dimensional structure of a protein given just the amino acid sequence with atomic-level accuracy has been limited to small (<100 residues), single domain proteins.
  • Homology Modeling and Analysis of Structure Predictions of the Bovine Rhinitis B Virus RNA Dependent RNA Polymerase (Rdrp)

    Homology Modeling and Analysis of Structure Predictions of the Bovine Rhinitis B Virus RNA Dependent RNA Polymerase (Rdrp)

    Int. J. Mol. Sci. 2012, 13, 8998-9013; doi:10.3390/ijms13078998 OPEN ACCESS International Journal of Molecular Sciences ISSN 1422-0067 www.mdpi.com/journal/ijms Article Homology Modeling and Analysis of Structure Predictions of the Bovine Rhinitis B Virus RNA Dependent RNA Polymerase (RdRp) Devendra K. Rai and Elizabeth Rieder * Foreign Animal Disease Research Unit, United States Department of Agriculture, Agricultural Research Service, Plum Island Animal Disease Center, Greenport, NY 11944, USA; E-Mail: [email protected] * Author to whom correspondence should be addressed; E-Mail: [email protected]; Tel.: +1-631-323-3177; Fax: +1-631-323-3006. Received: 3 May 2012; in revised form: 3 July 2012 / Accepted: 11 July 2012 / Published: 19 July 2012 Abstract: Bovine Rhinitis B Virus (BRBV) is a picornavirus responsible for mild respiratory infection of cattle. It is probably the least characterized among the aphthoviruses. BRBV is the closest relative known to Foot and Mouth Disease virus (FMDV) with a ~43% identical polyprotein sequence and as much as 67% identical sequence for the RNA dependent RNA polymerase (RdRp), which is also known as 3D polymerase (3Dpol). In the present study we carried out phylogenetic analysis, structure based sequence alignment and prediction of three-dimensional structure of BRBV 3Dpol using a combination of different computational tools. Model structures of BRBV 3Dpol were verified for their stereochemical quality and accuracy. The BRBV 3Dpol structure predicted by SWISS-MODEL exhibited highest scores in terms of stereochemical quality and accuracy, which were in the range of 2Å resolution crystal structures. The active site, nucleic acid binding site and overall structure were observed to be in agreement with the crystal structure of unliganded as well as template/primer (T/P), nucleotide tri-phosphate (NTP) and pyrophosphate (PPi) bound FMDV 3Dpol (PDB, 1U09 and 2E9Z).
  • Comparative Protein Structure Modeling of Genes and Genomes

    Comparative Protein Structure Modeling of Genes and Genomes

    P1: FPX/FOZ/fop/fok P2: FHN/FDR/fgi QC: FhN/fgm T1: FhN January 12, 2001 16:34 Annual Reviews AR098-11 Annu. Rev. Biophys. Biomol. Struct. 2000. 29:291–325 Copyright c 2000 by Annual Reviews. All rights reserved COMPARATIVE PROTEIN STRUCTURE MODELING OF GENES AND GENOMES Marc A. Mart´ı-Renom, Ashley C. Stuart, Andras´ Fiser, Roberto Sanchez,´ Francisco Melo, and Andrej Sˇali Laboratories of Molecular Biophysics, Pels Family Center for Biochemistry and Structural Biology, Rockefeller University, 1230 York Ave, New York, NY 10021; e-mail: [email protected] Key Words protein structure prediction, fold assignment, alignment, homology modeling, model evaluation, fully automated modeling, structural genomics ■ Abstract Comparative modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more pro- teins of known structure (templates). The prediction process consists of fold assign- ment, target–template alignment, model building, and model evaluation. The number of protein sequences that can be modeled and the accuracy of the predictions are in- creasing steadily because of the growth in the number of known protein structures and because of the improvements in the modeling software. Further advances are nec- essary in recognizing weak sequence–structure similarities, aligning sequences with structures, modeling of rigid body shifts, distortions, loops and side chains, as well as detecting errors in a model. Despite these problems, it is currently possible to model with useful accuracy significant parts of approximately one third of all known protein sequences. The use of individual comparative models in biology is already rewarding and increasingly widespread.
  • FORCE FIELDS for PROTEIN SIMULATIONS by JAY W. PONDER

    FORCE FIELDS for PROTEIN SIMULATIONS by JAY W. PONDER

    FORCE FIELDS FOR PROTEIN SIMULATIONS By JAY W. PONDER* AND DAVIDA. CASEt *Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, 51. Louis, Missouri 63110, and tDepartment of Molecular Biology, The Scripps Research Institute, La Jolla, California 92037 I. Introduction. ...... .... ... .. ... .... .. .. ........ .. .... .... ........ ........ ..... .... 27 II. Protein Force Fields, 1980 to the Present.............................................. 30 A. The Am.ber Force Fields.............................................................. 30 B. The CHARMM Force Fields ..., ......... 35 C. The OPLS Force Fields............................................................... 38 D. Other Protein Force Fields ....... 39 E. Comparisons Am.ong Protein Force Fields ,... 41 III. Beyond Fixed Atomic Point-Charge Electrostatics.................................... 45 A. Limitations of Fixed Atomic Point-Charges ........ 46 B. Flexible Models for Static Charge Distributions.................................. 48 C. Including Environmental Effects via Polarization................................ 50 D. Consistent Treatment of Electrostatics............................................. 52 E. Current Status of Polarizable Force Fields........................................ 57 IV. Modeling the Solvent Environment .... 62 A. Explicit Water Models ....... 62 B. Continuum Solvent Models.......................................................... 64 C. Molecular Dynamics Simulations with the Generalized Born Model........
  • Structural Bioinformatics

    Structural Bioinformatics

    Structural Bioinformatics Sanne Abeln K. Anton Feenstra Centre for Integrative Bioinformatics (IBIVU), and Department of Computer Science, Vrije Universiteit, De Boelelaan 1081A, 1081 HV Amsterdam, Netherlands December 4, 2017 arXiv:1712.00425v1 [q-bio.BM] 1 Dec 2017 2 Abstract This chapter deals with approaches for protein three-dimensional structure prediction, starting out from a single input sequence with unknown struc- ture, the `query' or `target' sequence. Both template based and template free modelling techniques are treated, and how resulting structural models may be selected and refined. We give a concrete flowchart for how to de- cide which modelling strategy is best suited in particular circumstances, and which steps need to be taken in each strategy. Notably, the ability to locate a suitable structural template by homology or fold recognition is crucial; without this models will be of low quality at best. With a template avail- able, the quality of the query-template alignment crucially determines the model quality. We also discuss how other, courser, experimental data may be incorporated in the modelling process to alleviate the problem of missing template structures. Finally, we discuss measures to predict the quality of models generated. Structural Bioinformatics c Abeln & Feenstra, 2014-2017 Contents 7 Strategies for protein structure model generation5 7.1 Template based protein structure modelling . .7 7.1.1 Homology based Template Finding . .7 7.1.2 Fold recognition . .8 7.1.3 Generating the target-template alignment . 10 7.1.4 Generating a model . 11 7.1.5 Loop or missing substructure modelling . 12 7.2 Template-free protein structure modelling .
  • Ten Quick Tips for Homology Modeling of High-Resolution Protein 3D Structures

    Ten Quick Tips for Homology Modeling of High-Resolution Protein 3D Structures

    PLOS COMPUTATIONAL BIOLOGY EDUCATION Ten quick tips for homology modeling of high- resolution protein 3D structures 1,2 1,2 1,2 Yazan HaddadID , Vojtech AdamID , Zbynek HegerID * 1 Department of Chemistry and Biochemistry, Mendel University in Brno, Brno, Czech Republic, 2 Central European Institute of Technology, Brno University of Technology, Brno, Czech Republic * [email protected] Author summary The purpose of this quick guide is to help new modelers who have little or no background in comparative modeling yet are keen to produce high-resolution protein 3D structures a1111111111 for their study by following systematic good modeling practices, using affordable personal a1111111111 computers or online computational resources. Through the available experimental 3D- a1111111111 structure repositories, the modeler should be able to access and use the atomic coordinates a1111111111 for building homology models. We also aim to provide the modeler with a rationale a1111111111 behind making a simple list of atomic coordinates suitable for computational analysis abiding to principles of physics (e.g., molecular mechanics). Keeping that objective in mind, these quick tips cover the process of homology modeling and some postmodeling computations such as molecular docking and molecular dynamics (MD). A brief section OPEN ACCESS was left for modeling nonprotein molecules, and a short case study of homology modeling Citation: Haddad Y, Adam V, Heger Z (2020) Ten is discussed. quick tips for homology modeling of high- resolution protein 3D structures. PLoS Comput Biol 16(4): e1007449. https://doi.org/10.1371/ journal.pcbi.1007449 Introduction Editor: Francis Ouellette, University of Toronto, Protein 3D-structure folding from a simple sequence of amino acids was seen as a very difficult CANADA problem in the past.
  • Methods for the Refinement of Protein Structure 3D Models

    Methods for the Refinement of Protein Structure 3D Models

    International Journal of Molecular Sciences Review Methods for the Refinement of Protein Structure 3D Models Recep Adiyaman and Liam James McGuffin * School of Biological Sciences, University of Reading, Reading RG6 6AS, UK; [email protected] * Correspondence: l.j.mcguffi[email protected]; Tel.: +44-0-118-378-6332 Received: 2 April 2019; Accepted: 7 May 2019; Published: 1 May 2019 Abstract: The refinement of predicted 3D protein models is crucial in bringing them closer towards experimental accuracy for further computational studies. Refinement approaches can be divided into two main stages: The sampling and scoring stages. Sampling strategies, such as the popular Molecular Dynamics (MD)-based protocols, aim to generate improved 3D models. However, generating 3D models that are closer to the native structure than the initial model remains challenging, as structural deviations from the native basin can be encountered due to force-field inaccuracies. Therefore, different restraint strategies have been applied in order to avoid deviations away from the native structure. For example, the accurate prediction of local errors and/or contacts in the initial models can be used to guide restraints. MD-based protocols, using physics-based force fields and smart restraints, have made significant progress towards a more consistent refinement of 3D models. The scoring stage, including energy functions and Model Quality Assessment Programs (MQAPs) are also used to discriminate near-native conformations from non-native conformations. Nevertheless, there are often very small differences among generated 3D models in refinement pipelines, which makes model discrimination and selection problematic. For this reason, the identification of the most native-like conformations remains a major challenge.
  • Exercise 6: Homology Modeling

    Exercise 6: Homology Modeling

    Workshop in Computational Structural Biology (81813), Spring 2019 Exercise 6: Homology Modeling Homology modeling, sometimes referred to as comparative modeling, is a commonly-used alternative to ab-initio modeling (which requires zero knowledge about the native structure, but also overlooks much of our understanding of structural diversity). Homology modeling leverages the massive amount of structural knowledge available today to guide our search of the native structure. Practically, it assumes that sequence similarity implies structural similarity, i.e. assumes a protein homologous to our query protein cannot be very different from it. Therefore, we can roughly "thread" the query sequence onto the template (homologue) structure, and exploit existing knowledge to guide our modeling. Contents Homology modeling with Rosetta 1 Overview 1 Template Selection 1 Loop Modeling 2 Running the full simulation 4 Bonus section! Homology modeling with automated/pre-calculated servers 6 MODBASE 6 I-Tasser 6 Homology modeling with Rosetta Overview In this section, we will model the structure of the Erbin PDZ domain, using homologue template structures, and analyze the quality of the models. The structure of this protein has been solved (PDB ID 1MFG), so we will be able to compare prediction with experiment. As taught in class, classical homology modeling in Rosetta is composed of four phases: 1. Template selection 2. Template-query alignment 3. Rebuilding loop regions 4. Full-atom refinement of the model Our model protein is the Erbin PDZ domain: Erbin contains a class-I PDZ domain that binds to the C-terminal region of the receptor tyrosine kinase ErbB2. PDB ID 1MFG is the crystal structure of the human Erbin PDZ bound to the peptide EYLGLDVPV corresponding to the C-terminal residues of human ErbB2.
  • Homology Modeling

    Homology Modeling

    25 HOMOLOGY MODELING Elmar Krieger, Sander B. Nabuurs, and Gert Vriend The ultimate goal of protein modeling is to predict a structure from its sequence with an accuracy that is comparable to the best results achieved experimentally. This would allow users to safely use rapidly generated in silico protein models in all the contexts where today only experimental structures provide a solid basis: structure-based drug design, analysis of protein function, interactions, antigenic behavior, and rational design of proteins with increased stability or novel functions. In addition, protein modeling is the only way to obtain structural information if experimental techniques fail. Many proteins are simply too large for NMR analysis and cannot be crystallized for X-ray diffraction. Among the three major approaches to three-dimensional (3D) structure prediction described in this and the following two chapters, homology modeling is the easiest one. It is based on two major observations: 1. The structure of a protein is uniquely determined by its amino acid sequence (Epstain, Goldberger, and Anfinsen, 1963). Knowing the sequence should, at least in theory, suffice to obtain the structure. 2. During evolution, the structure is more stable and changes much slower than the associated sequence, so that similar sequences adopt practically identical structures, and distantly related sequences still fold into similar structures. This relationship was first identified by Chothia and Lesk (1986) and later quantified by Sander and Schneider (1991). Thanks to the exponential growth of the Protein Data Bank (PDB), Rost (1999) could recently derive a precise limit for this rule, shown in Figure 25.1.
  • Homology Modeling and Optimized Expression of Truncated IK Protein, Tik, As an Anti-Inflammatory Peptide

    Homology Modeling and Optimized Expression of Truncated IK Protein, Tik, As an Anti-Inflammatory Peptide

    molecules Article Homology Modeling and Optimized Expression of Truncated IK Protein, tIK, as an Anti-Inflammatory Peptide Yuyoung Song, Minseon Kim and Yongae Kim * Department of Chemistry, Hankuk University of Foreign Studies, Yongin 17035, Korea; [email protected] (Y.S.); [email protected] (M.K.) * Correspondence: [email protected]; Tel.: +82-2-2173-8705; Fax: +82-31-330-4566 Academic Editors: Manuela Pintado, Ezequiel Coscueta and María Emilia Brassesco Received: 11 August 2020; Accepted: 21 September 2020; Published: 23 September 2020 Abstract: Rheumatoid arthritis, caused by abnormalities in the autoimmune system, affects about 1% of the population. Rheumatoid arthritis does not yet have a proper treatment, and current treatment has various side effects. Therefore, there is a need for a therapeutic agent that can effectively treat rheumatoid arthritis without side effects. Recently, research on pharmaceutical drugs based on peptides has been actively conducted to reduce negative effects. Because peptide drugs are bio-friendly and bio-specific, they are characterized by no side effects. Truncated-IK (tIK) protein, a fragment of IK protein, has anti-inflammatory effects, including anti-rheumatoid arthritis activity. This study focused on the fact that tIK protein phosphorylates the interleukin 10 receptor. Through homology modeling with interleukin 10, short tIK epitopes were proposed to find the essential region of the sequence for anti-inflammatory activity. TH17 differentiation experiments were also performed with the proposed epitope. A peptide composed of 18 amino acids with an anti-inflammatory effect was named tIK-18mer. Additionally, a tIK 9-mer and a 14-mer were also found. The procedure for the experimental expression of the proposed tIK series (9-mer, 14-mer, and 18-mer) using bacterial strain is discussed.
  • Homology Modeling and Docking Studies of a 9-Fatty Acid Desaturase

    Homology Modeling and Docking Studies of a 9-Fatty Acid Desaturase

    Homology modeling and docking studies of a 19-fatty acid desaturase from a Cold-tolerant Pseudomonas sp. AMS8 Lawal Garba1,2,3, Mohamad Ariff Mohamad Yussoff4, Khairul Bariyyah Abd Halim4, Siti Nor Hasmah Ishak1, Mohd Shukuri Mohamad Ali1,5, Siti Nurbaya Oslan1,5 and Raja Noor Zaliha Raja Abd. Rahman1,2 1 Enzyme and Microbial Technology Research Centre, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia, Serdang, Selangor, Malaysia 2 Department of Microbiology, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia, Serdang, Selangor, Malaysia 3 Department of Microbiology, Faculty of Science, Gombe State University, Gombe, Gombe State, Nigeria 4 Department of Biotechnology, Kulliyyah of Science, International Islamic University Malaysia, Kuantan, Pahang Darul Makmur, Malaysia 5 Department of Biochemistry, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia, Serdang, Selangor, Malaysia ABSTRACT Membrane-bound fatty acid desaturases perform oxygenated desaturation reactions to insert double bonds within fatty acyl chains in regioselective and stereoselective manners. The 19-fatty acid desaturase strictly creates the first double bond between C9 and 10 positions of most saturated substrates. As the three-dimensional structures of the bacterial membrane fatty acid desaturases are not available, relevant information about the enzymes are derived from their amino acid sequences, site-directed mutagenesis and domain swapping in similar membrane-bound desaturases. The cold-tolerant Pseudomonas sp. AMS8 was found to produce high amount of monounsaturated fatty acids at low temperature. Subsequently, an active 19-fatty acid desaturase was isolated and functionally expressed in Escherichia coli. In this paper we report homology modeling and docking studies of a 19-fatty acid desaturase from a Cold- Submitted 8 May 2017 tolerant Pseudomonas sp.
  • A Thesis Entitled Homology-Based Structural Prediction of the Binding

    A Thesis Entitled Homology-Based Structural Prediction of the Binding

    A Thesis entitled Homology-based Structural Prediction of the Binding Interface Between the Tick-Borne Encephalitis Virus Restriction Factor TRIM79 and the Flavivirus Non-structural 5 Protein. by Heather Piehl Brown Submitted to the Graduate Faculty as partial fulfillment of the requirements for the Master of Science Degree in Biomedical Science _________________________________________ R. Travis Taylor, PhD, Committee Chair _________________________________________ Xiche Hu, PhD, Committee Member _________________________________________ Robert M. Blumenthal, PhD, Committee Member _________________________________________ Amanda Bryant-Friedrich, PhD, Dean College of Graduate Studies The University of Toledo December 2016 Copyright 2016, Heather Piehl Brown This document is copyrighted material. Under copyright law, no parts of this document may be reproduced without the expressed permission of the author. An Abstract of Homology-based Structural Prediction of the Binding Interface Between the Tick-Borne Encephalitis Virus Restriction Factor TRIM79 and the Flavivirus Non-structural 5 Protein. by Heather P. Brown Submitted to the Graduate Faculty as partial fulfillment of the requirements for the Master of Science Degree in Biomedical Sciences The University of Toledo December 2016 The innate immune system of the host is vital for determining the outcome of virulent virus infections. Successful immune responses depend on detecting the specific virus, through interactions of the proteins or genomic material of the virus and host factors. We previously identified a host antiviral protein of the tripartite motif (TRIM) family, TRIM79, which plays a critical role in the antiviral response to flaviviruses. The Flavivirus genus includes many arboviruses that are significant human pathogens, such as tick-borne encephalitis virus (TBEV) and West Nile virus (WNV).