Computational Protein Design: Assessment And

Total Page:16

File Type:pdf, Size:1020Kb

Computational Protein Design: Assessment And COMPUTATIONAL PROTEIN DESIGN: ASSESSMENT AND APPLICATIONS Zhixiu Li Submitted to the faculty of University Graduate School in partial fulfillment of the requirements for the degree Doctor of Philosophy in the School of Informatics and Computing, Indiana University May 2015 Accepted by the Graduate Faculty, Indiana University, in partial fulfillment of the requirements for the degree of Doctor of Philosophy. ________________________________ Yunlong Liu, PhD, Chair ________________________________ Huanmei Wu, PhD Doctoral Committee ________________________________ Samy Meroueh, PhD November 24, 2014 ________________________________ Yaoqi Zhou, PhD ii © 2015 Zhixiu Li iii DEDICATION Dedicated to my family and friends. iv ACKNOWLEDGEMENTS I would like to thank my research committee members, my family and friends for their help in my Ph.D study. Foremost, I would like to express my sincere gratitude to my advisor Prof Yaoqi Zhou. He has provided me excellent research advice, great insights, enthusiasm and encouragement throughout all my research projects. His guidance and support help all the time in research and dissertation writing. I would never have been able to finish my dissertation without his help. Besides my advisor, I would like to extend my sincerest thanks and appreciation the rest of my research committee: Profs Yunlong Liu, Samy Meroueh, and Huanmei Wu, for all the useful discussions, encouragement, and insightful comments. I also want to thank my colleagues, collaborators, teachers and classmates in various projects. I shared a great time with them while learning from them. They are Profs Qizhuang Ye, Song Liu, Shiaofen Fang, Mohammad Al Hasan, James H. Hill, Jihua Wang, Drs Yuedong Yang, Huiying Zhao, Jian Zhan, Tuo Zhang, Liang Dai, Eshel Faraggi, Wenchang Xiang, Hui Huang, Md Tamjdul Hogue, Mr. Arthur Liu, Mr. Haoyu Cheng, Mr. Liang-Chin Huang, and others. Last but not the least, I would like to thank my family for their love and unconditional support throughout my life. v Zhixiu Li COMPUTATIONAL PROTEIN DESIGN: ASSESSMENT AND APPLICATIONS Computational protein design aims at designing amino acid sequences that can fold into a target structure and perform a desired function. Many computational design methods have been developed and their applications have been successful during past two decades. However, the success rate of protein design remains too low to be of a useful tool by biochemists whom are not an expert of computational biology. In this dissertation, we first developed novel computational assessment techniques to assess several state-of-the- art computational techniques. We found that significant progresses were made in several important measures by two new scoring functions from RosettaDesign and from OSCAR-design, respectively. We also developed the first machine-learning technique called SPIN that predicts a sequence profile compatible to a given structure with a novel nonlocal energy-based feature. The accuracy of predicted sequences is comparable to RosettaDesign in term of sequence identity to wild type sequences. In the last two application chapters, we have designed self-inhibitory peptides of Escherichia coli methionine aminopeptidase (EcMetAP) and de novo designed barstar. Several peptides were confirmed inhibition of EcMetAP at the micromole-range 50% inhibitory concentration. Meanwhile, the assessment of designed barstar sequences indicates the improvement of OSCAR-design over RosettaDesign. Yunlong Liu, PhD, Chair vi Contents List of Tables .................................................................................................................... xi List of Figures ................................................................................................................. xiii List of Equations ............................................................................................................. xv Chapter 1 Introduction ................................................................................................. 1 1.1 Protein: From Sequence to Structure ................................................................... 1 1.2 Computational Protein Design ............................................................................. 3 1.2.1 Searching Algorithm ..................................................................................... 5 1.2.2 Energy Function ............................................................................................ 6 1.3 Overview of the Dissertation ............................................................................... 7 Chapter 2 Energy Functions in De Novo Protein Design .......................................... 9 2.1 Abstract ................................................................................................................ 9 2.2 Introduction .......................................................................................................... 9 2.3 De Novo Designed and Structurally Validated Proteins .................................... 12 2.4 Origin of Low Success Rate in Protein Design .................................................. 15 2.5 Energy Function in Protein Design .................................................................... 18 2.5.1 RosettaDesign Energy Function ................................................................. 18 2.5.2 EGAD Energy Function .............................................................................. 20 2.5.3 Liang-Grishin Energy Function .................................................................. 21 2.5.4 Balancing Nonlocal and Local Interactions ................................................ 22 2.5.5 RosettaDesign-SR Energy Function ........................................................... 22 2.6 Computational Assessment of Designed Proteins .............................................. 23 2.6.1 Sequence Assessment: Native Sequence Recovery .................................... 25 vii 2.6.2 Local Assessment: Secondary Structure Recovery ..................................... 27 2.6.3 Local Assessment: Intrinsic Disorder ......................................................... 28 2.6.4 Surface Assessment: Solvent Accessibility Recovery ................................ 28 2.6.5 Surface Assessment: Hydrophobic Patch ................................................... 29 2.6.6 Packing Assessment: Total Accessible Surface Area ................................. 30 2.6.7 Global Structure Assessment ...................................................................... 31 2.6.8 Summary ..................................................................................................... 33 2.7 Community-wide Scoring Function Assessment ............................................... 34 2.8 Current Challenges and Future Prospects .......................................................... 35 Chapter 3 Assessment of Novel Energy Functions for Design ................................ 38 3.1 Introduction ........................................................................................................ 38 3.2 Results ................................................................................................................ 40 3.2.1 Sequence Assessment: Native Sequence Recovery .................................... 41 3.2.2 Local Assessment: Secondary Structure Recovery ..................................... 44 3.2.3 Local Assessment: Predicted Intrinsic Disorder and Low Complexity Residues ..................................................................................................................... 45 3.2.4 Surface Assessment: Solvent Accessibility Recovery ................................ 47 3.2.5 Surface Assessment: Hydrophobic Patch ................................................... 50 3.2.6 Packing Assessment: Total Accessible Surface Area ................................. 51 3.2.7 Global Structure Assessment ...................................................................... 54 3.3 Conclusion .......................................................................................................... 57 viii Chapter 4 Direct Prediction of the Profile of Sequences Compatible to a Protein Structure by Neural Networks with Fragment-Based Local and Energy-Based Nonlocal Profiles .................................................................................... 59 4.1 Abstract .............................................................................................................. 59 4.2 Introduction ........................................................................................................ 60 4.3 Methods .............................................................................................................. 62 4.3.1 Datasets ....................................................................................................... 62 4.3.2 Neural Network ........................................................................................... 64 4.3.3 Input Features.............................................................................................. 65 4.3.4 Output Layer ............................................................................................... 66 4.3.5 Ten-fold Cross Validation and Independent Test ....................................... 67 4.3.6 Performance Evaluation .............................................................................. 67 4.3.7 RosettaDesign ............................................................................................
Recommended publications
  • Molecular Dynamics Simulations in Drug Discovery and Pharmaceutical Development
    processes Review Molecular Dynamics Simulations in Drug Discovery and Pharmaceutical Development Outi M. H. Salo-Ahen 1,2,* , Ida Alanko 1,2, Rajendra Bhadane 1,2 , Alexandre M. J. J. Bonvin 3,* , Rodrigo Vargas Honorato 3, Shakhawath Hossain 4 , André H. Juffer 5 , Aleksei Kabedev 4, Maija Lahtela-Kakkonen 6, Anders Støttrup Larsen 7, Eveline Lescrinier 8 , Parthiban Marimuthu 1,2 , Muhammad Usman Mirza 8 , Ghulam Mustafa 9, Ariane Nunes-Alves 10,11,* , Tatu Pantsar 6,12, Atefeh Saadabadi 1,2 , Kalaimathy Singaravelu 13 and Michiel Vanmeert 8 1 Pharmaceutical Sciences Laboratory (Pharmacy), Åbo Akademi University, Tykistökatu 6 A, Biocity, FI-20520 Turku, Finland; ida.alanko@abo.fi (I.A.); rajendra.bhadane@abo.fi (R.B.); parthiban.marimuthu@abo.fi (P.M.); atefeh.saadabadi@abo.fi (A.S.) 2 Structural Bioinformatics Laboratory (Biochemistry), Åbo Akademi University, Tykistökatu 6 A, Biocity, FI-20520 Turku, Finland 3 Faculty of Science-Chemistry, Bijvoet Center for Biomolecular Research, Utrecht University, 3584 CH Utrecht, The Netherlands; [email protected] 4 Swedish Drug Delivery Forum (SDDF), Department of Pharmacy, Uppsala Biomedical Center, Uppsala University, 751 23 Uppsala, Sweden; [email protected] (S.H.); [email protected] (A.K.) 5 Biocenter Oulu & Faculty of Biochemistry and Molecular Medicine, University of Oulu, Aapistie 7 A, FI-90014 Oulu, Finland; andre.juffer@oulu.fi 6 School of Pharmacy, University of Eastern Finland, FI-70210 Kuopio, Finland; maija.lahtela-kakkonen@uef.fi (M.L.-K.); tatu.pantsar@uef.fi
    [Show full text]
  • Elephantid Genomes Reveal the Molecular Bases of Woolly Mammoth Adaptations to the Arctic
    Article Elephantid Genomes Reveal the Molecular Bases of Woolly Mammoth Adaptations to the Arctic Graphical Abstract Authors Vincent J. Lynch, Oscar C. Bedoya-Reina, Aakrosh Ratan, ..., George H. Perry, Webb Miller, Stephan C. Schuster Correspondence [email protected] (V.J.L.), [email protected] (W.M.) In Brief Lynch et al. sequence complete genomes from three Asian elephants and two woolly mammoths and identify amino acid changes unique to woolly mammoths. Woolly-mammoth-specific amino acid changes underlie cold- adapted traits in mammoths, including small ears, thick fur, and altered temperature sensation. Highlights d Complete genomes of three Asian elephants and two woolly mammoths were sequenced d Mammoth-specific amino acid changes were found in 1,642 protein-coding genes d Genes with mammoth-specific changes are associated with adaptation to extreme cold d An amino acid change in TRPV3 may have altered temperature sensation in mammoths Lynch et al., 2015, Cell Reports 12, 217–228 July 14, 2015 ª2015 The Authors http://dx.doi.org/10.1016/j.celrep.2015.06.027 Cell Reports Article Elephantid Genomes Reveal the Molecular Bases of Woolly Mammoth Adaptations to the Arctic Vincent J. Lynch,1,* Oscar C. Bedoya-Reina,2,4 Aakrosh Ratan,2,5 Michael Sulak,1 Daniela I. Drautz-Moses,2,6 George H. Perry,3 Webb Miller,2,* and Stephan C. Schuster2,6 1Department of Human Genetics, The University of Chicago, 920 East 58th Street, CLSC 319C, Chicago, IL 60637, USA 2Center for Comparative Genomics and Bioinformatics, Pennsylvania State University, 506B
    [Show full text]
  • The TIM Barrel Fold Nagarajan D
    The TIM barrel fold Nagarajan D. and Nanajkar N. Comments and corrections: Line 10: fix “αhelices” in “α-helices”. Lines 11-12: C-terminal loops are important for catalytic activity, while N-terminal loops are important for the stability of the TIM-barrels. This should be mentioned. Line 14: The reference #7 is not related to the statement. Line 14: There is a new EC classe (EC.7, translocases). Change “5 of 6” in “5 of 7”. Lines 26-27: It is not correct to state that the shear number of 8 for the TIM-barrels is due to “their staggered nature”. Most of the β-barrels have a staggered nature, but their shear number is not 8. Line 27: The reference #2 is imprecise. Wierenga did not defined himself the shear number of TIM-barrel proteins. Please check the 2 papers of Murzin AG, 1994, “Principle determining the structure of β-sheet barrels in proteins,” I and II, and the paper of Liu W, 1998, “Shear numbers of protein β-barrels: definition refinements and statistics”. Line 29: Again, it is not correct to state that the 4-fold geometric symmetry depends on the stagger. Since the number of strands (n) is equal to the Shear number (S), side-chains point alternatively towards the pore and the core, giving a 4-fold symmetry. Line 37: “historically” is a bit exaggerated for a reference dated 2015, especially if it comes from the author itself. Find a true historic reference, or just mention that you defined the regions “core” and “pore”. Line 43: “Consequently” is misleading.
    [Show full text]
  • Smurflite: Combining Simplified Markov Random Fields With
    SMURFLite: combining simplified Markov random fields with simulated evolution improves remote homology detection for beta-structural proteins into the twilight zone Noah M. Daniels 1, Raghavendra Hosur 2, Bonnie Berger 2∗, and Lenore J. Cowen 1∗ 1Department of Computer Science, Tufts University, Medford, MA 02155 2Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139 ABSTRACT are limited in their power to recognize remote homologs because of Motivation: One of the most successful methods to date for their inability to model statistical dependencies between amino-acid recognizing protein sequences that are evolutionarily related has residues that are close in space but far apart in sequence (Lifson and been profile Hidden Markov Models (HMMs). However, these models Sander (1980); Zhu and Braun (1999); Olmea et al. (1999); Cowen do not capture pairwise statistical preferences of residues that are et al. (2002); Steward and Thorton (2002)). hydrogen bonded in beta sheets. These dependencies have been For this reason, many have suggested (White et al. (1994); partially captured in the HMM setting by simulated evolution in the Lathrop and Smith (1996); Thomas et al. (2008); Liu et al. (2009); training phase and can be fully captured by Markov Random Fields Menke et al. (2010); Peng and Xu (2011)) that more powerful (MRFs). However, the MRFs can be computationally prohibitive when Markov Random Fields (MRFs) be used. MRFs employ an auxiliary beta strands are interleaved in complex topologies. dependency graph which allows them to model more complex We introduce SMURFLite, a method that combines both simplified statistical dependencies, including statistical dependencies that Markov Random Fields and simulated evolution to substantially occur between amino-acid residues that are hydrogen bonded in beta improve remote homology detection for beta structures.
    [Show full text]
  • Open Ratulchowdhury Etd.Pdf
    The Pennsylvania State University The Graduate School COMPUTATIONAL REDESIGN OF CHANNEL PROTEINS, ENZYMES, AND ANTIBODIES A Dissertation in Chemical Engineering by Ratul Chowdhury © 2020 Ratul Chowdhury Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy May 2020 The dissertation of Ratul Chowdhury was reviewed and approved* by the following: Costas D. Maranas Donald B. Broughton Professor of Chemical Engineering Dissertation Advisor Chair of Committee Manish Kumar Assistant Professor in Chemical Engineering Michael Janik Professor in Chemical Engineering Reka Albert Professor in Physics Phillip E. Savage Department Head and Graduate Program Chair Professor in Chemical Engineering ii ABSTRACT Nature relies on a wide range of enzymes with specific biocatalytic roles to carry out much of the chemistry needed to sustain life. Proteins catalyze the interconversion of a vast array of molecules with high specificity - from molecular nitrogen fixation to the synthesis of highly specialized hormones, quorum-sensing molecules, defend against disease causing foreign proteins, and maintain osmotic balance using transmembrane channel proteins. Ever increasing emphasis on renewable sources for energy and waste minimization has turned biocatalytic proteins (enzymes) into key industrial workhorses for targeted chemical conversions. Modern protein engineering is central to not only food and beverage manufacturing processes but are also often ingredients in countless consumer product formulations such as proteolytic enzymes in detergents and amylases and peptide-based therapeutics in the form of designed antibodies to combat neurodegenerative diseases (such as Parkinson’s, Alzheimer’s) and outbreaks of Zika and Ebola virus. However, successful protein design or tweaking an existing protein for a desired functionality has remained a constant challenge.
    [Show full text]
  • Recent Advances in Automated Protein Design and Its Future
    EXPERT OPINION ON DRUG DISCOVERY 2018, VOL. 13, NO. 7, 587–604 https://doi.org/10.1080/17460441.2018.1465922 REVIEW Recent advances in automated protein design and its future challenges Dani Setiawana, Jeffrey Brenderb and Yang Zhanga,c aDepartment of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA; bRadiation Biology Branch, Center for Cancer Research, National Cancer Institute – NIH, Bethesda, MD, USA; cDepartment of Biological Chemistry, University of Michigan, Ann Arbor, MI, USA ABSTRACT ARTICLE HISTORY Introduction: Protein function is determined by protein structure which is in turn determined by the Received 25 October 2017 corresponding protein sequence. If the rules that cause a protein to adopt a particular structure are Accepted 13 April 2018 understood, it should be possible to refine or even redefine the function of a protein by working KEYWORDS backwards from the desired structure to the sequence. Automated protein design attempts to calculate Protein design; automated the effects of mutations computationally with the goal of more radical or complex transformations than protein design; ab initio are accessible by experimental techniques. design; scoring function; Areas covered: The authors give a brief overview of the recent methodological advances in computer- protein folding; aided protein design, showing how methodological choices affect final design and how automated conformational search protein design can be used to address problems considered beyond traditional protein engineering, including the creation of novel protein scaffolds for drug development. Also, the authors address specifically the future challenges in the development of automated protein design. Expert opinion: Automated protein design holds potential as a protein engineering technique, parti- cularly in cases where screening by combinatorial mutagenesis is problematic.
    [Show full text]
  • 2020 Program Book
    PROGRAM BOOK Note that TAGC was cancelled and held online with a different schedule and program. This document serves as a record of the original program designed for the in-person meeting. April 22–26, 2020 Gaylord National Resort & Convention Center Metro Washington, DC TABLE OF CONTENTS About the GSA ........................................................................................................................................................ 3 Conference Organizers ...........................................................................................................................................4 General Information ...............................................................................................................................................7 Mobile App ....................................................................................................................................................7 Registration, Badges, and Pre-ordered T-shirts .............................................................................................7 Oral Presenters: Speaker Ready Room - Camellia 4.......................................................................................7 Poster Sessions and Exhibits - Prince George’s Exhibition Hall ......................................................................7 GSA Central - Booth 520 ................................................................................................................................8 Internet Access ..............................................................................................................................................8
    [Show full text]
  • Protein Design Is NP-Hard
    Protein Engineering vol.15 no.10 pp.779–782, 2002 Protein Design is NP-hard 1,2 3 Niles A.Pierce and Erik Winfree ri ∈ Ri at each position that minimizes the sum of the pairwise interaction energies between all positions: 1Applied and Computational Mathematics and 3Computer Science and Computation and Neural Systems,California Institute of Technology, ¥ Pasadena, CA 91125, USA Etotal Σ Σ E(ri,rj) Ͻ 2To whom correspondence should be addressed. i j, j i E-mail: [email protected] A solution to this problem is called a global minimum Biologists working in the area of computational protein energy conformation (GMEC). There is currently no known design have never doubted the seriousness of the algo- algorithm for identifying a GMEC solution efficiently (in a rithmic challenges that face them in attempting in silico specific sense to be defined shortly). Given the failure to sequence selection. It turns out that in the language of the identify such an approach, it is worth attempting to discern computer science community, this discrete optimization whether this is even a reasonable goal. Fortunately, a beautiful problem is NP-hard. The purpose of this paper is to theory from computer science allows us to classify the tracta- explain the context of this observation, to provide a simple bility of our problem in terms of other discrete optimization illustrative proof and to discuss the implications for future problems (Garey and Johnson, 1979; Papadimitriou and progress on algorithms for computational protein design. Steiglitz, 1982). This theory does not apply directly to the Keywords: complexity/design/NP-complete/NP-hard/proteins optimization form, but instead to a related decision form that has a ‘yes/no’ answer.
    [Show full text]
  • Functional Analysis of Somatic Mutations Affecting Receptor Tyrosine Kinase Family in Metastatic Colorectal Cancer
    Author Manuscript Published OnlineFirst on March 29, 2019; DOI: 10.1158/1535-7163.MCT-18-0582 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited. Functional analysis of somatic mutations affecting receptor tyrosine kinase family in metastatic colorectal cancer Leslie Duplaquet1, Martin Figeac2, Frédéric Leprêtre2, Charline Frandemiche3,4, Céline Villenet2, Shéhérazade Sebda2, Nasrin Sarafan-Vasseur5, Mélanie Bénozène1, Audrey Vinchent1, Gautier Goormachtigh1, Laurence Wicquart6, Nathalie Rousseau3, Ludivine Beaussire5, Stéphanie Truant7, Pierre Michel8, Jean-Christophe Sabourin9, Françoise Galateau-Sallé10, Marie-Christine Copin1,6, Gérard Zalcman11, Yvan De Launoit1, Véronique Fafeur1 and David Tulasne1 1 Univ. Lille, CNRS, Institut Pasteur de Lille, UMR 8161 - M3T – Mechanisms of Tumorigenesis and Target Therapies, F-59000 Lille, France. 2 Univ. Lille, Plateau de génomique fonctionnelle et structurale, CHU Lille, F-59000 Lille, France 3 TCBN - Tumorothèque Caen Basse-Normandie, F-14000 Caen, France. 4 Réseau Régional de Cancérologie – OncoBasseNormandie – F14000 Caen – France. 5 Normandie Univ, UNIROUEN, Inserm U1245, IRON group, Rouen University Hospital, Normandy Centre for Genomic and Personalized Medicine, F-76000 Rouen, France. 6 Tumorothèque du C2RC de Lille, F-59037 Lille, France. 7 Department of Digestive Surgery and Transplantation, CHU Lille, Univ Lille, 2 Avenue Oscar Lambret, 59037, Lille Cedex, France. 8 Department of hepato-gastroenterology, Rouen University Hospital, Normandie Univ, UNIROUEN, Inserm U1245, IRON group, F-76000 Rouen, France. 9 Department of Pathology, Normandy University, INSERM 1245, Rouen University Hospital, F 76 000 Rouen, France. 10 Department of Pathology, MESOPATH-MESOBANK, Centre León Bérard, Lyon, France. 11 Thoracic Oncology Department, CIC1425/CLIP2 Paris-Nord, Hôpital Bichat-Claude Bernard, Paris, France.
    [Show full text]
  • Commentary Rational Protein Design: Combining Theory and Experiment
    Proc. Natl. Acad. Sci. USA Vol. 94, pp. 10015–10017, September 1997 Commentary Rational protein design: Combining theory and experiment H. W. Hellinga* Department of Biochemistry, Duke University Medical Center, Durham, NC 27710 The rational design of protein structure and function is rapidly chosen a priori, kept fixed, and redecorated with different emerging as a powerful approach to test general theories in amino acid sequences that are predicted to be structurally protein chemistry (1). De novo creation of a protein or an compatible with that fold. This ‘‘inverse folding’’ approach (12) active site requires that all the necessary interactions are therefore removes the backbone conformational degrees of provided. The design approach is therefore a way to test the freedom from the design problem. limits of completeness of understanding experimentally. Fur- The first rational design approaches used qualitative rules of thermore, if the experiments are devised in a progressive protein structure applied by inspection (13). These experi- fashion, such that the simplest possible designs are tried first, ments established that it is possible to create sequences de novo followed by iterative additions of more complex interactions that adopt defined structures (1, 14). Furthermore, they dem- until the desired result is achieved, then it may be possible to onstrated that, by following a progressive design strategy [or identify a minimally sufficient set of components. At the center ‘‘hierachic design’’ (1)] in which increasing levels of complexity of the design approach is the ‘‘design cycle,’’ in which theory are iteratively introduced, new insights into the fundamentals and experiment alternate. The starting point is the develop- of protein structure and function can be gained.
    [Show full text]
  • Modeling and Predicting Super-Secondary Structures of Transmembrane Beta-Barrel Proteins Thuong Van Du Tran
    Modeling and predicting super-secondary structures of transmembrane beta-barrel proteins Thuong van Du Tran To cite this version: Thuong van Du Tran. Modeling and predicting super-secondary structures of transmembrane beta-barrel proteins. Bioinformatics [q-bio.QM]. Ecole Polytechnique X, 2011. English. NNT : 2011EPXX0104. pastel-00711285 HAL Id: pastel-00711285 https://pastel.archives-ouvertes.fr/pastel-00711285 Submitted on 23 Jun 2012 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. THESE` pr´esent´ee pour obtenir le grade de DOCTEUR DE L’ECOLE´ POLYTECHNIQUE Sp´ecialit´e: INFORMATIQUE par Thuong Van Du TRAN Titre de la th`ese: Modeling and Predicting Super-secondary Structures of Transmembrane β-barrel Proteins Soutenue le 7 d´ecembre 2011 devant le jury compos´ede: MM. Laurent MOUCHARD Rapporteurs Mikhail A. ROYTBERG MM. Gregory KUCHEROV Examinateurs Mireille REGNIER M. Jean-Marc STEYAERT Directeur Laboratoire d’Informatique UMR X-CNRS 7161 Ecole´ Polytechnique, 91128 Plaiseau CEDEX, FRANCE Composed with LATEX !c Thuong Van Du Tran. All rights reserved. Contents Introduction 1 1Fundamentalreviewofproteins 5 1.1 Introduction................................... 5 1.2 Proteins..................................... 5 1.2.1 Aminoacids............................... 5 1.2.2 Properties of amino acids .
    [Show full text]
  • Analyse Et Identification D'acides Aminés Essentiels De L'extension C
    UNIVERSITÉ DU QUÉBEC À MONTRÉAL ANALYSE ET IDENTIFICATION D'ACIDES AMINÉS ESSENTIELS DE L'EXTENSION C-TERMINALE DE L'ARN HÉLICASE DBP4 MÉMOIRE PRÉSENTÉ COMME EXIGENCE PARTIELLE DE LA MAÎTRISE EN BIOLOGIE PAR MOHAMED AMINE CHAABANE DÉCEMBRE 2016 UNIVERSITÉ DU QUÉBEC À MONTRÉAL Service des bibliothèques Avertissement La diffusion de ce mémoire se fait dans le respect des droits de son auteur, qui a signé le formulaire Autorisation de reproduire et de diffuser un travail de recherche de cycles supérieurs (SDU-522 - Rév.0?-2011 ). Cette autorisation stipule que «conformément à l'article 11 du Règlement no 8 des études de cycles supérieurs, [l'auteur] concède à l'Université du Québec à Montréal une licence non exclusive d'utilisation et de publication de la totalité ou d'une partie importante de [son] travail de recherche pour des fins pédagogiques et non commerciales. Plus précisément, [l 'auteur] autorise l'Université du Québec à Montréal à reproduire, diffuser, prêter, distribuer ou vendre des copies de [son] travail de recherche à des fins non commerciales sur quelque support que ce soit, y compris l'Internet. Cette licence et cette autorisation n'entraînent pas une renonciation de [la] part [de l'auteur] à [ses] droits moraux ni à [ses] droits de propriété intellectuelle. Sauf entente contraire, [l'auteur] conserve la liberté de diffuser et de commercialiser ou non ce travail dont [il] possède un exemplaire.» RElVIERCIEMENTS ll m'est agréable d'exprimer mes remerciements les plus sincères au Professeur François Dragon pour avoir bien voulu accepter de diriger mon projet de maîtrise. Je suis très reconnaissant envers l'aide qu'il m'a apportée pour bien mener le présent travail avec sa disponibilité, ses encouragements permanents, sa compréhension, son soutien moral et fmancier et sa grande modestie.
    [Show full text]