Membrane Protein Prediction Methods

Total Page:16

File Type:pdf, Size:1020Kb

Membrane Protein Prediction Methods Methods 41 (2007) 460–474 www.elsevier.com/locate/ymeth Membrane protein prediction methods Marco Punta a,b, Lucy R. Forrest a,b, Henry Bigelow a,b, Andrew Kernytsky a,b, Jinfeng Liu a,b,c, Burkhard Rost a,b,c,¤ a Department of Biochemistry and Molecular Biophysics, Columbia University, 1130 St. Nicholas Ave., New York, NY 10032, USA b Columbia University Center for Computational Biology and Bioinformatics (C2B2), 1130 St. Nicholas Ave., New York, NY 10032, USA c North East Structural Genomics Consortium (NESG), Columbia University, 1130 St. Nicholas Ave., New York, NY 10032, USA Accepted 5 July 2006 Abstract We survey computational approaches that tackle membrane protein structure and function prediction. While describing the main ideas that have led to the development of the most relevant and novel methods, we also discuss pitfalls, provide practical hints and high- light the challenges that remain. The methods covered include: sequence alignment, motif search, functional residue identiWcation, trans- membrane segment and protein topology predictions, homology and ab initio modeling. In general, predictions of functional and structural features of membrane proteins are improving, although progress is hampered by the limited amount of high-resolution experi- mental information available. While predictions of transmembrane segments and protein topology rank among the most accurate meth- ods in computational biology, more attention and eVort will be required in the future to ameliorate database search, homology and ab initio modeling. © 2006 Elsevier Inc. All rights reserved. Keywords: Membrane proteins; Protein structure prediction; Protein function prediction; Alignments; Transmembrane segment prediction; Homology modeling; ab initio modeling 1. Introduction thick (»30 Å) hydrophobic core. The sequence and struc- ture of IMPs reXects an extreme eVort of adaptation to this 1.1. The fold space of membrane proteins is relatively smallƒ environment. Since the interactions of polar groups with the hydrophobic core of the membrane are energetically Integral membrane proteins (IMPs) are polypeptide unfavorable, IMPs have evolved so as to minimize them. chains embedded into biological membranes. In this review, Inside the membrane, IMP chains form long regular sec- we focus exclusively on polytopic membrane proteins, i.e. ondary structure elements that cross the entire bilayer, i.e. proteins that span the membrane at least once. We will not are transmembrane (TM) segments. In this way, they satisfy consider monotopic membrane proteins, which despite the hydrogen-bond potential of the backbone amide and being anchored to the membrane do not cross it (anchors carbonyl groups. These secondary structure elements can include amphipathic helices and intra-membranous mole- either be TM helices (TMH) or TM -strands (TMB) cules covalently attached to the protein). arranged into -sheets; helices and strands are connected Biological membranes are phospholipid bilayers. The by regions that generally extend beyond the membrane polar heads of the phospholipids face the aqueous solution core, into the aqueous solution. Also, the assembly of TM on both sides of the membrane, while the lipid tails form a elements is strongly inXuenced by the presence of the bilayer. -Helices form bundles, with each helix strongly * Corresponding author. Fax: +1 212 305 7932. oriented towards the normal to the membrane plane [1]. E-mail address: [email protected] (B. Rost). -strands are arranged in barrel-like structures, with the URL: http://www.rostlab.org/ (B. Rost). barrel axis normal to the membrane plane. While helical 1046-2023/$ - see front matter © 2006 Elsevier Inc. All rights reserved. doi:10.1016/j.ymeth.2006.07.026 M. Punta et al. / Methods 41 (2007) 460–474 461 IMPs are practically ubiquitous (only being absent in the domains contain crevices and cavities that can accommo- outer membrane of Gram-negative bacteria), -barrel date numerous water molecules and other ligands, thus IMPs are only found in the outer membrane of Gram-nega- forming a more diverse environment with which the protein tive bacteria, mitochondria and chloroplasts. Proteins with can interact. There is also evidence that TMH topology is both integral TMHs and TMBs have not been observed. not solely determined by the protein sequence, but depends Due to the strong compositional biases imposed by the on complex interactions with the translocon, the machinery bilayer onto the amino-acid sequence, predicting the loca- responsible for inserting TMHs into the membrane [11]. tion of TM segments in the protein sequence turns out to be Consequently, topology may depend on the organism in a relatively easy task. In fact, in order to cross the mem- which the protein is expressed. Finally, functional IMPs are brane, TMHs need to be at least 15 residues long and com- very often the result of the assembly of several chains, pos- posed predominantly of hydrophobic amino acids. TMBs ing the problem of Wnding the correct protein–protein are generally longer than 10 residues and are comprised of docking solution, a notoriously diYcult task, despite recent alternating hydrophobic and polar amino acids (hydropho- progress [12]. For example, receptor binding and activity in bic side chains face the lipids while polar residues face the G-protein coupled receptors have been reported to be water- or protein-Wlled interior of the barrel). Also, regions linked to oligomerization [13]. Thus, the IMP structural connecting TM segments that are not translocated across space, although smaller than that sampled by water-soluble the bilayer (“inside” or cytoplasmic regions) are enriched in proteins, is still intricate enough to make computational positively charged amino acids, following the so-called structure prediction a very challenging task. ‘positive-inside rule’ [2,3]. This simpliWes the prediction of the way in which TM segments cross the membrane 1.3. Two main prediction tasks: identify and characterize (inside–out or outside–in). In conclusion, in comparison to water-soluble proteins, In this paper, we review computational methods that can IMP chains are able to sample only a limited number of be applied to study IMPs. Some of these approaches have folds [4]. This reduced conformational space may be easier to been developed speciWcally for IMPs, others have been search through traditional sampling algorithms (e.g. molecu- adapted or simply transferred from the techniques origi- lar dynamics and Monte Carlo). As we said, the number, nally devised for water-soluble proteins. Computational location and cross-membrane direction of TM segments can methods have been developed to address two diVerent be predicted rather accurately (see Section 2). It is, therefore, goals: to identify novel IMPs or to characterize the func- reasonable to believe that 3D1-structure prediction for IMPs tional and structural features of a protein experimentally is within the reach of computational methods. known to be inside the membrane. The Wrst task (identiWcation of IMPs) can be solved by 1.2. ƒ but membrane proteins still exhibit remarkable sequence or proWle alignment techniques that detect rela- structural variability tives of known IMPs. Alternatively, we can search motif databases (e.g. PROSITE [14], PRINTS [15]) to identify Notwithstanding the previously described constraints functional motifs that are characteristic of IMPs and use and biases, presently no method can accurately predict the these motifs to extend the database searches, or we can use 3D structure of any IMP from sequence alone. On one prediction methods that locate TM segments. Note that the hand, this is due to the very limited number of high-resolu- latter two approaches do not require homology to a known tion structures available, giving us a myopic view of the IMP and that the described methods produce diVerent and IMP structural space; on the other, IMP architectures may often complementary information. For example, Ruta et al. not be as simple as initially thought. Indeed, as more exper- [16] used database searches and sequence analysis to detect imental structures have become available, IMPs have a previously uncharacterized voltage-gated potassium revealed an unexpected level of structural diversity. Con- channel in the archae bacterium Aeropyrum pernix and straints on TMH length and tilt angles are not as strict as later conWrmed the prediction experimentally. On the other previously hypothesized [5]. Also, analysis of known struc- hand, prediction of TM segments has been widely used for tures in the Protein Data Bank [6] (PDB) indicated that estimating the fraction of membrane proteins in various about 50% of TMHs contain non-canonical elements (i.e. genomes and kingdoms of life. kinks, 310-helix and -helix turns) [7] and 5% of all TMHs Also the second task (functional and structural charac- cross the membrane only partially, i.e. form half TMHs [8]. terization) can be achieved by diVerent means, depending Stable loop regions have been found inside the hydropho- on the type of information that is available (Fig. 1). If the bic core of the membrane [9,10]. In some cases, membrane high-resolution structure of a protein (template) homolo- gous to the target is known, we can use homology modeling to obtain a prediction for the target structure. The quality 1 Abbreviations used: 3D, three-dimensional; GPCR, G-protein coupled of the resulting model typically depends on the sequence receptor; HMM, Hidden Markov Model; IMP, integral membrane pro- tein;
Recommended publications
  • Table S1. List of Proteins in the BAHD1 Interactome
    Table S1. List of proteins in the BAHD1 interactome BAHD1 nuclear partners found in this work yeast two-hybrid screen Name Description Function Reference (a) Chromatin adapters HP1α (CBX5) chromobox homolog 5 (HP1 alpha) Binds histone H3 methylated on lysine 9 and chromatin-associated proteins (20-23) HP1β (CBX1) chromobox homolog 1 (HP1 beta) Binds histone H3 methylated on lysine 9 and chromatin-associated proteins HP1γ (CBX3) chromobox homolog 3 (HP1 gamma) Binds histone H3 methylated on lysine 9 and chromatin-associated proteins MBD1 methyl-CpG binding domain protein 1 Binds methylated CpG dinucleotide and chromatin-associated proteins (22, 24-26) Chromatin modification enzymes CHD1 chromodomain helicase DNA binding protein 1 ATP-dependent chromatin remodeling activity (27-28) HDAC5 histone deacetylase 5 Histone deacetylase activity (23,29,30) SETDB1 (ESET;KMT1E) SET domain, bifurcated 1 Histone-lysine N-methyltransferase activity (31-34) Transcription factors GTF3C2 general transcription factor IIIC, polypeptide 2, beta 110kDa Required for RNA polymerase III-mediated transcription HEYL (Hey3) hairy/enhancer-of-split related with YRPW motif-like DNA-binding transcription factor with basic helix-loop-helix domain (35) KLF10 (TIEG1) Kruppel-like factor 10 DNA-binding transcription factor with C2H2 zinc finger domain (36) NR2F1 (COUP-TFI) nuclear receptor subfamily 2, group F, member 1 DNA-binding transcription factor with C4 type zinc finger domain (ligand-regulated) (36) PEG3 paternally expressed 3 DNA-binding transcription factor with
    [Show full text]
  • The Second Critical Assessment of Fully Automated Structure
    PROTEINS:Structure,Function,andGeneticsSuppl5:171–183(2001) DOI10.1002/prot.10036 CAFASP2:TheSecondCriticalAssessmentofFully AutomatedStructurePredictionMethods DanielFischer,1* ArneElofsson,2 LeszekRychlewski,3 FlorencioPazos,4† AlfonsoValencia,4† BurkhardRost,5 AngelR.Ortiz,6 andRolandL.Dunbrack,Jr.7 1Bioinformatics,DepartmentofComputerScience,BenGurionUniversity,Beer-Sheva,Israel 2StockholmmBioinformaticsCenter,StockholmUniversity,Stockholm,Sweden 3BioInfoBankInstitute,Poznan,Poland 4ProteinDesignGroup,CNB-CSIC,Cantoblanco,Madrid,Spain 5CUBIC,DepartmentofBiochemistryandMolecularBiophysics,ColumbiaUniversity,NewYork,NewYork 6DepartmentofPhysiologyandBiophysics,MountSinaiSchoolofMedicineoftheNewYorkUniversity,NewYork,NewYork 7InstituteforCancerResearch,FoxChaseCancerCenter,Philadelphia,Pennsylvania ABSTRACT TheresultsofthesecondCritical predictors.Weconcludethatasserverscontinueto AssessmentofFullyAutomatedStructurePredic- improve,theywillbecomeincreasinglyimportant tion(CAFASP2)arepresented.ThegoalsofCAFASP inanypredictionprocess,especiallywhendealing areto(i)assesstheperformanceoffullyautomatic withgenome-scalepredictiontasks.Weexpectthat webserversforstructureprediction,byusingthe inthenearfuture,theperformancedifferencebe- sameblindpredictiontargetsasthoseusedat tweenhumansandmachineswillcontinuetonar- CASP4,(ii)informthecommunityofusersaboutthe rowandthatfullyautomatedstructureprediction capabilitiesoftheservers,(iii)allowhumangroups willbecomeaneffectivecompanionandcomple- participatinginCASPtouseandanalyzetheresults menttoexperimentalstructuralgenomics.Proteins
    [Show full text]
  • CASP)-Round V
    PROTEINS: Structure, Function, and Genetics 53:334–339 (2003) Critical Assessment of Methods of Protein Structure Prediction (CASP)-Round V John Moult,1 Krzysztof Fidelis,2 Adam Zemla,2 and Tim Hubbard3 1Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, Rockville, Maryland 2Biology and Biotechnology Research Program, Lawrence Livermore National Laboratory, Livermore, California 3Sanger Institute, Wellcome Trust Genome Campus, Cambridgeshire, United Kingdom ABSTRACT This article provides an introduc- The role and importance of automated servers in the tion to the special issue of the journal Proteins structure prediction field continue to grow. Another main dedicated to the fifth CASP experiment to assess the section of the issue deals with this topic. The first of these state of the art in protein structure prediction. The articles describes the CAFASP3 experiment. The goal of article describes the conduct, the categories of pre- CAFASP is to assess the state of the art in automatic diction, and the evaluation and assessment proce- methods of structure prediction.16 Whereas CASP allows dures of the experiment. A brief summary of progress any combination of computational and human methods, over the five CASP experiments is provided. Related CAFASP captures predictions directly from fully auto- developments in the field are also described. Proteins matic servers. CAFASP makes use of the CASP target 2003;53:334–339. © 2003 Wiley-Liss, Inc. distribution and prediction collection infrastructure, but is otherwise independent. The results of the CAFASP3 experi- Key words: protein structure prediction; communi- ment were also evaluated by the CASP assessors, provid- tywide experiment; CASP ing a comparison of fully automatic and hybrid methods.
    [Show full text]
  • Inactive USP14 and Inactive UCHL5 Cause Accumulation of Distinct Ubiquitinated Proteins In
    bioRxiv preprint doi: https://doi.org/10.1101/479758; this version posted November 26, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Inactive USP14 and inactive UCHL5 cause accumulation of distinct ubiquitinated proteins in mammalian cells Jayashree Chadchankar, Victoria Korboukh, Peter Doig, Steve J. Jacobsen, Nicholas J. Brandon, Stephen J. Moss, Qi Wang AstraZeneca Tufts Laboratory for Basic and Translational Neuroscience, Tufts University, Boston, MA (J.C., S.J.M.) Neuroscience, IMED Biotech Unit, AstraZeneca, Boston, MA (S.J.J., N.J.B., Q.W.) Discovery Sciences, IMED Biotech Unit, AstraZeneca, Boston, MA (V.K., P.D.) Department of Neuroscience, Tufts University School of Medicine, Boston, MA (S.J.M.) Department of Neuroscience, Physiology and Pharmacology, University College, London, WC1E, 6BT, UK (S.J.M.) 1 bioRxiv preprint doi: https://doi.org/10.1101/479758; this version posted November 26, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Running title: Effects of inactive USP14 and UCHL5 in mammalian cells Keywords: USP14, UCHL5, deubiquitinase, proteasome, ubiquitin, β‐catenin Corresponding author: Qi Wang Address: Neuroscience, IMED Biotech Unit, AstraZeneca, Boston, MA 02451 Email address: [email protected] 2 bioRxiv preprint doi: https://doi.org/10.1101/479758; this version posted November 26, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
    [Show full text]
  • Proteomic and Bioinformatic Pipeline to Screen the Ligands of S
    www.nature.com/scientificreports OPEN Proteomic and bioinformatic pipeline to screen the ligands of S. pneumoniae interacting Received: 15 September 2017 Accepted: 14 March 2018 with human brain microvascular Published: xx xx xxxx endothelial cells Irene Jiménez-Munguía1, Lucia Pulzova1, Evelina Kanova1, Zuzana Tomeckova1, Petra Majerova2, Katarina Bhide1, Lubos Comor1, Ivana Sirochmanova1, Andrej Kovac2 & Mangesh Bhide1,2 The mechanisms by which Streptococcus pneumoniae penetrates the blood-brain barrier (BBB), reach the CNS and causes meningitis are not fully understood. Adhesion of bacterial cells on the brain microvascular endothelial cells (BMECs), mediated through protein-protein interactions, is one of the crucial steps in translocation of bacteria across BBB. In this work, we proposed a systematic workfow for identifcation of cell wall associated ligands of pneumococcus that might adhere to the human BMECs. The proteome of S. pneumoniae was biotinylated and incubated with BMECs. Interacting proteins were recovered by afnity purifcation and identifed by data independent acquisition (DIA). A total of 44 proteins were identifed from which 22 were found to be surface-exposed. Based on the subcellular location, ontology, protein interactive analysis and literature review, fve ligands (adhesion lipoprotein, endo-β-N-acetylglucosaminidase, PhtA and two hypothetical proteins, Spr0777 and Spr1730) were selected to validate experimentally (ELISA and immunocytochemistry) the ligand- BMECs interaction. In this study, we proposed a high-throughput approach to generate a dataset of plausible bacterial ligands followed by systematic bioinformatics pipeline to categorize the protein candidates for experimental validation. The approach proposed here could contribute in the fast and reliable screening of ligands that interact with host cells.
    [Show full text]
  • Identification and the Significance of Selective Proteins in Bile And
    University of Arkansas, Fayetteville ScholarWorks@UARK Theses and Dissertations 12-2014 Identification and the Significance of Selective Proteins in Bile and Plasma of Normal and Health- compromised Chickens Balamurugan Packialakshmi University of Arkansas, Fayetteville Follow this and additional works at: http://scholarworks.uark.edu/etd Part of the Analytical Chemistry Commons, Animal Diseases Commons, and the Poultry or Avian Science Commons Recommended Citation Packialakshmi, Balamurugan, "Identification and the Significance of Selective Proteins in Bile and Plasma of Normal and Health- compromised Chickens" (2014). Theses and Dissertations. 2083. http://scholarworks.uark.edu/etd/2083 This Dissertation is brought to you for free and open access by ScholarWorks@UARK. It has been accepted for inclusion in Theses and Dissertations by an authorized administrator of ScholarWorks@UARK. For more information, please contact [email protected], [email protected]. Identification and the Significance of Selective Proteins in Bile and Plasma of Normal and Health-compromised Chickens Identification and the Significance of Selective Proteins in Bile and Plasma of Normal and Health-compromised Chickens A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Cell and Molecular Biology by Balamurugan Packialakshmi Tamilnadu Agricultural University, Bachelor of Technology in Agricultural Biotechnology, 2007 Govind Ballabh Pant University of Agriculture and Technology, Master of Science in Molecular Biology and Biotechnology, 2009 December 2014 University of Arkansas This dissertation is approved for the recommendation to the graduate council. _____________________ Dr. Narayan. C. Rath Dissertation Director _____________________ _____________________ Dr. Jackson O. Lay, Jr. Dr. Robert F. Wideman, Jr. Committee member Committee member ____________________ Dr.
    [Show full text]
  • Supplementary Table 1. the List of Proteins with at Least 2 Unique
    Supplementary table 1. The list of proteins with at least 2 unique peptides identified in 3D cultured keratinocytes exposed to UVA (30 J/cm2) or UVB irradiation (60 mJ/cm2) and treated with treated with rutin [25 µM] or/and ascorbic acid [100 µM]. Nr Accession Description 1 A0A024QZN4 Vinculin 2 A0A024QZN9 Voltage-dependent anion channel 2 3 A0A024QZV0 HCG1811539 4 A0A024QZX3 Serpin peptidase inhibitor 5 A0A024QZZ7 Histone H2B 6 A0A024R1A3 Ubiquitin-activating enzyme E1 7 A0A024R1K7 Tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein 8 A0A024R280 Phosphoserine aminotransferase 1 9 A0A024R2Q4 Ribosomal protein L15 10 A0A024R321 Filamin B 11 A0A024R382 CNDP dipeptidase 2 12 A0A024R3V9 HCG37498 13 A0A024R3X7 Heat shock 10kDa protein 1 (Chaperonin 10) 14 A0A024R408 Actin related protein 2/3 complex, subunit 2, 15 A0A024R4U3 Tubulin tyrosine ligase-like family 16 A0A024R592 Glucosidase 17 A0A024R5Z8 RAB11A, member RAS oncogene family 18 A0A024R652 Methylenetetrahydrofolate dehydrogenase 19 A0A024R6C9 Dihydrolipoamide S-succinyltransferase 20 A0A024R6D4 Enhancer of rudimentary homolog 21 A0A024R7F7 Transportin 2 22 A0A024R7T3 Heterogeneous nuclear ribonucleoprotein F 23 A0A024R814 Ribosomal protein L7 24 A0A024R872 Chromosome 9 open reading frame 88 25 A0A024R895 SET translocation 26 A0A024R8W0 DEAD (Asp-Glu-Ala-Asp) box polypeptide 48 27 A0A024R9E2 Poly(A) binding protein, cytoplasmic 1 28 A0A024RA28 Heterogeneous nuclear ribonucleoprotein A2/B1 29 A0A024RA52 Proteasome subunit alpha 30 A0A024RAE4 Cell division cycle 42 31
    [Show full text]
  • Cross-Talk Between Overlap Interactions in Biomolecules: a Case Study of the Β-Turn Motif
    molecules Article Cross-Talk between Overlap Interactions in Biomolecules: A Case Study of the b-Turn Motif Jayashree Nagesh Solid State and Structural Chemistry Unit, Indian Institute of Science Bangalore, Bengaluru 560012, Karnataka, India; [email protected] Abstract: Noncovalent interactions play a pivotal role in regulating protein conformation, stability and dynamics. Among the quantum mechanical (QM) overlap-based noncovalent interactions, n ! p∗ is the best understood with studies ranging from small molecules to b-turns of model proteins such as GB1. However, these investigations do not explore the interplay between multiple overlap interactions in contributing to local structure and stability. In this work, we identify and characterize all noncovalent overlap interactions in the b-turn, an important secondary structural element that facilitates the folding of a polypeptide chain. Invoking a QM framework of natural bond orbitals, we demonstrate the role of several additional interactions such as n ! s∗ and p ! p∗ that are energetically comparable to or larger than n ! p∗. We find that these interactions are sensitive to changes in the side chain of the residues in the b-turn of GB1, suggesting that the n ! p∗ may not be the only component in dictating b-turn conformation and stability. Furthermore, a database search of n ! s∗ and p ! p∗ in the PDB reveals that they are prevalent in most proteins and have significant interaction energies (∼1 kcal/mol). This indicates that all overlap interactions must be taken into account to obtain a comprehensive picture of their contributions to protein structure and energetics. Lastly, based on the extent of QM overlaps and interaction energies, we propose geometric criteria using which these additional interactions can be efficiently tracked in broad database searches.
    [Show full text]
  • Biological Recognition of Graphene Nanoflakes
    ARTICLE DOI: 10.1038/s41467-018-04009-x OPEN Biological recognition of graphene nanoflakes V. Castagnola1, W. Zhao1, L. Boselli1, M.C. Lo Giudice 1, F. Meder1, E. Polo 1, K.R. Paton2, C. Backes2, J.N. Coleman2 & K.A. Dawson1 The systematic study of nanoparticle–biological interactions requires particles to be repro- ducibly dispersed in relevant fluids along with further development in the identification of biologically relevant structural details at the materials–biology interface. Here, we develop a biocompatible long-term colloidally stable water dispersion of few-layered graphene nano- 1234567890():,; flakes in the biological exposure medium in which it will be studied. We also report the study of the orientation and functionality of key proteins of interest in the biolayer (corona) that are believed to mediate most of the early biological interactions. The evidence accumulated shows that graphene nanoflakes are rich in effective apolipoprotein A-I presentation, and we are able to map specific functional epitopes located in the C-terminal portion that are known to mediate the binding of high-density lipoprotein to binding sites in receptors that are abundant in the liver. This could suggest a way of connecting the materials' properties to the biological outcomes. 1 Centre for BioNano Interactions, School of Chemistry, University College Dublin, Belfield Dublin 4, Ireland. 2 School of Physics, CRANN and AMBER, Trinity College Dublin, Dublin 2, Ireland. Correspondence and requests for materials should be addressed to V.C. (email: [email protected]) or to K.A.D. (email: [email protected]) NATURE COMMUNICATIONS | (2018) 9:1577 | DOI: 10.1038/s41467-018-04009-x | www.nature.com/naturecommunications 1 ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/s41467-018-04009-x ecent years have seen very active interest in understanding Results the factors that influence nanoparticle interactions with Graphene nanoflakes protein corona composition.
    [Show full text]
  • Jakub Paś Application and Implementation of Probabilistic
    Jakub Paś Application and implementation of probabilistic profile-profile comparison methods for protein fold recognition (pol.: Wdrożenie i zastosowania probabilistycznych metod porównawczych profil-profil w rozpoznawaniu pofałdowania białek) Rozprawa doktorska ma formę spójnego tematycznie zbioru artykułów opublikowanych w czasopismach naukowych* Wydział Chemii, Uniwersytet im. Adama Mickiewicza w Poznaniu Promotor: dr hab. Marcin Hoffmann, prof. UAM Promotor pomocniczy: dr Krystian Eitner * Ustawa o stopniach naukowych i tytule naukowym oraz o stopniach i tytule w zakresie sztuki Dz.U.2003.65.595 - Ustawa z dnia 14 marca 2003 r. o stopniach naukowych i tytule naukowym oraz o stopniach i tytule w zakresie sztuki Art 13 1. Rozprawa doktorska, przygotowywana pod opieką promotora albo pod opieką promotora i promotora pomocniczego, o którym mowa w art. 20 ust. 7, powinna stanowić oryginalne rozwiązanie problemu naukowego lub oryginalne dokonanie artystyczne oraz wykazywać ogólną wiedzę teoretyczną kandydata w danej dyscyplinie naukowej lub artystycznej oraz umiejętność samodzielnego prowadzenia pracy naukowej lub artystycznej. 2. Rozprawa doktorska może mieć formę maszynopisu książki, książki wydanej lub spójnego tematycznie zbioru rozdziałów w książkach wydanych, spójnego tematycznie zbioru artykułów opublikowanych lub przyjętych do druku w czasopismach naukowych, określonych przez ministra właściwego do spraw nauki na podstawie przepisów dotyczących finansowania nauki, jeżeli odpowiada warunkom określonym w ust. 1. 3. Rozprawę doktorską może stanowić praca projektowa, konstrukcyjna, technologiczna lub artystyczna, jeżeli odpowiada warunkom określonym w ust. 1. 4. Rozprawę doktorską może także stanowić samodzielna i wyodrębniona część pracy zbiorowej, jeżeli wykazuje ona indywidualny wkład kandydata przy opracowywaniu koncepcji, wykonywaniu części eksperymentalnej, opracowaniu i interpretacji wyników tej pracy, odpowiadający warunkom określonym w ust. 1. 5.
    [Show full text]
  • April 11, 2003 1:34 WSPC/185-JBCB 00018 RAPTOR: OPTIMAL PROTEIN THREADING by LINEAR PROGRAMMING
    April 11, 2003 1:34 WSPC/185-JBCB 00018 Journal of Bioinformatics and Computational Biology Vol. 1, No. 1 (2003) 95{117 c Imperial College Press RAPTOR: OPTIMAL PROTEIN THREADING BY LINEAR PROGRAMMING JINBO XU∗ and MING LIy Department of Computer Science, University of Waterloo, Waterloo, Ont. N2L 3G1, Canada ∗[email protected] [email protected] DONGSUP KIMz and YING XUx Protein Informatics Group, Life Science Division and Computer Sciences and Mathematics Division, Oak Ridge National Lab, Oak Ridge, TN 37831, USA [email protected] [email protected] Received 22 January 2003 Revised 7 March 2003 Accepted 7 March 2003 This paper presents a novel linear programming approach to do protein 3-dimensional (3D) structure prediction via threading. Based on the contact map graph of the protein 3D structure template, the protein threading problem is formulated as a large scale inte- ger programming (IP) problem. The IP formulation is then relaxed to a linear program- ming (LP) problem, and then solved by the canonical branch-and-bound method. The final solution is globally optimal with respect to energy functions. In particular, our en- ergy function includes pairwise interaction preferences and allowing variable gaps which are two key factors in making the protein threading problem NP-hard. A surprising result is that, most of the time, the relaxed linear programs generate integral solutions directly. Our algorithm has been implemented as a software package RAPTOR{RApid Protein Threading by Operation Research technique. Large scale benchmark test for fold recogni- tion shows that RAPTOR significantly outperforms other programs at the fold similarity level.
    [Show full text]
  • USE of CLUSTERING TECHNIQUES for PROTEIN DOMAIN ANALYSIS Eric Rodene University of Nebraska - Lincoln, [email protected]
    University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Computer Science and Engineering: Theses, Computer Science and Engineering, Department of Dissertations, and Student Research Summer 7-29-2016 USE OF CLUSTERING TECHNIQUES FOR PROTEIN DOMAIN ANALYSIS Eric Rodene University of Nebraska - Lincoln, [email protected] Follow this and additional works at: http://digitalcommons.unl.edu/computerscidiss Part of the Biochemistry, Biophysics, and Structural Biology Commons, Bioinformatics Commons, Computer Engineering Commons, and the Computer Sciences Commons Rodene, Eric, "USE OF CLUSTERING TECHNIQUES FOR PROTEIN DOMAIN ANALYSIS" (2016). Computer Science and Engineering: Theses, Dissertations, and Student Research. 109. http://digitalcommons.unl.edu/computerscidiss/109 This Article is brought to you for free and open access by the Computer Science and Engineering, Department of at DigitalCommons@University of Nebraska - Lincoln. It has been accepted for inclusion in Computer Science and Engineering: Theses, Dissertations, and Student Research by an authorized administrator of DigitalCommons@University of Nebraska - Lincoln. USE OF CLUSTERING TECHNIQUES FOR PROTEIN DOMAIN ANALYSIS by Eric T. Rodene A THESIS Presented to the Faculty of The Graduate College at the University of Nebraska In Partial Fulfillment of Requirements For the Degree of Master of Science Major: Computer Science Under the Supervision of Professors Stephen D. Scott and Etsuko N. Moriyama Lincoln, Nebraska July 2016 USE OF CLUSTERING TECHNIQUES FOR PROTEIN DOMAIN ANALYSIS Eric Rodene, M.S. University of Nebraska, 2016 Advisors: Stephen Scott, Etsuko Moriyama Next-generation sequencing has allowed many new protein sequences to be identified. However, this expansion of sequence data limits the ability to determine the structure and function of most of these newly-identified proteins.
    [Show full text]