Protein Structure Prediction Rachel
Total Page:16
File Type:pdf, Size:1020Kb
From: Nurul Affiah Binte Sahrin Sent: Tuesday, November 7, 2017 9:07 AM To: Ting Ai Hua Cc: Rachel Seah Mei Hui Subject: FW: RE: FW: Re: Fw: 巜物理学报》软物质专题英文版 Dear Ai Hua, Please kindly refer to the email below. I will come down later and explain to u in detailed Name of Journal: 物理学报 Article title: 水的奇异性质与液液相变 December 11, 2017 9:5 IJMPB S021797921840009X page 1 Thanks & Regards, Affiah Sahrin(Ms) Journal Publishing Administrator World Scientific Publishing Company 5 Toh Tuck Link Singapore 596224 From: Rachel Seah Mei Hui Monday, 6 November, 2017 3:46 PM International Journal of ModernSent: Physics B Vol. 32 (2018) 1840009 (17 pages)To: Nurul Affiah Binte Sahrin <[email protected]> c World Scientific PublishingSubject: Company FW: RE: FW: Re: Fw: 巜物理学报》软物质专题英文版 DOI: 10.1142/S021797921840009X Dear Affiah, Could we try implementing the new layout that you suggested, but add to the back of the citation: Please cite the original DOI. Also, put in brackets the Chinese name of the journal 物理学报 and the Chinese title of the article? Do let me know if you need help with that. Let’s do a sample layout before we typeset all the papers. Thanks Best regards, Protein structure prediction Rachel Haiyou发件人: Deng Rachel∗, YaSeah Jia Meiy and Hui Yang Zhangz;x 发送时间: 2017-11-06∗ 10:39:05 收件人: StevenCollege Shi Hong of Science, Bing - EXT 抄送:Huazhong Agricultural University, 主题:Wuhan RE: FW: 430070, Re: Fw: P.巜物理学报》软物质专题英文版 R. China yCollegeDear Steven, of Physical Science and Technology, Central China Normal University, I do agreeWuhan with their 430079, recommendation P. R. China to add the Chinese citation, although their case seems to be slightly different because it appears that they are one journal which published both the English and Chinese versions. But hopefully that will be fine for the APS editor. I’ll check with the production team again. zDepartmentThanks! of Computational Medicine and Bioinformatics, BestUniversity regards, of Michigan, Ann Arbor, MI 45108, USA Rachel [email protected] From: AcceptedSteven Shi Hongbing 26 September [mailto:[email protected] 2017 ] Published 11 December 2017 Sent: Monday, November 6, 2017 10:13 AM To: Rachel Seah Mei Hui <[email protected]> Translated fromSubject: Acta PhysicaRe: FW: Re: Sinica Fw: 巜物理学报》软物质专题英文版 ( ), 2016, 65(17): 178701. Please reference the original doi:10.7498/aps.65.178701 (⺮ⓑૅৈߣࡹ). Dear Rachel, Predicting 3D structureThank of protein you fromfor your its email amino and acid thank sequence you for is onethe advice. of the most I have impor- also asked the editor of APS for their suggestion to the different DOI. They have consulted some other editorial office with experience for two language version of 1 journal, and agreed 2 different DOIs are necessary. tant unsolved problemsBelow in biophysics is their suggestion and computational and I think biology.it will be Thiseasy to paper do since attempts very litter change needed. Please let me know your comments. to give a comprehensive introduction of the most recent effort and progress on protein 关于DOI号,我咨询了同时刊登中英文的一本期刊,他们采用的办法也是两个DOI号,但是统计引用的时候因为是同一本期刊,不受影响。为方便读者,我的建议是,在脚注《物理学报》的引文信息后面,括号注明 in Chinese,这样,愿意看中文版的读者知道这个是中文的,直接看中文,愿意看英文的就直接看译文。请你们确定一 structure prediction. Following the general flowchart of structure prediction, related con- Int. J. Mod. Phys. B Downloaded from www.worldscientific.com 下。 cepts and methods are presented and discussed. Moreover, brief introductions are made to several widely-used predictionAnd the pointer methods URL link and on the the community-wide original DOI number critical is good, assessment please keep it. of protein structure prediction (CASP) experiments. by UNIVERSITY OF MICHIGAN ANN ARBOR on 03/06/18. For personal use only. Thank you and best regards Keywords: Protein structure prediction; homology modeling; ab initio prediction; struc- ture refinement. Steven PACS numbers: 87.14.E-, 82.30.-b 2017-11-06 Steven Shi Hongbing 1. Introduction 发件人: Rachel Seah Mei Hui Since the latter half of发送时间: 20th century,2017-11-06 a 09:36:10 growing number of researchers from 收件人: Steven Shi Hong Bing - EXT diverse academic backgrounds抄送: are devoted to bio-related researches. Protein, as one of the most widespread主题: and FW: complicated Re: Fw: 巜物理学报》软物质专题英文版 macromolecules within life organisms, attracts a great deal of attentions. Proteins differ from one another primarily in file:////AIHUA/aihua/FWREFW~1.HTM[11/7/2017 1:16:01 PM] xCorresponding author. 1840009-1 December 11, 2017 9:5 IJMPB S021797921840009X page 2 H. Deng, Y. Jia & Y. Zhang their sequence of amino acids, which usually results in different spatial shape and structure and therefore different biological functionalities in cells. However, so for little is known about how protein folds into the specific three-dimensional structure from its one-dimensional sequence. In comparison with the genetic code by which a triple-nucleotide codon in a nucleic acid sequence specifies a single amino acid in protein sequence, the relationship between protein sequence and its steric structure is called the second genetic code (or folding code).1 The various advanced experimental techniques to determine protein sequence and structure have been developed in the last several decades. The number of pro- teins depositing in both UniProt2 and Protein Data Bank (PDB)3 keeps almost an exponential growth, especially in last two decades. It's much easier to obtain protein sequences than to obtain their structures. With the development of advanced DNA sequencing technology, the sequences of proteins have been rapidly accumulated. The UniProt/TrEMBL database contains currently more than 85 million of protein sequences. On the structure side, X-ray crystallography and NMR spectroscopy are currently the two major experimental techniques for protein structure determina- tion. Both of them are, however, time- and manpower-consuming, and have their own technical limitations for different protein targets. As of April 2017, the number of protein structures in PDB increases to ∼ 120;000, which counts however only < 0:2% of the protein sequences in the UniProt. The computational methods for predicting protein structure from its amino acid sequence spring up like mushrooms since the end of 20th century. The research article by Anfinsen in 19734 demonstrated that all the information a protein needs to fold properly is encoded in its amino acid sequence (called Anfinsen’s dogma). From the physical point of view, the amino acid sequence determines the basic molecular composition of the protein, and its native structure corresponds to the most stable conformation with the lowest free energy. Although we know that the protein folding process is governed by various physical laws, it is very hard to give Int. J. Mod. Phys. B Downloaded from www.worldscientific.com such complicated macromolecule (including its interaction with surrounding solvent molecules) an accurate physical description. On the other hand, the huge amount of conformational space needed to search through is also a crucial issue remaining to by UNIVERSITY OF MICHIGAN ANN ARBOR on 03/06/18. For personal use only. be solved on the basis of the current computing power. Many different approaches have been proposed, including the employment of coarse-grained physical model and optimized conformational search algorithms, to address these issues. There is a basic observation that similar sequences from the same evolutionary family often adopt similar protein structures, which forms the foundation of homol- ogy modeling. So far it is the most accurate way to predict protein structure by taking its homologous structure in PDB as template. With the rapid growth of PDB database, an increasing proportion of target proteins can be predicted via homol- ogy modeling. When no structure with obvious sequence similarity to the target protein can be found in PDB, it is still possible to find out proteins with struc- tural similarity to the target protein.5 The method to identify template structures from the PDB is called threading or fold recognition,6{8 which actually matches the 1840009-2 December 11, 2017 9:5 IJMPB S021797921840009X page 3 Protein structure prediction Fig. 1. The general flowchart of protein structure prediction. target sequence to homologous and distant-homologous structures based on some algorithm and take the best matches as structural template. The basic premise for threading to work is that protein structure is highly conservative in evolution and the number of unique structural folds are limited in nature.9,10 Both homology modeling (based on sequence comparison) and threading meth- ods (based on fold-recognition) can be called template-based structure prediction methods.11 Unlike homology modeling and threading methods, ab initio method aims to build structure from the first principles of physics which does not rely on any previously solved structure. The development of ab initio method is also the exploration of the second genetic code. However, successful ab initio methods are very rare and there are still many problems and challenges waiting to be con- quered. Currently Nearly all protein structure prediction methods make use of the structural information from previously solved structures to some extent. Therefore, \template-free"12 is often used for naming the methods that do not belong to the category of homology modeling and threading. Although the detailed prediction processes involved in different prediction methods vary widely, the fundamental steps are consistent, including conformation Int. J. Mod. Phys. B Downloaded from www.worldscientific.com initialization, conformational search, structure selection, all-atom structure recon- struction and structure refinement (as shown in Fig. 1). In this paper, we will in- by UNIVERSITY OF MICHIGAN ANN ARBOR on 03/06/18. For personal use only. troduce these steps in turn, including some well-known methods or tools commonly used in each step. We will also present a short overview of the critical assessment of protein structure prediction (CASP) experiments13,14 and some discussions about the current difficulties and future directions of protein structure prediction.