Proquest Dissertations

Total Page:16

File Type:pdf, Size:1020Kb

Proquest Dissertations BOSTON UNIVERSITY GRADUATE SCHOOL OF ARTS AND SCIENCES AND COLLEGE OF ENGINEERING Dissertation DEVELOPMENT OF STRUCTURE-BASED COMPUTATIONAL METHODS FOR PREDICTION AND DESIGN OF PROTEIN-PROTEIN INTERACTIONS by BRIAN GREGORY PIERCE B.S., Duke University, 2000 Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy 2008 UMI Number: 3298669 INFORMATION TO USERS The quality of this reproduction is dependent upon the quality of the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleed-through, substandard margins, and improper alignment can adversely affect reproduction. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. ® UMI UMI Microform 3298669 Copyright 2008 by ProQuest LLC. All rights reserved. This microform edition is protected against unauthorized copying under Title 17, United States Code. ProQuest LLC 789 E. Eisenhower Parkway PO Box 1346 Ann Arbor, Ml 48106-1346 Approved by First Reader "23 Zhiping Weng, Ph.D. Associate Professor of Biomedical Engineering Second Readei Charles DeLisi,Ph.D. Arthur G. B. Metcalf Professor of Science and Engineering ACKNOWLEDGEMENTS There are so many people that have provided help and inspiration to me as I've performed this work, far too many to recount in this text. That being said, I will now acknowledge some of the most prominent. First, I would like to thank my wife, Laura, for inspiration, moral support, listening, and understanding as I worked so diligently on this research. As I grew and became involved in science, my parents and sister have also provided invaluable support. Without question, this work could not have been performed without the guidance and expertise of my advisor, Professor Zhiping Weng. She gave me the opportunity to learn and pursue a variety of projects, providing insightful comments and criticism when necessary. I would also like to thank my thesis committee, Professor Charles DeLisi, Professor John Straub, Dr. Enoch Huang, and Professor Scott Mohr for their time, and for their useful comments and suggestions. All of the members of Zlab and the BU Bioinformatics Program have provided invaluable discussions, ideas, and feedback; in particular, Jaafar Haidar, Yong Yu, Kevin Wiehe, Weiwei Tong, Julian Mintseris, Rong Chen, and Howook Hwang. Yong Yu, Jaafar Haidar, and Bruce Miller deserve special mention for their hard work on the experiments related to the TCR, CD4, and LANA proteins. For computing support and expertise, I would like to thank Mary Ellen Fitzpatrick, and for administrative support, Caroline Lyman, Jessica Barros, and Robert Henry. iii I would also like to acknowledge the late Professor Michael Laskowski for gift of the dataset of mutation energies for the OMTKY system to my lab. Also, Marcia Osburne and Cassidy Dobson for the gift of the pelB-CD4 constructs, the late Prof. Don Wiley for the constructs for HLA-A2, and Prof. Brian Baker for the A6 TCR constructs. This work was supported by NSF grants DBI-0078194, DBI-0133834 and DBI-0116574. iv DEVELOPMENT OF STRUCTURE-BASED COMPUTATIONAL METHODS FOR PREDICTION AND DESIGN OF PROTEIN-PROTEIN INTERACTIONS (Order No. ) BRIAN GREGORY PIERCE Boston University Graduate School of Arts and Sciences and College of Engineering, 2008 Major Professor: Zhiping Weng, Associate Professor of Biomedical Engineering ABSTRACT Protein-protein interactions play a key role in the functioning of cells and pathways, and understanding these interactions on a physical and structural level can help greatly in developing therapeutics for diseases. The large amount of protein structures available presents an immense opportunity to model and predict protein interactions using computational techniques. Here we describe the development of algorithms to predict protein complex structures (referred to as protein docking) and to design proteins to improve their interaction affinities. We also present experimental results validating our protein design approach. The protein docking work we present includes the symmetric multimer docking program M- ZDOCK as well as ZRANK which rescores docking predictions using a weighted potential. Both programs have been successful when applied to docking benchmarks and in the CAPRI experiment. In addition, we have used the M-ZDOCK program to produce a tetrameric model for a disease-associated protein, the latent nuclear antigen of the Kaposi's sarcoma-associated herpesvirus. v We have also developed a protein design algorithm to improve the binding between two proteins, given their complex structure. This was applied to a T cell receptor (TCR) to enhance its binding to the Major Histocompatibility Complex and peptide. Several of the point mutations predicted by our algorithm were verified experimentally to bind several times stronger than wild type; we then combined these mutations to produce a TCR with approximately 100-fold affinity improvement. Further testing of combinations of TCR point mutations has led to striking results regarding the kinetics and cooperativity of the mutations. Finally, we have used our protein design algorithm to predict designability of protein complexes from the Protein Data Bank, and identified the complex between CD4 and HIV gpl20 as a target for future structure-based design efforts. Preliminary results for this project are given. vi TABLE OF CONTENTS List of Tables viii List of Figures x List of Abbreviations xii Chapter 1 Introduction 1 Chapter 2 M-ZDOCK: Incorporating Symmetric Searching 4 into Rigid-Body Docking Chapter 3 ZRANK: Optimal Reranking of Initial-Stage Protein 27 Docking Predictions Chapter 4 Refining Protein Docking Predictions Using ZRANK 51 and RosettaDock Chapter 5 Modeling Protein Interaction Affinity Enhancement, 81 and Application to TCR/peptide/MHC complex Chapter 6 Studying Cooperativity of TCR Mutations Through 110 Modeling and Experiments Chapter 7 Exploring the In Silico Designability of Protein 126 Complexes, and Affinity Enhancement of CD4/gpl20 Chapter 8 Summary and Future Directions 143 List of Journal Abbreviations 146 Bibliography 148 Curriculum Vitae 161 vii LIST OF TABLES 2.1 The Unbound Multimer Test Cases 14 2.2 Residues Removed from Multimeric Structures Before Determining Interface 15 Ca Atoms 2.3 M-ZDOCK Results for Quasi-Bound and Bound Test Cases 17 2.4 M-ZDOCK Results for Unbound Test Cases 18 3.1 Results from ZDOCK 2.1 and ZDOCK 2.3 Before and After ZRANK 36 3.2 Charge Terms Used for the ZRANK Candidate Electrostatics Functions 41 4.1 Results for Near-Hit Cases of the ZD3.OZR Set, Before and After Refinement 64 4.2 Number of Cases with Top-Ranked Hits Before and After Refinement 67 4.3 CAPRI Scoring Results Using ZRANK and RosettaDock 69 5.1 Binding Kinetics and Prediction Method for A6 TCR Point Mutants 90 5.2 TCR Point Mutations that Exhibited No Measurable Binding to pepMHC 91 5.3 Weighted ZAFFI Terms, Total Energy Scores, and Measured AAGs for TCR 93 Point mutations 5.4 Contribution of ZAFFI Scoring Terms to Correlation with Measured TCR 95 Point Mutations 5.5 ZAFFI Scores, Filter Scores, and Measured Binding for All TCR Point 96 Mutants 5.6 Binding Kinetics of Combinations of Point Mutants, and Cooperativity 98 of the Energetics 5.7 Binding Kinetics and Specificity of WFGMT TCR Mutant for HLA-A2 with 99 Tax Peptide and V7R Point Mutant of Tax Peptide 6.1 Binding Energy Changes of Combinations of A6 TCR Point Mutants, and 113 Cooperativity 6.2 Inter-Residue Energy Terms for Combinations of Point Mutations on the 115 TCR a Chain viii 6.3 Association Rates, Dissociation Rates, and Binding Energy Changes for 116 Combinations of Mutations from Different Chains 7.1 In Silico Designability of Nonredundant Transient Enzyme/Inhibitor and 132 Other Complexes 7.2 In Silico Designability of Nonredundant Transient Antibody/Antigen 132 Complexes LIST OF FIGURES 2.1 Diagram of Successive Rotations Through Euler Angles <j) and 0 23 2.2 The Relative Positions of the Subunits of a C3 Multimer 24 2.3 The Highest-Ranked Hits of M-ZDOCK Superposed onto the Structures 25 of the Complexes 2.4 Structural Model of the Latent Nuclear Antigen (LANA) Carboxy Terminal 26 Domain Tetramer Bound to the LBS-1 and LBS-2 DNA Binding Sites 3.1 Success Rate Versus the Number of Predictions for Reranked Predictions 47 3.2 Score Versus Interface RMSD Plots Using ZDOCK and ZRANK for 48 Several Test Cases 3.3 Success Rate for Reranking Various Numbers of ZDOCK Predictions, and 49 Success Rate comparison for Various Short-Range Electrostatics Formulations 3.4 Success Rate as a Function of the Quality of the Predictions for ZDOCK 2.3 50 and Reranked ZDOCK 2.3 4.1 Hit Success Rate and Hit and Near-hit success rate for ZDOCK 2.3 and 73 ZDOCK 3.0 with and without ZRANK 4.2 Protocol Employed for Docking and Refinement 74 4.3 Histogram of Interface RMSD Change for All Hit and Near-Hit Models After 75 Refinement 4.4 Percent of Models with RMSD Improvement for Several Search/Scoring 76 Strategies 4.5 Percent of Models with Hits After Refinement for Several Search/Scoring 77 Strategies 4.6 Refinement of Three Test Cases, with Rosetta Scores and ZRANK Scores 78 Versus Interface RMSD 4.7 Success Rates of Refinement for Predictions for Hit and Near-Hit 79 Cases for Various Numbers of RosettaDock Refinement Models 4.8 Refined Structure for Test Case 1IQD 80 x 5.1 A AG k^ and A AG k^ versus A AG for 18 Measured
Recommended publications
  • Genetically Engineered Plants: a Potential Solution to Climate Change
    Genetically Engineered Plants: A Potential Solution to Climate Change Dr Charles DeLisi GENETICALLY ENGINEERED PLANTS: A POTENTIAL SOLUTION TO CLIMATE CHANGE Climate change is already having devastating effects felt across the globe. Without adequate measures to counteract the human drivers behind climate change, these negative consequences are guaranteed to increase in severity in the coming decades. Esteemed biomedical scientist, Dr Charles DeLisi of Boston University, urges that a multi-disciplinary approach to mitigating climate change is vital. Using predictive modelling, he has demonstrated the potential power of genetically engineering plants to remove excess carbon dioxide from the atmosphere, thereby mitigating climate change. ‘During the post-industrial period, the planetary temperature has increased rapidly,’ explains Dr Charles DeLisi of Boston University. ‘Ice sheet melting has accelerated, ocean levels have risen with concomitant increases in coastal flooding, and extreme weather events have increased in frequency.’ Emissions Reductions: Not the Whole Solution Thus far, measures to tackle climate change have focused on reducing our emissions of carbon dioxide and other greenhouse gases. Over the last three The Climate Emergency These changes are driven by increasing decades, over 100 nations across the levels of atmospheric ‘greenhouse globe have signed multiple agreements Climate change is, arguably, the most gases’, such as carbon dioxide. Released to voluntarily limit their emissions, significant global threat that society as a by-product of fossil fuel combustion but these measures alone are proving faces. Even small perturbations and other human activities, the level ineffective at slowing climatic in the planet’s climatic system of carbon dioxide in the atmosphere disruption.
    [Show full text]
  • Bioinformatics
    Bioinformatics 1 ioinformatics is a hybrid science that links biological data with techniques for information storage, distribution, and analysis to support multiple areas of scientific Bresearch, including biomedicine. Bioinformatics is fed by high-throughput data- generating experiments, including genomic sequence determinations and measurements of gene expression patterns. Database projects curate and annotate the data and then distribute it via the World Wide Web. Mining these data leads to scientific discoveries and to the identification of new clinical applications. In the field of medicine in particular, a number of important applications for bioinformatics have been discovered. For example, it is used to identify correlations between gene sequences and diseases, to predict protein structures from amino acid sequences, to aid in the design of novel drugs, and to tailor treatments to individual patients based on their DNA sequences (pharmacogenomics). “[Helix]”, ill. under ‘’Bioinformatics’’, S [&] T: Dependable Solutions, www.stcorp.nl/step/bioinformatics 120431 Bibliotheca Alexandrina Updated by Ghada Sami 1 The goal of bioinformatics is the extension of experimental data by predictions. A fundamental goal of computational biology is the prediction of protein structure from an amino acid sequence. The spontaneous folding of proteins shows that this should be possible. Progress in the development of methods to predict protein folding is measured by biennial Critical Assessment of Structure Prediction (CASP) programs, which involve blind tests of structure prediction methods. Bioinformatics is also used to predict interactions between proteins, given individual structures of the partners. This is known as the “docking problem.” Protein-protein complexes show good complementarity in surface shape and polarity and are stabilized largely by weak interactions, such as burial of hydrophobic surface, hydrogen bonds, and van der Waals forces.
    [Show full text]
  • Boston University Graduate Program in Bioinformatics Handbook 2012
    1 Boston University Graduate Program in Bioinformatics Handbook 2012-2013 http://www.bu.edu/bioinformatics Department of Biochemistry (School of Medicine) Department of Biology (College of Arts and Sciences) Biomedical Engineering Department (College of Engineering) Department of Biostatistics (School of Public Health) Department of Chemistry (College of Arts and Sciences) Department of Computational Biomedicine (School of Medicine) Department of Computer Science (College of Arts and Sciences) Electrical and Computer Engineering Department (College of Engineering) Genetics and Genomics Department (School of Medicine) Department of Mathematics and Statistics (College of Arts and Sciences) Department of Medicine (School of Medicine) Department of Microbiology (School of Medicine) Department of Mechanical Engineering (College of Engineering) Molecular and Cell Biology (Goldman School of Dental Health) Department of Neurology (School of Medicine) Periodontology & Oral Biology (Goldman School of Dental Health) Department of Physics (College of Arts and Sciences) Pulmonary Medicine (School of Medicine) Systems Engineering Division (College of Engineering) Center for Computational Science Center for Advanced Biotechnology Center for Advanced Genomic Technology Last Updated: 10/11/12 2 Bioinformatics Faculty and Staff Faculty Director Thomas Tullius Director of Bioinformatics Associate Directors Gary Benson* Associate Director of IGERT Scott Mohr Administrative Director, Director of Graduate Studies Department of Biochemistry (BUSM) Joseph Zaia Professor Associate Director, Center for Biomedical Mass Spectrometry Department of Biology (CAS) Cynthia Bradham Assistant Professor Geoffrey M. Cooper Professor, Associate Dean of the Faculty, Natural Sciences John Finnerty Associate Professor Ulla Hansen Professor Edward Loechler Professor Kimberly McCall Associate Professor # Daniel Segre* Associate Professor Dean Tolan Professor David Waxman Professor Department of Biomedical Engineering (ENG) Charles Cantor* Professor, Director of Center for Advanced Biotechnology James J.
    [Show full text]
  • Molecular Biology, Cell Biology & Biochemistry (MCBB) Graduate
    2016-2017 Molecular Biology, Cell Biology & Biochemistry (MCBB) Graduate Program Guide Table of Contents MCBB Graduate Program Administration.................................................................................................................................... 5 Participating Staff by Department.................................................................................................................................................. 6 Participating Faculty by Department............................................................................................................................................ 7 Facilities.................................................................................................................................................................................................... 10 MCBB Program Requirements.......................................................................................................................................................... 12 • Seminars................................................................................................................................................................................... 12 • Ph.D. Course Requirements............................................................................................................................................... 13 • M.A. Course Requirements...............................................................................................................................................
    [Show full text]
  • Human Genome Project
    Human Genome Project The Human Genome Project (HGP) was an international scientific research project with the goal of determining the sequence of nucleotide base pairs that make up human DNA, and of identifying and mapping all of the genes of the human genome from both a physical and a functional standpoint.[1] It remains the world's largest collaborative biological project.[2] After the idea was picked up in 1984 by the US government when the planning started, the project formally launched in 1990 and was declared complete in 2003[3]. Funding came from the US government through the National Institutes of Health (NIH) as well as numerous other groups from around the world. A parallel project was conducted outside government by the Celera Corporation, or Celera Genomics, which was formally launched in 1998. Most of the government-sponsored sequencing was performed in twenty universities and research centers in the United States, the United Kingdom, Japan, France, Logo HGP; Vitruvian Man, Leonardo Germany, Spain and China.[4] da Vinci The Human Genome Project originally aimed to map the nucleotides contained in a human haploid reference genome (more than three billion). The "genome" of any given individual is unique; mapping the "human genome" involved sequencing a small number of individuals and then assembling these together to get a complete sequence for each chromosome. Therefore, the finished human genome is a mosaic, not representing any one individual. Contents Human Genome Project History State of completion Applications and proposed
    [Show full text]
  • The High Road to the Human Genome
    A ccomplishing S e q u e n c in g t h e H u m a n G e n o m e Andrew Bartlett Submitted to Cardiff University in fulfilment of the requirements for the degree of Doctor of Philosophy September 2008 i UMI Number: U584600 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. Dissertation Publishing UMI U584600 Published by ProQuest LLC 2013. Copyright in the Dissertation held by the Author. Microform Edition © ProQuest LLC. All rights reserved. This work is protected against unauthorized copying under Title 17, United States Code. ProQuest LLC 789 East Eisenhower Parkway P.O. Box 1346 Ann Arbor, Ml 48106-1346 D e c l a r a t io n This work has not previously been accepted in substance for any degree and is not concurrently submitted in candidature for any degree. Signed ..... y . .........(candidate) Date .............. STATEMENT 1 This thesis is being submitted in partial fulfilment of the requirements for the degree of PhD. S igned /. JjTU..../...... .J.........(cand idate) Date........... STATEMENT 2 This thesis is the result of my own independent work/investigation, except where otherwise stated. Other sources are acknowledged by explicit references. A bibliography is appended. Signed KJ. ) .... (candidate) Date............ .?. .4?.. ■ STATEMENT 3 I hereby give consent for my thesis, if accepted, to be available for photocopying and for inter-library loan, and for the title and summary to be made available to outside organisations.
    [Show full text]
  • Human Genome Program in 1986.”
    extracted from BER Exceptional Service Awards 1997 EXCEPTIONAL SERVICE AWARD for Exploring Genomes Charles DeLisi .................................................................. 20 Betty Mansfield ................................................................ 22 J. Craig Venter .................................................................. 24 Exploring Genomes Exploring OE initiated the world’s first development of biological resources; cost- genome program in 1986 after effective, automated technologies for mapping Dconcluding that the most useful approach for and sequencing; and tools for genome-data detecting inherited mutations—an important analysis. The project currently is on track to DOE health mission—is to obtain a complete deliver the sequence of 3 billion human base DNA reference sequence. In addition, the pairs by 2005. analytical power developed in pursuit of that Vital to the project’s continued suc- goal will lead to myriad applications in widely cess is DOE’s consistent and focused com- disparate fields including bioremediation, mitment to disseminating information about medicine, agriculture, and renewable energy. the progress, resources, and other results Many are surprised to learn that the generated in the Human Genome Project. longest-running federally funded genome These communication efforts also inform research effort is the 12-year-old DOE Human researchers across the broader scientific Genome Program. Its goal is to analyze the community, who are beginning to apply the genetic material—the genome—that
    [Show full text]
  • Boston University Photonics Center Annual Report 2019 the Above Image Is an Overview of MEMS Mirror Design
    Boston University Photonics Center Annual Report 2019 The above image is an overview of MEMS mirror design. The image, featuring Professor David Bishop’s research, is a false color SEM image of the MEMS magnet mirror, comprised of four bimorphs that lift a polysilicon platform off the substrate. A 250 μm cubed N52 magnet is attached to the platform using a custom pick and place micro-gluing technique and a gold plated mirror is glued on top of the magnet using the same gluing technique. (Source: Reprinted with permission from © The Optical Society. C. Pollock, J. Javor, A. Stange, L. K. Barrett, and D. J. Bishop, “Extreme angle, tip-tilt MEMS micromirror enabling full-hemispheric, quasi-static optical coverage,” Optics Express, 2019, 27(11), 15318-15326.) Front cover image: Coupled nonlinear metamaterials, featuring a self-adaptive or intelligent response, serve to enhance the signal-to-noise ratio of magnetic resonance imaging by more than tenfold. (Source: X. Zhao, G. Duan, K. Wu, S.W. Anderson, and X. Zhang, “Intelligent metamaterials based on nonlinearity for magnetic resonance imaging,” Advanced Materials, 2019, 31(49): 1905461. Copyright Wiley-VCH Verlag GmbH & Co. KGaA. Reproduced with permission.) Letter from the Director THIS ANNUAL REPORT summarizes activities of the Boston University Photonics Center for the 2018-2019 academic year. In it, you will find quantitative and descriptive information regarding our photonics programs in education, interdisciplinary research, business innovation, and technology development. Located at the heart of Boston University’s urban campus, the Photonics Center is an interdisciplinary hub for education, research, scholarship, innovation, and technology development associated with practical uses of light.
    [Show full text]
  • 112N16 Tox Focus
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by PubMed Central Environews Focus A 938 VOLUME 112 | NUMBER 16 | November 2004 • Environmental Health Perspectives Focus | Systems Biology Systems Biology The Big Picture enomics, proteomics, and metab- “Traditionally, scientists—toxicologists G olomics have all vastly advanced included—have relied on a reductionist our understanding of human biology and approach to biology,” says William Suk, disease. But the functioning of even a director of the NIEHS Center for Risk and simple system such as a single yeast cell or Integrated Sciences. Even now, many stud- bacterium is much more complicated ies examine complex systems by looking at than the sum of its genes or proteins or cellular components in isolation. For metabolites; it’s the activity of all those instance, a common experiment involves components and their relationships to using DNA microarrays to observe the one another that add up to a living organ- effect of a chemical exposure on thousands ism. Recognizing that complexity, the of genes at once. This technique can emerging field of systems biology quickly tell a scientist which genes may be attempts to harness the power of mathe- vulnerable to that exposure. But a systems matics, engineering, and computer sci- biology approach would attempt to model ence to analyze and integrate data from all not only the chemical’s effect on gene the “omics” and ultimately create working expression but also how that expression Photodisc, Matt Ray/EHP, Joseph Tart/EHP Photodisc, Matt Ray/EHP, models of entire biological systems.
    [Show full text]
  • Genes Linked by Fusion Events Are Generally of the Same Functional Category: a Systematic Analysis of 30 Microbial Genomes
    Genes linked by fusion events are generally of the same functional category: A systematic analysis of 30 microbial genomes Itai Yanai*, Adnan Derti*†, and Charles DeLisi*‡ *Bioinformatics Graduate Program and Department of Biomedical Engineering, Boston University, Boston, MA 02215; and †Lipper Center for Computational Genetics, Harvard Medical School, Boston, MA 02115 Communicated by Charles R. Cantor, Sequenom, Inc., San Diego, CA, May 10, 2001 (received for review June 16, 2000) Recent work in computational genomics has shown that a functional association between two genes can be derived from the existence of a fusion of the two as one continuous sequence in another genome. For each of 30 completely sequenced microbial genomes, we estab- lished all such fusion links among its genes and determined the distribution of links within and among 15 broad functional categories. We found that 72% of all fusion links related genes of the same functional category. A comparison of the distribution of links to simulations on the basis of a random model further confirmed the significance of intracategory fusion links. Where a gene of annotated function is linked to an unclassified gene, the fusion link suggests that the two genes belong to the same functional category. The predic- tions based on fusion links are shown here for Methanobacterium thermoautotrophicum, and another 661 predictions are available at Fig. 1. Correspondence between functional associations and genes linked by http:͞͞fusion.bu.edu. the fusion method. Independent genes in one genome may be found as one continuous gene in other genomes. These fusion links can confirm known functional relationships between genes: M.
    [Show full text]
  • THE GENE WARS PT 2.Pdf 5.8 Mb
    PART TWO Origins of the Genome Project 5 Putting Santa Cruz on the Map T 1 HE FIRST MEETING focused specifically on sequencing the human JL genome was convened in 1985 by Robert Sinsheimer of the University of California at Santa Cruz. While the genome project did not grow out of the meeting, or even emerge as a topic of discussion, the 1985 Santa Cruz gathering did plant the seed. Planning for Sinsheimer's May 1985 meeting at Santa Cruz began the previous October, when Sinsheimer called several faculty biologists—Robert Edgar, Harry Noller, and Robert Ludwig—into his office. Sinsheimer was then chancellor at UCSC. As such, he had been a participant in several major science planning efforts. These included relations with the three national lab- oratories managed for the Department of Energy (DOE) by the University of California (Los Alamos, Lawrence Berkeley, and Lawrence Livermore na- tional laboratories), discussions of the California state proposal to house the Superconducting Super Collider, and, most directly, the Lick Observatory. The UCSC faculty in astronomy had an international reputation. As a biolo- gist, Sinsheimer wanted biology to achieve similar stature. He wanted, he said, to "put Santa Cruz on the map."1 Others had previously conceived of large, concerted mapping projects and technology development, but these did not grow into the genome project. 80 The GENE WARS The European Molecular Biology Laboratory had in 1980 seriously contem- plated sequencing the entire 4,700,000-base-pair genome of the bacterium Escherichia
    [Show full text]
  • Biology for Students: Bioinformatics
    Biology for Students: Bioinformatics The Human Genome Project (HGP) is an international effort to determine the biochemical code for the entire human genome – the base sequences of the 100,000 or so genes and their regulatory elements that serve as the blueprint of human biology. The HGP has generated massive data on genome sequences, disease genes, protein 3-dimensional structures and functions, protein-protein interactions, and gene regulation. Bioinformatics is a new field centered on the development and application of computational methods to organize, integrate and analyze these data. It is closely tied to two other nascent fields – genomics (identification and functional characterization of genes in a massively parallel and high throughput fashion) and proteomics (analysis of the biological functions of gene products and their interactions), which have also resulted from the HGP. The HGP will have major impacts on understanding evolution and developmental biology, and our ability to diagnose and treat diseases. Its impact is also being felt in areas outside those of traditional biology, such as anthropology and forensic medicine. In 1985, a group of visionary scientists led by Charles DeLisi, who was then the director of the office of health and environmental research at the U.S. Department of Energy (DOE), realized that having the entire human genome in hand would provide the foundation for a revolution in biology and medicine (DeLisi, 1988, p.488). As a result, the 1988 presidential budget submission to Congress requested funds to start the Human Genome Project. Momentum built quickly and by 1990 DOE and the U.S. National Institutes of Health (NIH) had coordinated their plans for a fifteen-year project (Collins and Gallas, 1993, p.43).
    [Show full text]