In Silico Systems Analysis of Biopathways

Total Page:16

File Type:pdf, Size:1020Kb

In Silico Systems Analysis of Biopathways In Silico Systems Analysis of Biopathways Dissertation zur Erlangung des akademischen Grades eines Doktors der Naturwissenschaften der Universität Bielefeld. Vorgelegt von Ming Chen Bielefeld, im März 2004 MSc. Ming Chen AG Bioinformatik und Medizinische Informatik Technische Fakultät Universität Bielefeld Email: [email protected] Genehmigte Dissertation zur Erlangung des akademischen Grades Doktor der Naturwissenschaften (Dr.rer.nat.) Von Ming Chen am 18 März 2004 Der Technischen Fakultät an der Universität Bielefeld vorgelegt Am 10 August 2004 verteidigt und genehmiget Prüfungsausschuß: Prof. Dr. Ralf Hofestädt, Universität Bielefeld Prof. Dr. Thomas Dandekar, Universität Würzburg Prof. Dr. Robert Giegerich, Universität Bielefeld PD Dr. Klaus Prank, Universität Bielefeld Dr. Dieter Lorenz / Dr. Dirk Evers, Universität Bielefeld ii Abstract In the past decade with the advent of high-throughput technologies, biology has migrated from a descriptive science to a predictive one. A vast amount of information on the metabolism have been produced; a number of specific genetic/metabolic databases and computational systems have been developed, which makes it possible for biologists to perform in silico analysis of metabolism. With experimental data from laboratory, biologists wish to systematically conduct their analysis with an easy-to-use computational system. One major task is to implement molecular information systems that will allow to integrate different molecular database systems, and to design analysis tools (e.g. simulators of complex metabolic reactions). Three key problems are involved: 1) Modeling and simulation of biological processes; 2) Reconstruction of metabolic pathways, leading to predictions about the integrated function of the network; and 3) Comparison of metabolism, providing an important way to reveal the functional relationship between a set of metabolic pathways. This dissertation addresses these problems of in silico systems analysis of biopathways. We developed a software system to integrate the access to different databases, and exploited the Petri net methodology to model and simulate metabolic networks in cells. It develops a computer modeling and simulation technique based on Petri net methodology; investigates metabolic networks at a system level; proposes a markup language for biological data interchange among diverse biological simulators and Petri net tools; establishes a web- based information retrieval system for metabolic pathway prediction; presents an algorithm for metabolic pathway alignment; recommends a nomenclature of cellular signal transduction; and attempts to standardize the representation of biological pathways. Hybrid Petri net methodology is exploited to model metabolic networks. Kinetic modeling strategy and Petri net modeling algorithm are applied to perform the processes of elements functioning and model analysis. The proposed methodology can be used for all other 1 metabolic networks or the virtual cell metabolism. Moreover, perspectives of Petri net modeling and simulation of metabolic networks are outlined. A proposal for the Biology Petri Net Markup Language (BioPNML) is presented. The concepts and terminology of the interchange format, as well as its syntax (which is based on XML) are introduced. BioPNML is designed to provide a starting point for the development of a standard interchange format for Bioinformatics and Petri nets. The language makes it possible to exchange biology Petri net diagrams between all supported hardware platforms and versions. It is also designed to associate Petri net models and other known metabolic simulators. A web-based metabolic information retrieval system, PathAligner, is developed in order to predict metabolic pathways from rudimentary elements of pathways. It extracts metabolic information from biological databases via the Internet, and builds metabolic pathways with data sources of genes, sequences, enzymes, metabolites, etc. The system also provides a navigation platform to investigate metabolic related information, and transforms the output data into XML files for further modeling and simulation of the reconstructed pathway. An alignment algorithm to compare the similarity between metabolic pathways is presented. A new definition of the metabolic pathway is proposed. The pathway defined as a linear event sequence is practical for our alignment algorithm. The algorithm is based on strip scoring the similarity of 4-heirachical EC numbers involved in the pathways. The algorithm described has been implemented and is in current use in the context of the PathAligner system. Furthermore, new methods for the classification and nomenclature of cellular signal transductions are recommended. For each type of characterized signal transduction, a unique ST number is provided. The Signal Transduction Classification Database (STCDB), based on the proposed classification and nomenclature, has been established. By merging the ST numbers with EC numbers, alignments of biopathways are possible. Finally, a detailed model of urea cycle that includes gene regulatory networks, metabolic pathways and signal transduction is demonstrated by using our approaches. A system biological interpretation of the observed behavior of the urea cycle and its related transcriptomics information is proposed to provide new insights for metabolic engineering and medical care. 2 Table of Contents Abstract ...................................................................................................1 Table of Contents ....................................................................................3 Chapter 1 Introduction...........................................................................7 1.1 The Problem....................................................................................7 1.2 Three Profiles..................................................................................9 1.2.1 Metabolic Networks Modeling, Simulation and Analysis.....10 1.2.2 Biopathway Prediction .........................................................11 1.2.3 Biopathway Alignment.........................................................12 1.3 Content of Dissertation..................................................................13 Chapter 2 State of the Art.....................................................................14 2.1 Biology Basics ..............................................................................14 2.1.1 Biological Complexity..........................................................14 2.1.2 Biopathways.........................................................................15 2.1.2.1 Metabolic pathways.....................................................16 2.1.2.2 Gene regulatory networks............................................18 2.1.2.3 Signal transduction pathways ......................................19 2.2 Molecular Databases and Integration.............................................21 2.3 Modeling and Simulation of Metabolic Networks .........................24 2.3.1 Model Classification.............................................................24 2.3.2 Modeling Metabolism ..........................................................26 2.3.3 Related Simulation Environments ........................................29 2.3.4 Petri Net Modeling and Simulation.......................................30 2.4 In Silico Prediction of Metabolic Pathways ...................................35 2.4.1 Existing Resources ...............................................................35 2.4.2 Reconstruction Algorithms...................................................36 2.5 Analysis and Alignment of Biopathways.......................................38 3 2.5.1 Functional Analysis..............................................................38 2.5.2 Comparative Analysis ..........................................................40 2.6 Summary.......................................................................................42 Chapter 3 Hybrid Petri Net Based Modelling and Simulation of Biopathways...........................................................................................44 3.1 Hybrid Petri Nets...........................................................................44 3.2 Cellular Model Development.........................................................47 3.2.1 Petri Net Model Construction of Metabolic Networks..........47 3.2.2 Petri Net Model of Metabolic Reactions...............................50 3.2.3 Models of Gene Regulatory Networks..................................51 3.2.4 Diffusion Transportation ......................................................54 3.3 Petri Net Modeling Strategy ..........................................................55 3.4 Large Scale Network Modeling and Simulation ............................56 3.4.1 Problems and Methods .........................................................57 3.4.1.1. Constitutive model development.................................57 3.4.1.2. Model simplification...................................................58 3.4.2 Prospect of Petri Net Tools...................................................59 3.4.2.1 Cell modeling theory ...................................................59 3.4.2.2 Computation method ...................................................61 3.5 Biology Petri Net Markup Language .............................................62 3.5.1 Introduction..........................................................................62 3.5.1.1 Bioinformatics & XML
Recommended publications
  • Rule-Based Modeling of Biochemical Systems with Bionetgen
    Rule-Based Modeling of Biochemical Systems with BioNetGen James R. Faeder1, Michael L. Blinov2, and William S. Hlavacek3 Once a model specification is processed by BNG2, Short Abstract — Rule-based modeling involves the models can be exported in one of a variety of formats, representation of molecules as structured objects and including MATLAB M-files and Systems Biology Markup molecular interactions as rules for transforming the attributes language (SBML) (3). BNG2 can also generate a network of of these objects. The approach allows systematic incorporation species and reactions for simulation using either an ODE of site-specific details about protein-protein interactions into a solver or Gillespie’s stochastic simulation algorithm (4). The model for the dynamics of a signal-transduction system, as well as other applications. The consequences of protein-protein dashed line in Fig. 1 indicates that the network generation interactions are difficult to specify and track with a and stochastic simulation engines are coupled so that for conventional modeling approach because of the large number large reaction networks species and reactions can be of protein phosphoforms and protein complexes that these generated on-the-fly as needed (5, 6). Recently, particle- interactions potentially generate. Here, we focus on how a rule- based simulation methods (7, 8) have been developed that based model is specified in the BioNetGen language (BNGL) avoid the computational costs associated with explicit and how a model specification is analyzed using the BioNetGen generation of the reachable network of species and software tool. We also discuss new developments in rule-based interactions, which can become a major bottleneck for the modeling that should enable the construction and analysis of simulation of even moderate-sized rule-based models (9).
    [Show full text]
  • Neurogenic Decisions Require a Cell Cycle Independent Function of The
    RESEARCH ARTICLE Neurogenic decisions require a cell cycle independent function of the CDC25B phosphatase Fre´ de´ ric Bonnet1†, Angie Molina1†, Me´ lanie Roussat1, Manon Azais2, Sophie Bel-Vialar1, Jacques Gautrais2, Fabienne Pituello1*, Eric Agius1* 1Centre de Biologie du De´veloppement, Centre de Biologie Inte´grative, Universite´ de Toulouse, CNRS, UPS, Toulouse, France; 2Centre de Recherches sur la Cognition Animale, Centre de Biologie Inte´grative., Universite´ de Toulouse, CNRS, UPS, Toulouse, France Abstract A fundamental issue in developmental biology and in organ homeostasis is understanding the molecular mechanisms governing the balance between stem cell maintenance and differentiation into a specific lineage. Accumulating data suggest that cell cycle dynamics play a major role in the regulation of this balance. Here we show that the G2/M cell cycle regulator CDC25B phosphatase is required in mammals to finely tune neuronal production in the neural tube. We show that in chick neural progenitors, CDC25B activity favors fast nuclei departure from the apical surface in early G1, stimulates neurogenic divisions and promotes neuronal differentiation. We design a mathematical model showing that within a limited period of time, cell cycle length modifications cannot account for changes in the ratio of the mode of division. Using a CDC25B point mutation that cannot interact with CDK, we show that part of CDC25B activity is independent *For correspondence: of its action on the cell cycle. [email protected] (FP); [email protected] (EA) †These authors contributed equally to this work Introduction In multicellular organisms, managing the development, homeostasis and regeneration of tissues Competing interests: The requires the tight control of self-renewal and differentiation of stem/progenitor cells.
    [Show full text]
  • In Silico Tools for Splicing Defect Prediction: a Survey from the Viewpoint of End Users
    © American College of Medical Genetics and Genomics REVIEW In silico tools for splicing defect prediction: a survey from the viewpoint of end users Xueqiu Jian, MPH1, Eric Boerwinkle, PhD1,2 and Xiaoming Liu, PhD1 RNA splicing is the process during which introns are excised and informaticians in relevant areas who are working on huge data sets exons are spliced. The precise recognition of splicing signals is critical may also benefit from this review. Specifically, we focus on those tools to this process, and mutations affecting splicing comprise a consider- whose primary goal is to predict the impact of mutations within the able proportion of genetic disease etiology. Analysis of RNA samples 5′ and 3′ splicing consensus regions: the algorithms used by different from the patient is the most straightforward and reliable method to tools as well as their major advantages and disadvantages are briefly detect splicing defects. However, currently, the technical limitation introduced; the formats of their input and output are summarized; prohibits its use in routine clinical practice. In silico tools that predict and the interpretation, evaluation, and prospection are also discussed. potential consequences of splicing mutations may be useful in daily Genet Med advance online publication 21 November 2013 diagnostic activities. In this review, we provide medical geneticists with some basic insights into some of the most popular in silico tools Key Words: bioinformatics; end user; in silico prediction tool; for splicing defect prediction, from the viewpoint of end users. Bio- medical genetics; splicing consensus region; splicing mutation INTRODUCTION TO PRE-mRNA SPLICING AND small nuclear ribonucleoproteins and more than 150 proteins, MUTATIONS AFFECTING SPLICING serine/arginine-rich (SR) proteins, heterogeneous nuclear ribo- Sixty years ago, the milestone discovery of the double-helix nucleoproteins, and the regulatory complex (Figure 1).
    [Show full text]
  • On Cellular Automaton Approaches to Modeling Biological Cells
    ON CELLULAR AUTOMATON APPROACHES TO MODELING BIOLOGICAL CELLS MARK S. ALBER∗, MARIA A. KISKOWSKIy , JAMES A. GLAZIERz , AND YI JIANGx Abstract. We discuss two different types of Cellular Automata (CA): lattice-gas- based cellular automata (LGCA) and the cellular Potts model (CPM), and describe their applications in biological modeling. LGCA were originally developed for modeling ideal gases and fluids. We describe several extensions of the classical LGCA model to self-driven biological cells. In partic- ular, we review recent models for rippling in myxobacteria, cell aggregation, swarming, and limb bud formation. These LGCA-based models show the versatility of CA in modeling and their utility in addressing basic biological questions. The CPM is a more sophisticated CA, which describes individual cells as extended objects of variable shape. We review various extensions to the original Potts model and describe their application to morphogenesis; the development of a complex spatial struc- ture by a collection of cells. We focus on three phenomena: cell sorting in aggregates of embryonic chicken cells, morphological development of the slime mold Dictyostelium discoideum and avascular tumor growth. These models include intercellular and extra- cellular interactions, as well as cell growth and death. 1. Introduction. Cellular automata (CA) consist of discrete agents or particles, which occupy some or all sites of a regular lattice. These par- ticles have one or more internal state variables (which may be discrete or continuous) and a set of rules describing the evolution of their state and position (in older models, particles usually occupied all lattice sites, one particle per node, and did not move).
    [Show full text]
  • Proteomic and Bioinformatic Investigation of Altered Pathways in Neuroglobin-Deficient Breast Cancer Cells
    molecules Article Proteomic and Bioinformatic Investigation of Altered Pathways in Neuroglobin-Deficient Breast Cancer Cells Michele Costanzo 1,2 , Marco Fiocchetti 3 , Paolo Ascenzi 3, Maria Marino 3 , Marianna Caterino 1,2,* and Margherita Ruoppolo 1,2,* 1 Department of Molecular Medicine and Medical Biotechnology, School of Medicine, University of Naples Federico II, 80131 Naples, Italy; [email protected] 2 CEINGE—Biotecnologie Avanzate S.C.Ar.L., 80145 Naples, Italy 3 Department of Science, University Roma Tre, 00146 Rome, Italy; marco.fi[email protected] (M.F.); [email protected] (P.A.); [email protected] (M.M.) * Correspondence: [email protected] (M.C.); [email protected] (M.R.) Abstract: Neuroglobin (NGB) is a myoglobin-like monomeric globin that is involved in several processes, displaying a pivotal redox-dependent protective role in neuronal and extra-neuronal cells. NGB remarkably exerts its function upon upregulation by NGB inducers, such as 17β-estradiol (E2) and H2O2. However, the molecular bases of NGB’s functions remain undefined, mainly in non- neuronal cancer cells. Human MCF-7 breast cancer cells with a knocked-out (KO) NGB gene obtained using CRISPR/Cas9 technology were analyzed using shotgun label-free quantitative proteomics in comparison with control cells. The differential proteomics experiments were also performed after treatment with E2, H2O2, and E2 + H2O2. All the runs acquired using liquid chromatography–tandem mass spectrometry were elaborated within the same MaxQuant analysis, leading to the quantification Citation: Costanzo, M.; Fiocchetti, of 1872 proteins in the global proteomic dataset. Then, a differentially regulated protein dataset was M.; Ascenzi, P.; Marino, M.; Caterino, M.; Ruoppolo, M.
    [Show full text]
  • Quantitative Cell Biology with the Virtual Cellq
    570 Review TRENDS in Cell Biology Vol.13 No.11 November 2003 Quantitative cell biology with the Virtual Cellq Boris M. Slepchenko1, James C. Schaff1, Ian Macara2 and Leslie M. Loew1 1Center For Biomedical Imaging Technology and National Resource for Cell Analysis and Modeling, Department of Physiology, University of Connecticut Health Center, Farmington, CT 06030, USA 2Center for Cell Signaling and Department of Pharmacology, University of Virginia, Charlottesville, VA 22908, USA Cell biological processes are controlled by an interact- techniques such as patch-clamp single-channel recording. ing set of biochemical and electrophysiological events Once such data are acquired, a crucial and obvious that are distributed within complex cellular structures. challenge is to determine how these, often disparate and Computational models, comprising quantitative data complex, details can explain the cellular process under on the interacting molecular participants in these events, investigation. The ideal way to meet this challenge is to provide a means for applying the scientific method to integrate and organize the data into a predictive model. these complex systems. The Virtual Cell is a compu- Indeed, the value of computational approaches in cell tational environment designed for cell biologists, to biology has been increasingly recognized as demonstrated facilitate the construction of models and the generation by several recent reviews [4–6] and a book [37]. However, of predictive simulations from them. This review sum- for cell biologists, formulating a mathematical model marizes how a Virtual Cell model is assembled and based on quantitative data and solving its equations to describes the physical principles underlying the calcu- generate simulations can present significant technical lations that are performed.
    [Show full text]
  • Modeling Without Borders: Creating and Annotating Vcell Models Using the Web
    Modeling without Borders: Creating and Annotating VCell Models Using the Web Michael L. Blinov, Oliver Ruebenacker, James C. Schaff, and Ion I. Moraru Center for Cell Analysis and Modeling University of Connecticut Health Center Farmington, CT 06030, USA {blinov,oruebenacker,schaff,moraru}@exchange.uchc.edu http:vcell.org/sybil Abstract. Biological research is becoming increasingly complex and data-rich, with multiple public databases providing a variety of resources: hundreds of thousands of substances and interactions, hundreds of ready to use models, controlled terms for locations and reaction types, links to reference materials (data and/or publications), etc. Mathematical mod- eling can be used to integrate this complex data and create quantita- tive, testable predictions based on the current state of knowledge of a biological process. Data retrieval, visualization, flexible querying, and model annotation for future reuse, are some of the important require- ments for modeling-based research in the modern age. Here we describe an approach that we implement within the popular Virtual Cell (VCell) modeling and simulation framework in order to help connect the mod- eling community with the web of machine-processable systems biology knowledge. A new software application, called SyBiL (Systems Biology Linker), has been designed and developed for simultaneous querying of multiple systems biology knowledge bases and data sources, such as web repositories, databases, and user files, and converting the extracted and refined data into model elements. Integration of SyBiL as a component of VCell makes these capabilities easily available to a wide modeling community. Keywords: Biological databases, mathematical modeling, VCell, data conversion. 1 Introduction Mathematical modeling is increasingly necessarry to investigate the function of molecular pathways and networks, and a growing number of resources that fa- cilitate computational approaches are becoming available.
    [Show full text]
  • Vcell Quick Start Guide
    Virtual Cell Quick Start Guide 1. Pre-requisites: a. Java (version 1.4.2 or later; 1.5.x recommended) b. An Internet connection (broadband, no firewall recommended). 2. Go to http://www.vcell.org/ a. If you are a new user, first register and create a user name and password. b. If you are registered, navigate to the “VCell” page. 3. Click on one of the 2 links to run VCell as an application or as an applet: a. The software will download and start automatically (it can take a couple of minutes the first time or whenever VCell was updated since your last login) – unless you have pop-up blockers enabled and you are trying to run it as an applet, in which case you must click on the new link “Run the Virtual Cell” after logging in. • Tip: Most operating systems and browsers will work as long as you are running Java 1.4 or later, and there are no real hardware requirements. However, check the “Technical requirements” link for recommended configurations. b. VCell as an application will us Java webstart and will give the option to install a local shortcut. • Tip: Whenever you launch VCell from the local shortcut, if you are connected to the Internet, it will automatically check for and download any updated version. • Tip: You can run the local VCell application without an Internet connection and without logging in to the model database; in this case, however, you will only be able to save your work by using File Æ Export, and you will not be able to run simulations and view results until logging in.
    [Show full text]
  • Increased Processing of SINE B2 Ncrnas Unveils a Novel Type Of
    RESEARCH ARTICLE Increased processing of SINE B2 ncRNAs unveils a novel type of transcriptome deregulation in amyloid beta neuropathology Yubo Cheng1,2,3,4†, Luke Saville1,2,3,4†, Babita Gollen1,2,3,4†, Christopher Isaac1,2,3,4†, Abel Belay1,2,3,4, Jogender Mehla3, Kush Patel1,2,3, Nehal Thakor1,2,3, Majid H Mohajerani3, Athanasios Zovoilis1,2,3,4* 1Department of Chemistry and Biochemistry, University of Lethbridge, Lethbridge, Canada; 2Southern Alberta Genome Sciences Centre, University of Lethbridge, Lethbridge, Canada; 3Canadian Centre for Behavioral Neuroscience, University of Lethbridge, Lethbridge, Canada; 4Alberta RNA Research and Training Institute, University of Lethbridge, Lethbridge, Canada Abstract The functional importance of many non-coding RNAs (ncRNAs) generated by repetitive elements and their connection with pathologic processes remains elusive. B2 RNAs, a class of ncRNAs of the B2 family of SINE repeats, mediate through their processing the transcriptional activation of various genes in response to stress. Here, we show that this response is dysfunctional during amyloid beta toxicity and pathology in the mouse hippocampus due to increased levels of B2 RNA processing, leading to constitutively elevated B2 RNA target gene expression and high Trp53 levels. Evidence indicates that Hsf1, a master regulator of stress response, mediates B2 RNA *For correspondence: processing in hippocampal cells and is activated during amyloid toxicity, accelerating the [email protected] processing of SINE RNAs and gene hyper-activation. Our study reveals that in mouse, SINE RNAs † constitute a novel pathway deregulated in amyloid beta pathology, with potential implications for These authors contributed equally to this work similar cases in the human brain, such as Alzheimer’s disease (AD).
    [Show full text]
  • Sudan University of Science and Technology College of Graduate
    Sudan University of Science and Technology College of Graduate Studies Detection of CDX2 Tumor Marker and Human Papilloma Virus in Esophageal Tumors اﻟﻜﺸﻒ ﻋﻦ اﻟﻮاﺳﻤﺔ اﻟﻮرﻣﯿﺔ CDX2 و ﻓﯿﺮوس اﻟﻮرم اﻟﺤﻠﯿﻤﻲ اﻟﺒﺸﺮي ﻓﻰ اورام اﻟﻤﺮيء A dissertation submitted for partial fulfillment for the requirement for M.Sc degree in Medical laboratory Science (Histopathology and cytology) Submitted by: Eslam Mohamed khoghali Ahmed B.SC (honors) in chemical pathology and histopathology – National Ribat University (2010) Supervised by: Dr. Mohammed Siddig Abdelaziz 2014 ﻗﺎل ﷲ ﺗﻌﺎﻟﻰ ۖ◌ ٰ◌ ﺻدق ﷲ اﻟﻌظﯾم ﺳورة اﻟﺗوﺑﺔ اﻵﯾﺔ 105 Dedication To my father…. My mother…. My brothers…. My friends… Eslam, 2014 Acknowledgement All great thanks are firstly to Allah. I would like to send thanks to my supervisor Dr. Mohamed Siddige Abdalaziz for his guidance and continuous assistance. Thanks are also extend to staff of histopathology and cytology in Ibn-Seina hospital and to members of histopathology and cytology department in Sudan University for science and technology for providing me materials and equipment, finally thanks to every one helped me. Eslam, 2014 Abstract This is a descriptive retrospective study conducted at Ibn Seina hospital and Sudan University of Science and Technology during the period from April 2014 to October 2014. The study aimed at detecting CDX2 and association of Human Papilloma Virus (HPV) in esophageal tumor among Sudanese patients using immune-histochemistry and polymerase chain reaction. Samples from 30 patients previously diagnosed with esophageal tumor (20 with esophageal cancer and 10 with benign). Their ages ranging between 8 to 82 years with mean age 37. From each block two slides were cut one for CDX2 and other for HPV detection, also 10µ was cut from each block in eppendorf tube for detection of HPV type 18 gene using PCR.SPSS version 16 computer program was used to analyze the data.
    [Show full text]
  • Genetic Algorithms and Their Application to In-Silico Evolution of Genetic Regulatory Networks
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by University of Hertfordshire Research Archive Genetic Algorithms and their Application to in-silico Evolution of Genetic Regulatory Networks Johannes F. Knabe‡†, Katja Wegner†, Chrystopher L. Nehaniv‡†, and Maria J. Schilstra†* †Biological and Neural Computation Laboratory, and ‡Adaptive Systems Research Group, STRI, University of Hertfordshire, Hatfield, AL10 9AB United Kingdom *To whom correspondence should be addressed ([email protected]) i. Abstract A genetic algorithm (GA) is a procedure that mimics processes occurring in Darwinian evolution to solve computational problems. A GA introduces variation through “mutation” and “recombination” in a “population” of possible solutions to a problem, encoded as strings of characters in “genomes”, and allows this population to evolve, using selection procedures that favor the gradual enrichment of the gene pool with the genomes of the “fitter” individuals. GA’s are particularly suitable for optimization problems in which an effective system design or set of parameter values is sought. In nature, genetic regulatory networks (GRN’s) form the basic control layer in the regulation of gene expression levels. GRN’s are composed of regulatory interactions between genes and their gene products, and are, inter alia, at the basis of the development of single fertilized cells into fully grown organisms. This paper describes how GA’s may be applied to find functional regulatory schemes and parameter values for models that capture the fundamental GRN characteristics. The central ideas behind evolutionary computation and GRN modeling, and the considerations in GA design and use are discussed, and illustrated with an extended example.
    [Show full text]
  • In Silico Protein Design: a Combinatorial and Global Optimization Approach by John L
    From SIAM News, Volume 37, Number 1, January/February 2004 In Silico Protein Design: A Combinatorial and Global Optimization Approach By John L. Klepeis and Christodoulos A. Floudas The use of computational techniques to create peptide- and protein-based therapeutics is an important challenge in medicine. The ultimate goal, defined about two decades ago, is to use computer algorithms to identify amino acid sequences that not only adopt particular three-dimensional structures but also perform specific functions. To those familiar with the field of structural biology, it is certainly not surprising that this problem has been described as “inverse protein folding” [16]. That is, while the grand challenge of protein folding is to understand how a particular protein, defined by its amino acid sequence, finds its unique three-dimensional structure, protein design involves the discovery of sets of amino acid sequences that form functional proteins and fold into specific target structures. Experimental, computational, and hybrid approaches have all contributed to advances in protein design. Applying mutagenesis and rational design techniques, for example, experimentalists have created enzymes with altered functionalities and increased stability. The coverage of sequence space is highly restricted for these techniques, however [4]. An approach that samples more diverse sequences, called directed protein evolution, iteratively applies the techniques of genetic recombination and in vitro functional assays [1]. These methods, although they do a better job of sampling sequence space and generating functionally diverse proteins, are still restricted to the screening of 103 – 106 sequences [22]. Challenges of Generic Computational Protein Design The limitations of experimental techniques serve to highlight the importance of computational protein design.
    [Show full text]