In Silico Systems Analysis of Biopathways
Total Page:16
File Type:pdf, Size:1020Kb
In Silico Systems Analysis of Biopathways Dissertation zur Erlangung des akademischen Grades eines Doktors der Naturwissenschaften der Universität Bielefeld. Vorgelegt von Ming Chen Bielefeld, im März 2004 MSc. Ming Chen AG Bioinformatik und Medizinische Informatik Technische Fakultät Universität Bielefeld Email: [email protected] Genehmigte Dissertation zur Erlangung des akademischen Grades Doktor der Naturwissenschaften (Dr.rer.nat.) Von Ming Chen am 18 März 2004 Der Technischen Fakultät an der Universität Bielefeld vorgelegt Am 10 August 2004 verteidigt und genehmiget Prüfungsausschuß: Prof. Dr. Ralf Hofestädt, Universität Bielefeld Prof. Dr. Thomas Dandekar, Universität Würzburg Prof. Dr. Robert Giegerich, Universität Bielefeld PD Dr. Klaus Prank, Universität Bielefeld Dr. Dieter Lorenz / Dr. Dirk Evers, Universität Bielefeld ii Abstract In the past decade with the advent of high-throughput technologies, biology has migrated from a descriptive science to a predictive one. A vast amount of information on the metabolism have been produced; a number of specific genetic/metabolic databases and computational systems have been developed, which makes it possible for biologists to perform in silico analysis of metabolism. With experimental data from laboratory, biologists wish to systematically conduct their analysis with an easy-to-use computational system. One major task is to implement molecular information systems that will allow to integrate different molecular database systems, and to design analysis tools (e.g. simulators of complex metabolic reactions). Three key problems are involved: 1) Modeling and simulation of biological processes; 2) Reconstruction of metabolic pathways, leading to predictions about the integrated function of the network; and 3) Comparison of metabolism, providing an important way to reveal the functional relationship between a set of metabolic pathways. This dissertation addresses these problems of in silico systems analysis of biopathways. We developed a software system to integrate the access to different databases, and exploited the Petri net methodology to model and simulate metabolic networks in cells. It develops a computer modeling and simulation technique based on Petri net methodology; investigates metabolic networks at a system level; proposes a markup language for biological data interchange among diverse biological simulators and Petri net tools; establishes a web- based information retrieval system for metabolic pathway prediction; presents an algorithm for metabolic pathway alignment; recommends a nomenclature of cellular signal transduction; and attempts to standardize the representation of biological pathways. Hybrid Petri net methodology is exploited to model metabolic networks. Kinetic modeling strategy and Petri net modeling algorithm are applied to perform the processes of elements functioning and model analysis. The proposed methodology can be used for all other 1 metabolic networks or the virtual cell metabolism. Moreover, perspectives of Petri net modeling and simulation of metabolic networks are outlined. A proposal for the Biology Petri Net Markup Language (BioPNML) is presented. The concepts and terminology of the interchange format, as well as its syntax (which is based on XML) are introduced. BioPNML is designed to provide a starting point for the development of a standard interchange format for Bioinformatics and Petri nets. The language makes it possible to exchange biology Petri net diagrams between all supported hardware platforms and versions. It is also designed to associate Petri net models and other known metabolic simulators. A web-based metabolic information retrieval system, PathAligner, is developed in order to predict metabolic pathways from rudimentary elements of pathways. It extracts metabolic information from biological databases via the Internet, and builds metabolic pathways with data sources of genes, sequences, enzymes, metabolites, etc. The system also provides a navigation platform to investigate metabolic related information, and transforms the output data into XML files for further modeling and simulation of the reconstructed pathway. An alignment algorithm to compare the similarity between metabolic pathways is presented. A new definition of the metabolic pathway is proposed. The pathway defined as a linear event sequence is practical for our alignment algorithm. The algorithm is based on strip scoring the similarity of 4-heirachical EC numbers involved in the pathways. The algorithm described has been implemented and is in current use in the context of the PathAligner system. Furthermore, new methods for the classification and nomenclature of cellular signal transductions are recommended. For each type of characterized signal transduction, a unique ST number is provided. The Signal Transduction Classification Database (STCDB), based on the proposed classification and nomenclature, has been established. By merging the ST numbers with EC numbers, alignments of biopathways are possible. Finally, a detailed model of urea cycle that includes gene regulatory networks, metabolic pathways and signal transduction is demonstrated by using our approaches. A system biological interpretation of the observed behavior of the urea cycle and its related transcriptomics information is proposed to provide new insights for metabolic engineering and medical care. 2 Table of Contents Abstract ...................................................................................................1 Table of Contents ....................................................................................3 Chapter 1 Introduction...........................................................................7 1.1 The Problem....................................................................................7 1.2 Three Profiles..................................................................................9 1.2.1 Metabolic Networks Modeling, Simulation and Analysis.....10 1.2.2 Biopathway Prediction .........................................................11 1.2.3 Biopathway Alignment.........................................................12 1.3 Content of Dissertation..................................................................13 Chapter 2 State of the Art.....................................................................14 2.1 Biology Basics ..............................................................................14 2.1.1 Biological Complexity..........................................................14 2.1.2 Biopathways.........................................................................15 2.1.2.1 Metabolic pathways.....................................................16 2.1.2.2 Gene regulatory networks............................................18 2.1.2.3 Signal transduction pathways ......................................19 2.2 Molecular Databases and Integration.............................................21 2.3 Modeling and Simulation of Metabolic Networks .........................24 2.3.1 Model Classification.............................................................24 2.3.2 Modeling Metabolism ..........................................................26 2.3.3 Related Simulation Environments ........................................29 2.3.4 Petri Net Modeling and Simulation.......................................30 2.4 In Silico Prediction of Metabolic Pathways ...................................35 2.4.1 Existing Resources ...............................................................35 2.4.2 Reconstruction Algorithms...................................................36 2.5 Analysis and Alignment of Biopathways.......................................38 3 2.5.1 Functional Analysis..............................................................38 2.5.2 Comparative Analysis ..........................................................40 2.6 Summary.......................................................................................42 Chapter 3 Hybrid Petri Net Based Modelling and Simulation of Biopathways...........................................................................................44 3.1 Hybrid Petri Nets...........................................................................44 3.2 Cellular Model Development.........................................................47 3.2.1 Petri Net Model Construction of Metabolic Networks..........47 3.2.2 Petri Net Model of Metabolic Reactions...............................50 3.2.3 Models of Gene Regulatory Networks..................................51 3.2.4 Diffusion Transportation ......................................................54 3.3 Petri Net Modeling Strategy ..........................................................55 3.4 Large Scale Network Modeling and Simulation ............................56 3.4.1 Problems and Methods .........................................................57 3.4.1.1. Constitutive model development.................................57 3.4.1.2. Model simplification...................................................58 3.4.2 Prospect of Petri Net Tools...................................................59 3.4.2.1 Cell modeling theory ...................................................59 3.4.2.2 Computation method ...................................................61 3.5 Biology Petri Net Markup Language .............................................62 3.5.1 Introduction..........................................................................62 3.5.1.1 Bioinformatics & XML