Network Inference in Systems Biology: Recent Developments, Challenges, and Applications

Total Page:16

File Type:pdf, Size:1020Kb

Network Inference in Systems Biology: Recent Developments, Challenges, and Applications Network Inference in Systems Biology: Recent Developments, Challenges, and Applications Michael M. Saint-Antoine1 and Abhyudai Singh2 Abstract One of the most interesting, difficult, and potentially useful topics in compu- tational biology is the inference of gene regulatory networks (GRNs) from ex- pression data. Although researchers have been working on this topic for more than a decade and much progress has been made, it remains an unsolved prob- lem and even the most sophisticated inference algorithms are far from perfect. In this paper, we review the latest developments in network inference, includ- ing state-of-the-art algorithms like PIDC, Phixer, and more. We also discuss unsolved computational challenges, including the optimal combination of algo- rithms, integration of multiple data sources, and pseudo-temporal ordering of static expression data. Lastly, we discuss some exciting applications of net- work inference in cancer research, and provide a list of useful software tools for researchers hoping to conduct their own network inference analyses. 1. Introduction Networks of gene interactions, in which genes activate and repress the tran- scription of other genes, are responsible for much of the complexity of cellular life [1,2], and their malfunction can be disastrous for an organism [3]. So, un- arXiv:1911.04046v1 [q-bio.MN] 11 Nov 2019 derstanding these networks has long been a goal of systems biology. Discovering gene interactions experimentally can be very difficult, and given the approxi- 1Center for Bioinformatics and Computational Biology, University of Delaware, Newark, Delaware 19716, USA 2Electrical and Computer Engineering, University of Delaware, Newark, Delaware 19716, USA Preprint submitted to Current Opinion in Biotechnology November 12, 2019 mately 20,000 genes in the human genome, it is not feasible to do an experiment for every possible pair to check for an interaction. However, with traditional DNA microarray [4], or next-generation sequencing technologies, most notably (single-cell or bulk) RNA-sequencing (RNA-Seq) [5,6], one can get a quantita- tive peek into the transcriptomic profile of an individual cell, or a population of cells. Unfortunately, these measurement techniques involve killing the cell, so each cell can provide only one timepoint of data. A significant disadvantage of this static data is that it will not allow us to detect self-edges (in which a gene regulates itself). In some cases, it may be possible to construct pseudo-temporal data from static single-cell measurements. For example, a researcher may administer a stimulus to a set of cells, and then perform an RNA-Seq experiment on some cells after one hour, then on some cells after two hours, and so on, in order to collect data on the post-stimulus gene expression dynamics. In other cases, it may be possible to infer the temporal ordering of a set of gene expression measurements using computational methods (more on that in the Computational Challenges section). In this paper, the input gene expression data will be assumed to be static single cell data (meaning that each measured cell provides us with only one timepoint of data), unless otherwise noted. 2. Problem Formulation For convenience, it can be useful to abstract the problem of network inference into a graph theory framework. Figure 1 shows the workflow of a hypotheti- cal network inference scheme where, for example, single-cell RNA-Seq is used to measure the expression of a large number of genes in many individual cells. Given the large number of genes measured in these experiments it maybe pru- dent to select a smaller a subset of genes with high biological variance for further analysis. Such genes are typically identified by first plotting the Coefficient of Variation or CV (standard deviation over mean) in expression levels across cells with respect to the mean expression levels for all genes, and then selecting genes 2 Measure gene expression in single cells Biological Variation Variation of Find genes with biological variance Coefficient Mean Expression Level Technical Variation Infer gene regulatory networks Figure 1: A typical workflow to identity genes with high biological variance for network map- ping studies. RNA-Seq typically quantifies the expression of thousands of genes across individ- ual cells or environmental/genetic conditions. The Coefficient of Variation or CV (Standard deviation over mean) in the expression level across cells or conditions is plotted as a function of the mean expression level for all genes. Fitting a trend line through this data provides the expected CV of a gene at given mean level, and this trend line is then used to select genes that have significantly higher CVs (representing high biological variance) while filtering out genes with low CVs that could result from technical noise. 3 with significantly higher CVs than what is expected for a gene at the same mean level. Consider N genes that are identified for network analysis and let their expression levels be represented by random variables fX1;X2; :::; XN g. Each variable corresponds to a node in the GRN, and each edge Xi ! Xj represents a regulatory relationship between Xi and Xj. We can think of the true biological network as a real, unknown set of interactions, and our goal is to approximate it by constructing a set of weighted interactions, where each weight corresponds to our confidence that an edge exists in the true network. A network prediction can be either directed, meaning that a prediction is made about the direction of causality for each interaction, or undirected, mean- ing that no such prediction is made. Directed networks are preferable, as they convey more information, but are also more difficult to construct. A network can also be signed, meaning that activation edges are labeled with a \+" and repression edges are labeled with a \-", or unsigned, meaning the edges are not labeled as positive or negative. Signed networks are important for drawing bio- logical insights. However, unsigned networks are often used for theoretical work on network inference, since the difficult problem is determining which edges represent true interactions. Once an interaction is known, determining its sign is very easy and can be found with a simple covariance. Inferred networks have been shown to contain several motifs (smaller modules that occur more often in the network than expected by random chance), and these motifs are illustrated in Figure 2. The output of a network inference algorithm is a set of weighted edge pre- dictions, where each edge-weight corresponds to the confidence that a real in- teraction exists between two genes. So, the accuracy of these algorithms can be evaluated by running them on a \gold standard" dataset for which the true network structure is already known. The algorithms' ability to recover the true interactions can then be scored using receiver operating characteristic (ROC) curves or precision-recall (PR) curves. Some researchers have suggested that PR curves are the superior metric [7], although both metrics are commonly used. Fortunately for network inference researchers, several gold standard datasets 4 1 2 3 4 5 6 7 8 Figure 2: Illustration of some common GRN motifs. 1. Modularity (clustering of nodes), 2. Self-edge, 3. Fan-out, 4. Fan-in, 5. Feed-back loop, 6. Feed-forward loop, 7. Cascade, 8. Cascade false positive error (a feed-forward loop is incorrectly predicted). In the \Algorithms" section, we will discuss the strengths and weaknesses of various algorithm classes when it comes to detecting different network motifs. 5 are publicly available in the form of DREAM Challenges [8]. These are com- putational biology competitions for which researchers are invited to design new algorithms to make predictions from data, and many involve predicting GRN structures from expression data. Once each competition is over, the datasets and true network structures are released to the public, and are commonly used for algorithm evaluation [9,10,11]. 3. Algorithms Several broad classes of algorithms are used for network inference. An excel- lent introductory overview of these can be found in Huynh-Thu and Sanguinetti 2019 [12]. A thorough evaluation of the different types of algorithms can be found in Marbach et al. 2012 [9], in which many different algorithms were benchmarked on the DREAM5 gold standard networks, with varying results. To put it simply, there is no overall \best" class of algorithms. Rather, each has its own advantages and disadvantages, so choosing the best one depends on the context. Here, we will give a brief description of each class, but will mainly focus on recent developments and unsolved challenges. Figure 3 provides a summary of some of the properties of different algorithm types. 3.1. Correlation One of the most basic methods of network inference is to simply compute the correlation for each pair of genes. This method is very simplistic, but is also fast and scalable for large datasets. It can be especially useful in cases where researchers want a general idea of which genes are related, without caring much about causal direction or distinguishing between direct and indirect regulation. For this reason, the resulting network prediction may sometimes be referred to as a \gene co-expression network" rather than a GRN. In general, correlation- based algorithms tend to perform relatively well on detecting feed-forward loops, fan-ins, and fan-outs, but also have an increased false positive rate for cascades [9] (please see Figure 2 for an explanation of these motifs). 6 Algorithm Class Temporal Data Directionality Advantages Disadvantages Examples Required? Correlation No Undirected • Fast, scalable • Possibly WGCNA [13] • Detection of over-simplistic PGCNA [14] feed-forward loops, • False positives for fan-ins, and fan-outs cascades Regression No Directed • Good overall accuracy • Bad detection of TIGRESS [15], feed-forward loops, GENIE3 [16], fan-ins, and fan-outs bLARS [17] Bayesian - No Directed • Performance on small • Performance on large [19,20] Simple networks networks.
Recommended publications
  • Dynamical Gene Regulatory Networks Are Tuned by Transcriptional Autoregulation with Microrna Feedback Thomas G
    www.nature.com/scientificreports OPEN Dynamical gene regulatory networks are tuned by transcriptional autoregulation with microRNA feedback Thomas G. Minchington1, Sam Grifths‑Jones2* & Nancy Papalopulu1* Concepts from dynamical systems theory, including multi‑stability, oscillations, robustness and stochasticity, are critical for understanding gene regulation during cell fate decisions, infammation and stem cell heterogeneity. However, the prevalence of the structures within gene networks that drive these dynamical behaviours, such as autoregulation or feedback by microRNAs, is unknown. We integrate transcription factor binding site (TFBS) and microRNA target data to generate a gene interaction network across 28 human tissues. This network was analysed for motifs capable of driving dynamical gene expression, including oscillations. Identifed autoregulatory motifs involve 56% of transcription factors (TFs) studied. TFs that autoregulate have more interactions with microRNAs than non‑autoregulatory genes and 89% of autoregulatory TFs were found in dual feedback motifs with a microRNA. Both autoregulatory and dual feedback motifs were enriched in the network. TFs that autoregulate were highly conserved between tissues. Dual feedback motifs with microRNAs were also conserved between tissues, but less so, and TFs regulate diferent combinations of microRNAs in a tissue‑dependent manner. The study of these motifs highlights ever more genes that have complex regulatory dynamics. These data provide a resource for the identifcation of TFs which regulate the dynamical properties of human gene expression. Cell fate changes are a key feature of development, regeneration and cancer, and are ofen thought of as a “land- scape” that cells move through1,2. Cell fate changes are driven by changes in gene expression: turning genes on or of, or changing their levels above or below a threshold where a cell fate change occurs.
    [Show full text]
  • Systematic Comparison of Sea Urchin and Sea Star Developmental Gene Regulatory Networks Explains How Novelty Is Incorporated in Early Development
    ARTICLE https://doi.org/10.1038/s41467-020-20023-4 OPEN Systematic comparison of sea urchin and sea star developmental gene regulatory networks explains how novelty is incorporated in early development Gregory A. Cary 1,3,5, Brenna S. McCauley1,4,5, Olga Zueva1, Joseph Pattinato1, William Longabaugh2 & ✉ Veronica F. Hinman 1 1234567890():,; The extensive array of morphological diversity among animal taxa represents the product of millions of years of evolution. Morphology is the output of development, therefore phenotypic evolution arises from changes to the topology of the gene regulatory networks (GRNs) that control the highly coordinated process of embryogenesis. A particular challenge in under- standing the origins of animal diversity lies in determining how GRNs incorporate novelty while preserving the overall stability of the network, and hence, embryonic viability. Here we assemble a comprehensive GRN for endomesoderm specification in the sea star from zygote through gastrulation that corresponds to the GRN for sea urchin development of equivalent territories and stages. Comparison of the GRNs identifies how novelty is incorporated in early development. We show how the GRN is resilient to the introduction of a transcription factor, pmar1, the inclusion of which leads to a switch between two stable modes of Delta-Notch signaling. Signaling pathways can function in multiple modes and we propose that GRN changes that lead to switches between modes may be a common evolutionary mechanism for changes in embryogenesis. Our data additionally proposes a model in which evolutionarily conserved network motifs, or kernels, may function throughout development to stabilize these signaling transitions. 1 Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
    [Show full text]
  • An Automatic Design Framework of Swarm Pattern Formation Based on Multi-Objective Genetic Programming
    JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2015 1 An Automatic Design Framework of Swarm Pattern Formation based on Multi-objective Genetic Programming Zhun Fan, Senior Member, IEEE, Zhaojun Wang, Xiaomin Zhu, Bingliang Hu, Anmin Zou, and Dongwei Bao Abstract—Most existing swarm pattern formation methods predetermined trajectory and maintain a specific swarm pattern depend on a predefined gene regulatory network (GRN) structure in the execution of tasks. For example, Jin [11] proposed a that requires designers’ priori knowledge, which is difficult to hierarchical gene regulatory network for adaptive multi-robot adapt to complex and changeable environments. To dynamically adapt to the complex and changeable environments, we propose pattern formation. In this work, the swarm pattern is designed an automatic design framework of swarm pattern formation as a band of circle, which encircles targets in a dynamic based on multi-objective genetic programming. The proposed environments. In the latter case of using adaptive formation, framework does not need to define the structure of the GRN- swarm robots follow an adaptive pattern to encircle targets in based model in advance, and it applies some basic network motifs the environment. For example, Oh et al. [12] have introduced to automatically structure the GRN-based model. In addition, a multi-objective genetic programming (MOGP) combines with an evolving hierarchical gene regulatory network for morpho- NSGA-II, namely MOGP-NSGA-II, to balance the complexity genetic pattern formation in order to generate adaptive patterns and accuracy of the GRN-based model. In evolutionary process, which are adaptable to dynamic environments. an MOGP-NSGA-II and differential evolution (DE) are applied to In addition, swarm pattern formation has been widely ex- optimize the structures and parameters of the GRN-based model plored in recent years, which can be divided into four cate- in parallel.
    [Show full text]
  • Temporal Flexibility of Gene Regulatory Network Underlies a Novel Wing Pattern in Flies
    Temporal flexibility of gene regulatory network underlies a novel wing pattern in flies Héloïse D. Dufoura,b, Shigeyuki Koshikawa (越川滋行)a,b,1,2,3, and Cédric Fineta,b,3,4 aHoward Hughes Medical Institute, University of Wisconsin, Madison, WI 53706; and bLaboratory of Molecular Biology, University of Wisconsin, Madison, WI 53706 Edited by Denis Duboule, University of Geneva, Geneva 4, Switzerland, and approved April 6, 2020 (received for review February 3, 2020) Organisms have evolved endless morphological, physiological, and Nevertheless, it does not explain how the new expression of behavioral novel traits during the course of evolution. Novel traits the coopted toolkit genes does not interfere with the develop- were proposed to evolve mainly by orchestration of preexisting ment of the tissue. Some authors have suggested that the reuse genes. Over the past two decades, biologists have shown that of toolkit genes might only happen during late development after cooption of gene regulatory networks (GRNs) indeed underlies completion of the early function of the redeployed genes (21, 22, numerous evolutionary novelties. However, very little is known 29). However, little is known about the properties of a GRN that about the actual GRN properties that allow such redeployment. allow the cooption of one or several of its components/genes Here we have investigated the generation and evolution of the without impairing the development of the tissue. complex wing pattern of the fly Samoaia leonensis. We show that In this study, we use the complex wing pigmentation pattern of the transcription factor Engrailed is recruited independently from the fly species Samoaia leonensis as a model to address how the the other players of the anterior–posterior specification network to generate a new wing pattern.
    [Show full text]
  • Gene Regulatory Networks
    Gene Regulatory Networks 02-710 Computaonal Genomics Seyoung Kim Transcrip6on Factor Binding Transcrip6on Control • Gene transcrip.on is influenced by – Transcrip.on factor binding affinity for the regulatory regions of target genes – Transcrip.on factor concentraon – Nucleosome posi.oning and chroman states – Enhancer ac.vity Gene Transcrip6onal Regulatory Network • The expression of a gene is controlled by cis and trans regulatory elements – Cis regulatory elements: DNA sequences in the regulatory region of the gene (e.g., TF binding sites) – Trans regulatory elements: RNAs and proteins that interact with the cis regulatory elements Gene Transcrip6onal Regulatory Network • Consider the following regulatory relaonships: Target gene1 Target TF gene2 Target gene3 Cis/Trans Regulatory Elements Binding site: cis Target TF regulatory element TF binding affinity gene1 can influence the TF target gene Target gene2 expression Target TF gene3 TF: trans regulatory element TF concentra6on TF can influence the target gene TF expression Gene Transcrip6onal Regulatory Network • Cis and trans regulatory elements form a complex transcrip.onal regulatory network – Each trans regulatory element (proteins/RNAs) can regulate mul.ple target genes – Cis regulatory modules (CRMs) • Mul.ple different regulators need to be recruited to ini.ate the transcrip.on of a gene • The DNA binding sites of those regulators are clustered in the regulatory region of a gene and form a CRM How Can We Learn Transcriponal Networks? • Leverage allele specific expressions – In diploid organisms, the transcript levels from the two copies of the genes may be different – RNA-seq can capture allele- specific transcript levels How Can We Learn Transcriponal Networks? • Leverage allele specific gene expressions – Teasing out cis/trans regulatory divergence between two species (WiZkopp et al.
    [Show full text]
  • A Computational Model Inspired by Gene Regulatory Networks
    Rui Miguel Lourenço Lopes A Computational Model Inspired by Gene Regulatory Networks Doctoral thesis submitted to the Doctoral Program in Information Science and Technology, supervised by Full Professor Doutor Ernesto Jorge Fernandes Costa, and presented to the Department of Informatics Engineering of the Faculty of Sciences and Technology of the University of Coimbra. January 2015 A Computational Model Inspired by Gene Regulatory Networks A thesis submitted to the University of Coimbra in partial fulfillment of the requirements for the Doctoral Program in Information Science and Technology by Rui Miguel Lourenço Lopes [email protected] Department of Informatics Engineering Faculty of Sciences and Technology University of Coimbra Coimbra, January 2015 Financial support by Fundação para a Ciência e a Tecnologia, through the PhD grant SFRH/BD/69106/2010. A Computational Model Inspired by Gene Regulatory Networks ©2015 Rui L. Lopes ISBN 978-989-20-5460-5 Cover image: Composition of a Santa Fe trail program evolved by ReNCoDe overlaid on the ancestors, with a 3D model of the DNA crystal structure (by Paul Hakimata). This dissertation was prepared under the supervision of Ernesto J. F. Costa Full Professor of the Department of Informatics Engineering of the Faculty of Sciences and Tecnology of the University of Coimbra In loving memory of my father. Acknowledgements Thank you very much to Prof. Ernesto Costa, for his vision on research and education, for all the support and contributions to my ideas, and for the many life stories that always lighten the mood. Thank you also to Nuno Lourenço for the many discussions, shared books, and his constant helpfulness.
    [Show full text]
  • Gene Regulatory Network Reconstruction Using Bayesian
    1 Gene regulatory network reconstruction using Bayesian networks, the Dantzig selector, the Lasso and their meta-analysis Matthieu Vignes1∗, Jimmy Vandel1, David Allouche1, Nidal Ramadan-Alban1, Christine Cierco-Ayrolles1, Thomas Schiex1, Brigitte Mangin1, Simon de Givry1 1 SaAB Team/BIA Unit, INRA Toulouse, Castanet-Tolosan, France ∗ E-mail: [email protected] Abstract Modern technologies and especially next generation sequencing facilities are giving a cheaper access to genotype and genomic data measured on the same sample at once. This creates an ideal situation for multifactorial experiments designed to infer gene regulatory networks. The fifth \Dialogue for Reverse Engineering Assessments and Methods" (DREAM5) challenges are aimed at assessing methods and as- sociated algorithms devoted to the inference of biological networks. Challenge 3 on \Systems Genetics" proposed to infer causal gene regulatory networks from different genetical genomics data sets. We in- vestigated a wide panel of methods ranging from Bayesian networks to penalised linear regressions to analyse such data, and proposed a simple yet very powerful meta-analysis, which combines these inference methods. We present results of the Challenge as well as more in-depth analysis of predicted networks in terms of structure and reliability. The developed meta-analysis was ranked first among the 16 teams participating in Challenge 3A. It paves the way for future extensions of our inference method and more accurate gene network estimates in the context of genetical genomics. Introduction Inferring gene regulatory networks. Gene regulatory networks (GRN) are simplified representa- tions of mechanisms that make up the functioning of an organism under given conditions. A node in such a network stands for a gene i.e.
    [Show full text]
  • 'Non-Genetic' Inheritance: Insights from Molecular-Evolutionary Crosstalk
    Adrian-Kalchhauser et al / Understanding NGI: Insights from Molecular-Evolutionary Crosstalk / EcoEvoRxiv 20200501 Understanding 'Non-genetic' Inheritance: Insights from Molecular-Evolutionary Crosstalk Irene Adrian-Kalchhauser1, Sonia E. Sultan2, Lisa Shama3, Helen Spence-Jones4, Stefano Tiso5, Claudia Isabelle Keller Valsecchi6, Franz J. Weissing5 Addresses 1Centre for Fish and Wildlife Health, Department for Infectious Diseases and Pathobiology, Vetsuisse Faculty, University of Bern, Länggassstrasse 122, 3012 Bern, Switzerland 2Biology Department, Wesleyan University, Middletown CT 06459 USA 3Coastal Ecology Section, Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research, Wadden Sea Station Sylt, Hafenstrasse 43, 25992 List, Germany 4Centre for Biological Diversity, School of Biology, University of St Andrews, UK 5Groningen Institute for Evolutionary Life Sciences, University of Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands 6Institute of Molecular Biology (IMB), Ackermannweg 4, 55128 Mainz, Germany Corresponding author: Adrian-Kalchhauser, I. ([email protected]) Keywords epigenetics, epiallelic variation, methylation, transgenerational effects, inherited gene regulation, gene regulatory network model Abstract Understanding the evolutionary and ecological roles of 'non-genetic' inheritance is daunting due to the complexity and diversity of epigenetic mechanisms. We draw on precise insights from molecular structures and events to identify three general features of 'non-genetic' inheritance systems that are central to broader investigations: (i) they are functionally interdependent with, rather than separate from, DNA sequence; (ii) each of these mechanisms is not uniform but instead varies phylogenetically and operationally; and (iii) epigenetic elements are probabilistic, interactive regulatory factors and not deterministic 'epi-alleles' with defined genomic locations and effects. We explain each feature and offer research recommendations.
    [Show full text]
  • Algorithmic Information Theory Applications in Bright Field Microscopy and Epithelial Pattern Formation
    Utah State University DigitalCommons@USU All Graduate Theses and Dissertations Graduate Studies 5-2015 Algorithmic Information Theory Applications in Bright Field Microscopy and Epithelial Pattern Formation Hamid Mohamadlou Utah State University Follow this and additional works at: https://digitalcommons.usu.edu/etd Part of the Computer Sciences Commons Recommended Citation Mohamadlou, Hamid, "Algorithmic Information Theory Applications in Bright Field Microscopy and Epithelial Pattern Formation" (2015). All Graduate Theses and Dissertations. 4539. https://digitalcommons.usu.edu/etd/4539 This Dissertation is brought to you for free and open access by the Graduate Studies at DigitalCommons@USU. It has been accepted for inclusion in All Graduate Theses and Dissertations by an authorized administrator of DigitalCommons@USU. For more information, please contact [email protected]. ALGORITHMIC INFORMATION THEORY APPLICATIONS IN BRIGHT FIELD MICROSCOPY AND EPITHELIAL PATTERN FORMATION by Hamid Mohamadlou A dissertation submitted in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY in Computer Science Approved: Dr. Nicholas Flann Dr. Gregory Podgorski Major Professor Committee Member Dr. Xiaojun Qi Dr. Minghui Jiang Committee Member Committee Member Dr. Haitao Wang Dr. Mark R. McLellan Committee Member Vice President for Research and Dean of the School of Graduate Studies UTAH STATE UNIVERSITY Logan, Utah 2015 ii Copyright c Hamid Mohamadlou 2015 All Rights Reserved iii ABSTRACT Algorithmic Information Theory applications in Bright Field Microscopy and Epithelial Pattern Formation by Hamid Mohamadlou, Doctor of Philosophy Utah State University, 2015 Major Professor: Dr. Nicholas Flann Department: Computer Science Algorithmic Information Theory (AIT), also known as Kolmogorov complexity, is a quantitative approach to defining information.
    [Show full text]
  • Computational Representation of Developmental Genetic Regulatory Networks
    Developmental Biology 283 (2005) 1 – 16 www.elsevier.com/locate/ydbio Review Computational representation of developmental genetic regulatory networks William J.R. Longabaugha, Eric H. Davidsonb,1, Hamid Bolouria,b,* aInstitute for Systems Biology, Seattle, WA 98103-8904, USA bDivision of Biology, California Institute of Technology, Pasadena, CA 91125, USA Received for publication 12 February 2005, revised 30 March 2005, accepted 16 April 2005 Available online 23 May 2005 Abstract Developmental genetic regulatory networks (GRNs) have unique architectural characteristics. They are typically large-scale, multi- layered, and organized in a nested, modular hierarchy of regulatory network kernels, function-specific building blocks, and structural gene batteries. They are also inherently multicellular and involve changing topological relationships among a growing number of cells. Reconstruction of developmental GRNs requires unique computational tools that support the above representational requirements. In addition, we argue that DNA-centered network modeling, separate descriptions of network organization and network behavior, and support for network documentation and annotation are essential requirements for computational modeling of developmental GRNs. Based on these observations, we have developed a freely available, platform-independent, open source software package (BioTapestry) which supports both the process of model construction and also model visualization, analysis, documentation, and dissemination. We provide an overview of the main features of BioTapestry. The BioTapestry software and additional documents are available from http://www.biotapestry.org.We recommend BioTapestry as the substrate for further co-development for and by the developmental biology community. D 2005 Elsevier Inc. All rights reserved. Keywords: Genetic regulatory networks; Computational modeling; Visualization; Documentation Introduction can be fully and unambiguously verified by well-established experimental procedures.
    [Show full text]
  • The Sonic Hedgehog Signaling System As a Bistable Genetic Switch
    2748 Biophysical Journal Volume 86 May 2004 2748–2757 The Sonic Hedgehog Signaling System as a Bistable Genetic Switch Karen Lai, Matthew J. Robertson, and David V. Schaffer Department of Chemical Engineering and the Helen Wills Neuroscience Institute, University of California, Berkeley, California ABSTRACT Sonic hedgehog (Shh) controls critical cellular decisions between distinct fates in many systems, particularly in stem cells. The Shh network functions as a genetic switch, and we have theoretically and computationally analyzed how its structure can endow it with the ability to switch fate choices at a threshold Shh concentration. The network is composed of a positive transcriptional feedback loop embedded within a negative signaling feedback loop. Specifically, positive feedback by the transcription factor Gli, which upregulates its own expression, leads to a switch that can adopt two distinct states as a function of Shh. However, Gli also upregulates the signaling repressor Patched, negative feedback that reins in the strong Gli autoregulatory loop. Mutations that have been associated with cancer are predicted to yield an irreversible switch to a high Gli state. Finally, stochastic simulation reveals the negative Patched feedback loop serves a critical function of dampening Gli fluctuations to reduce spontaneous state switching and preserve the network’s robust, switch-like behavior. Tightly linked positive and negative feedback loops are present in many signaling systems, and the Shh system is therefore likely representative of a large set of gene regulation networks that control stem cell fate throughout development and into adulthood. INTRODUCTION Throughout development and adulthood, cells are exposed to development by forming a concentration gradient, a phe- complex regulatory signals and must correctly interpret these nomenon best characterized in the neural tube and spinal signals to implement necessary functional decisions.
    [Show full text]
  • Analysis on Gene Modular Network Reveals Morphogen-Directed Development Robustness in Drosophila
    Zhang et al. Cell Discovery (2020) 6:43 Cell Discovery https://doi.org/10.1038/s41421-020-0173-z www.nature.com/celldisc ARTICLE Open Access Analysis on gene modular network reveals morphogen-directed development robustness in Drosophila Shuo Zhang1,2,JuanZhao1,3, Xiangdong Lv1,2, Jialin Fan1,2,YiLu1,TaoZeng 1,3, Hailong Wu1, Luonan Chen 1,3,4,5 and Yun Zhao1,4,6 Abstract Genetic robustness is an important characteristic to tolerate genetic or nongenetic perturbations and ensure phenotypic stability. Morphogens, a type of evolutionarily conserved diffusible molecules, govern tissue patterns in a direction-dependent or concentration-dependent manner by differentially regulating downstream gene expression. However, whether the morphogen-directed gene regulatory network possesses genetic robustness remains elusive. In the present study, we collected 4217 morphogen-responsive genes along A-P axis of Drosophila wing discs from the RNA-seq data, and clustered them into 12 modules. By applying mathematical model to the measured data, we constructed a gene modular network (GMN) to decipher the module regulatory interactions and robustness in morphogen-directed development. The computational analyses on asymptotical dynamics of this GMN demonstrated that this morphogen-directed GMN is robust to tolerate a majority of genetic perturbations, which has been further validated by biological experiments. Furthermore, besides the genetic alterations, we further demonstrated that this morphogen-directed GMN can well tolerate nongenetic perturbations (Hh production changes) via computational fi 1234567890():,; 1234567890():,; 1234567890():,; 1234567890():,; analyses and experimental validation. Therefore, these ndings clearly indicate that the morphogen-directed GMN is robust in response to perturbations and is important for Drosophila to ensure the proper tissue patterning in wing disc.
    [Show full text]