Model-Guided Design of Mammalian Genetic Programs Exclusive Licensee J

Total Page:16

File Type:pdf, Size:1020Kb

Model-Guided Design of Mammalian Genetic Programs Exclusive Licensee J SCIENCE ADVANCES | RESEARCH ARTICLE GENETICS Copyright © 2021 The Authors, some rights reserved; Model-guided design of mammalian genetic programs exclusive licensee J. J. Muldoon1,2, V. Kandula3, M. Hong2, P. S. Donahue1,2,4, J. D. Boucher1,2, American Association 1,2,5,6 1,2,5 for the Advancement N. Bagheri , J. N. Leonard * of Science. No claim to original U.S. Government Genetically engineering cells to perform customizable functions is an emerging frontier with numerous techno- Works. Distributed logical and translational applications. However, it remains challenging to systematically engineer mammalian cells under a Creative to execute complex functions. To address this need, we developed a method enabling accurate genetic program Commons Attribution design using high-performing genetic parts and predictive computational models. We built multifunctional pro- License 4.0 (CC BY). teins integrating both transcriptional and posttranslational control, validated models for describing these mechanisms, implemented digital and analog processing, and effectively linked genetic circuits with sensors for multi-input evaluations. The functional modularity and compositional versatility of these parts enable one to satisfy a given design objective via multiple synonymous programs. Our approach empowers bioengineers to predictively de- sign mammalian cellular functions that perform as expected even at high levels of biological complexity. Downloaded from INTRODUCTION these parts (25), and computational and conceptual tools that facilitate Early demonstrations of genetically engineering customized func- the identification of designs that function robustly despite biological tions in mammalian cells indicate a vast potential to benefit applica- variability and cross-talk (26–28). In this study, we sought to address tions including directed stem cell differentiation (1, 2) and cell-based these issues by developing a model-driven process that enables one to therapies (3). In general, most applications require precise control propose a tractable set of candidate circuits for construction and of gene expression and the capability to sense and respond to exter- testing without the need for empirical trial-and-error tuning. We http://advances.sciencemag.org/ nal cues (4–8). Despite the growing availability of biological parts validated this framework by using it to implement a variety of func- (such as libraries of promoters and regulatory proteins) that could tions including digital and analog information processing, as well as be used to control cell states, assembling parts to compose customized sense-and-respond circuits. genetic programs that function as intended remains a challenge, and it often requires iterative experimental tuning or down- selection to identify functional configurations. This highly empirical process limits RESULTS the scope of programs that one can feasibly compose and fine-tune, Biological parts for integrating transcriptional as well as the performance of functional programs identified in and posttranslational control of gene expression this manner. Thus, the need for systematic and precise design pro- The strategy that we pursued for genetic program design was cesses represents a grand challenge in the field of mammalian syn- uniquely enabled by the COmposable Mammalian Elements of on August 8, 2021 thetic biology. Transcription (COMET): a toolkit of TFs and promoters with tun- Model-guided predictive design has been demonstrated in the able properties enabling precise and orthogonal control of gene composition of some cellular functions, including transcriptional expression (13). These TFs comprise a ZF DNA-binding domain logic in bacteria (9) as well as logical (10) and analog behaviors in and a functional domain, e.g., VP16 or VP64, which are activation yeast (11); however, this type of approach is less developed in mam- domains (ADs) that with a ZF form an activator (ZFa). A protein malian systems. To date, transcription factors (TFs) based on zinc including a ZF domain but lacking an AD functions as an inhibitor fingers (ZFs) (12, 13), transcription activator-like effectors (TALEs) by competing with the cognate ZFa for binding. Promoters in this (14–17), dCas9 (18, 19), and other proteins (20) have been used to library contain ZF binding sites arranged in different configurations implement transcriptional logic in mammalian cells. Some of these (e.g., ZF1x6-C has six compactly arranged ZF1 sites). Each combi- studies make use of protein splicing (12, 14, 18). Other studies have nation of a promoter and a ZFa (and potentially an inhibitor) con- used RNA-binding proteins (21), proteases (22, 23), and synthetic fers a characteristic level of transcriptional activity (fig. S1, A to E), protein-binding domains (17). Yet, it remains difficult to predict and as part of this previous work, we developed mathematical models circuit performance based only upon descriptions of the components. to characterize these relationships (13). Here, we investigate whether Associated challenges include the availability of appropriate parts these biological parts and descriptive computational tools can be (24), suitably descriptive models that support predictions using adapted and applied to achieve predictive genetic program design. Although COMET includes many parts for implementing tran- scriptional regulation, we hypothesized that complex genetic program 1Interdisciplinary Biological Sciences Program, Northwestern University, Evanston, design could be facilitated by introducing a mechanism for regula- IL 60208, USA. 2Department of Chemical and Biological Engineering, Northwestern tion at the posttranslational level (Fig. 1, A and B). To investigate University, Evanston, IL 60208, USA. 3Honors Program in Medical Education, North- western University Feinberg School of Medicine, Chicago, IL 60611, USA. 4Medical this strategy, we evaluated new parts based on split inteins: comple- Scientist Training Program, Northwestern University Feinberg School of Medicine, mentary domains that fold and trans-splice to covalently ligate Chicago, IL 60611, USA. 5Center for Synthetic Biology, Chemistry of Life Processes flanking domains (exteins) (29). We selected the split intein gp41-1 Institute, and Robert H. Lurie Comprehensive Cancer Center, Northwestern University, for its rapid splicing kinetics (30). To test an application of this Evanston, IL 60208, USA. 6Departments of Biology and Chemical Engineering, Uni- versity of Washington, Seattle, WA 98195, USA. mechanism, we appended an AD to the gp41-1 N-terminal fragment *Corresponding author. Email: [email protected] (intN) and a ZF to the C-terminal fragment (intC). These parts were Muldoon et al., Sci. Adv. 2021; 7 : eabe9375 19 February 2021 1 of 12 SCIENCE ADVANCES | RESEARCH ARTICLE + + AND – IMPLY – A B C – + D – + Functions to implement in cells Genetic program design VP64- VP64- VP64- NIMPLY intN VP64- intN ZF1 AND IMPLY NAND ZF1 VP64- Design Synthetic Predictive intC- DsDed- + ZF1 intC- ZF1 objective components model Input 2 ZF1 – –+ dx dt / Input 1 Output: OFF ON EF1α EF1α N C ... EF1α EF1α VP64 int DsDed int ZF1 N C VP64 int int ZF1 Mechanisms to use in composing genetic programs intN C Transcriptional regulation TF Enzymatic Multiple layers DsDed int VP64 ZF1 intN Activator Inhibitor Strong inhibitor VP64 ZF1 ZF1x6-C EF1α competition conversion of regulation C int ZF1x6-C mKate2 VP64 ZF1 ZF DsDed ZF AD ZF mKate2 Splicing of an inhibitor into an activatorSplicing of an inhibitor into an activator COMET promoter Target ExperimentFit Prediction Experiment Simulations to explore an expanded design space 4 4 Posttranslational regulation Base caseIncreasing complexity 1 1 N C A int B A’ int B’ 1/4 1/4 intC-ZF1 N Output Output int B Input Input A B’ 0 DsDed-intC-ZF1 0 A’ intC One response Multiple possible context- Stable folding Ligation profile dependent response profiles 01/401/10 2/5 01/401/102/5 01/801/201/5 01/801/201/5 VP64-intN Output: VP64-intN (x109 gene copies) 0 1.5 mKate2 signal 0 0.85 (x106 MEPTRs) + + + + NAND – NIMPLY – NIMPLY – NIMPLY – E – + F – + G – + H – + DsDed- DsDed- DsDed- DsDed- DsDed- intN ZF1 intN ZF1 ZF1 ZF1 Downloaded from intC- VP64- VP64- VP64- ZF1 VP64- intC- ZF1 ZF1 ZF1 ZF1 EF1α EF1α N C EF1α DsDed int int ZF1 EF1α EF1α EF1α N C ZF1 DsDed ZF1 DsDed int VP64 int ZF1 N DsDed ZF1 int intC EF1α EF1α intN VP64 ZF1 ZF1x6-C VP64 ZF1 ZF1x6-C C EF1α VP64 int DsDed ZF1 ZF1x6-C mKate2 VP64 ZF1 ZF1x6-C mKate2 mKate2 mKate2 http://advances.sciencemag.org/ Splicing to a more potent inhibitor Steric inhibition Dual mechanism of inhibition Splicing of an activator into an inhibitor Prediction Experiment Prediction Experiment Prediction Experiment Prediction Experiment 2 2/5 2/5 1/10 1/2 1/10 1/10 1/40 1/8 1/40 1/40 1/160 intC-ZF1 VP64-ZF1 VP64-ZF1 0 0 0 VP64-intC-ZF1 0 01/4 14 01/4 14 01/4 14 01/4 14 01/4 14 01/4 14 0 1/81/2 2 01/8 1/22 DsDed-intN ZF1 DsDed-ZF1 DsDed-intN 0 1.3 0 1.1 0 1.0 0 0.45 + + NIMPLY – AND – I – + J – + K Activation Double inversion DsDed- DsDed- DsDed- DsDed- DsDed- ZF1- intN DsDed- ZF10 ZF10 ZF1- DsDed- VP64-ZF1 PEST ZF10 VP64- PEST intC- ZF1- VP64- PEST ZF10 VP64- ZF10 DsDed- ZF10 VP64- ZF1 ZF1 VP64- ZF10 VP64- on August 8, 2021 ZF1 ZF1 EF1α DsDed ZF10 EF1α EF1α N C DsDed int int ZF10 EF1α EF1α EF1α EF1α VP64 ZF10 ZF10x6-C VP64 ZF1 ZF1x6-C DsDed ZF10 DsDed ZF1 DsDed ZF1 PEST intN mKate2 DsDed ZF10 C EF1α int EF1α EF1α VP64 ZF1 ZF1x6-C VP64 ZF10 ZF10x6-C VP64 ZF10 ZF10x6-C mKate2 DsDed ZF1 PEST Standard activation Inhibition cascade DsDed ZF1 PEST EF1α Fit Prediction VP64 ZF1 ZF1x6-C EF1α 1 0.5 mKate2 VP64 ZF1 ZF1x6-C n = 0.9 n = 2.4 mKate2 Ultrasensitive Inhibition cascade Splicing upstream of an inhibition cascade Prediction Experiment Prediction Experiment Reporter signal Reporter signal 0 (max.-normalized) (max.-normalized) 0 10 10 0.10.2 0.4124 10 0.10.2 0.4124 10 Experiment Experiment 2.5 2.5 1.2 6 n = 1.0 n = 2.8 5/8 5/8 Ultrasensitive 0.8 intC-ZF10 4 DsDed-ZF10 0 MEPTRs) 0 MEPTRs) 6 2 6 0.4 01/4 14 01/4 14 05/8 2.510 05/8 2.510 mKate2 signal mKate2 signal (x10 (x10 DsDed-ZF1 0 0 DsDed-intN 0.1 0.2 0.4 12410 0.10.2 0.4124 10 0 0.40 0 0.5 VP64-ZF1 (x109 gene copies) DsDed-ZF10 (x109 gene copies) Fig.
Recommended publications
  • Logic Functions Realized Using Clockless Gates for Rapid Single-Flux-Quantum Circuits
    R2-4 SASIMI 2021 Proceedings Logic Functions Realized Using Clockless Gates for Rapid Single-Flux-Quantum Circuits Takahiro Kawaguchi Kazuyoshi Takagi Naofumi Takagi Graduate School of Informatics Graduate School of Engineering Graduate School of Informatics Kyoto University Mie University Kyoto University Kyoto, 606-8501, Japan Mie, 514-8507, Japan Kyoto, 606-8501, Japan [email protected] [email protected] [email protected] Abstract— Superconducting rapid single-flux- and the area occupied by a clock distribution network are quantum (RSFQ) circuit is a promising candidate reduced. The reduction of the pipeline depth leads to a for circuit technology in the post-Moore era. RSFQ fewer DFFs for path-balancing and a smaller circuit area. digital circuits operate with pulse logic and are A clockless AND gate and a clockless NIMPLY gate was usually composed of clocked gates. We have proposed proposed in [10]. A NIMPLY (not-imply) gate is an AND clockless gates, which can reduce the hardware gate with one inverted input, which implements x∧¬y.A amount of a circuit drastically. In this paper, we confluence buffer [1], which merges a pulse from its inputs first discuss logic functions realizable using clockless to its output, can be used as a clockless OR gate. gates. We show that all the 0-preserving functions A clockless gate realizes only a 0-preserving logic func- can be realized using the existing clockless gates, tion, which outputs 0 from the input value of all 0s, be- i.e., AND, OR, and NIMPLY (not-imply) gates.
    [Show full text]
  • Layering Genetic Circuits to Build a Single Cell, Bacterial Half Adder
    bioRxiv preprint doi: https://doi.org/10.1101/019257; this version posted May 30, 2015. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 Layering genetic circuits to build a single cell, bacterial half adder 2 Adison Wong1,2,†, Huijuan Wang1, Chueh Loo Poh1* and Richard I Kitney2* 3 4 1School of Chemical and Biomedical Engineering, Nanyang Technological University, Singapore 637459, SG 5 2Department of Bioengineering, Imperial College London, London SW7 2AZ, UK 6 7 * To whom correspondence should be addressed: Chueh Loo Poh, Division of Bioengineering, School of 8 Chemical and Biomedical Engineering, Nanyang Technological University, Singapore 637459, SG. Tel: +65 9 6514 1088; Email: [email protected] 10 11 * Correspondence may also be addressed to: Richard Ian Kitney, Centre for Synthetic Biology and Innovation, 12 and Department of Bioengineering, Imperial College London, London SW7 2AZ, UK. Tel: +44 (0)20 7594 6226; 13 Email: [email protected] 14 15 †Present Address: NUS Synthetic Biology for Clinical and Technological Innovation, National University of 16 Singapore, Singapore 117456, SG 17 18 ABSTRACT 19 Gene regulation in biological systems is impacted by the cellular and genetic context-dependent 20 effects of the biological parts which comprise the circuit. Here, we have sought to elucidate the 21 limitations of engineering biology from an architectural point of view, with the aim of compiling a set of 22 engineering solutions for overcoming failure modes during the development of complex, synthetic 23 genetic circuits.
    [Show full text]
  • Layering Genetic Circuits to Build a Single Cell, Bacterial Half Adder
    bioRxiv preprint doi: https://doi.org/10.1101/019257; this version posted May 12, 2015. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 Layering genetic circuits to build a single cell, bacterial half adder 2 Adison Wong1,2,†, Huijuan Wang1, Chueh Loo Poh1* and Richard I Kitney2* 3 4 1School of Chemical and Biomedical Engineering, Nanyang Technological University, Singapore 637459, SG 5 2Department of Bioengineering, Imperial College London, London SW7 2AZ, UK 6 7 * To whom correspondence should be addressed: Chueh Loo Poh, Division of Bioengineering, School of 8 Chemical and Biomedical Engineering, Nanyang Technological University, Singapore 637459, SG. Tel: +65 9 6514 1088; Email: [email protected] 10 11 * Correspondence may also be addressed to: Richard Ian Kitney, Centre for Synthetic Biology and Innovation, 12 and Department of Bioengineering, Imperial College London, London SW7 2AZ, UK. Tel: +44 (0)20 7594 6226; 13 Email: [email protected] 14 15 †Present Address: NUS Synthetic Biology for Clinical and Technological Innovation, National University of 16 Singapore, Singapore 117456, SG 17 18 ABSTRACT 19 Gene regulation in biological systems is impacted by the cellular and genetic context-dependent 20 effects of the biological parts which comprise the circuit. Here, we have sought to elucidate the 21 limitations of engineering biology from an architectural point of view, with the aim of compiling a set of 22 engineering solutions for overcoming failure modes during the development of complex, synthetic 23 genetic circuits.
    [Show full text]
  • Keeping up with Expressing the Sense of Structured Natural Language Semantics
    NWI-IMK013 Keeping up with expressing the sense of structured natural language semantics Master thesis Information Sciences Author: Student number: T. (Thomas) Nobel S1013787 Supervisor dr. S.J.B.A. (Stijn) Hoppenbrouwers Assessor dr. ir. E. (Eelco) Herder Sunday 10th November, 2019 Abstract Domain-specific languages are not a static entity. On the contrary, they are con- tinuously changing and adapting to their respective environments. Cybernetics can be used to explain why domain-specific language evolution exists. We observe that objects, entities, and terms in different applications such as Fact-Based Modelling (FBM) and lexical databases (such as Wordnet) are fundamentally (logically) equiv- alent. To assert this claim, we will use a diverse set of theories including first- and second-order cybernetics, Wittgenstein, and Kripke semantics to argue why we be- lieve that in essence the challenges of defining domain-specific languages are strongly related with the challenges of formal information modelling. In this thesis, we aim for soundness and reliability while defining the domain-specific language. The goal of combining seemingly different theories is to propose a new framework and basis for further (applied) research in correctly defining, mapping, and comparing domain- specific languages. Our explanation can be used to improve the newly developed tool called A-Lex, which aims to make building domain-specific lexicons and busi- ness data models more approachable for non-IT experts by the use of self-service modelling. A-Lex gives private persons, academics, public authorities and organisa- tions the opportunity to make public collections which can be reused, shared, and imported freely by others (a form of crowdsourcing).
    [Show full text]
  • The Complexity of Boolean Functions
    The Complexity of Boolean Functions Ingo Wegener Johann Wolfgang Goethe-Universit¨at WARNING: This version of the book is for your personal use only. The material is copyrighted and may not be redistributed. Copyright c 1987 by John Wiley & Sons Ltd, and B. G. Teubner, Stuttgart. All rights reserved. No part of this book may be reproduced by any means, or transmitted, or translated into a machine language without the written permission of the publisher. Library of Congress Cataloguing in Publication Data: Wegener, Ingo The complexity of boolean functions. (Wiley-Teubner series in computer science) Bibliography: p. Includes index. 1. Algebra, Boolean. 2. Computational complexity. I. Title. II. Series. AQ10.3.W44 1987 511.3’24 87-10388 ISBN 0 471 91555 6 (Wiley) British Library Cataloguing in Publication Data: Wegener, Ingo The complexity of Boolean functions.—(Wiley-Teubner series in computer science). 1. Electronic data processing—Mathematics 2. Algebra, Boolean I. Title. II. Teubner, B. G. 004.01’511324 QA76.9.M3 ISBN 0 471 91555 6 CIP-Kurztitelaufnahme der Deutschen Bibliothek Wegener, Ingo The complexity of Boolean functions/Ingo Wegener.—Stuttgart: Teubner; Chich- ester; New York; Brisbane; Toronto; Singapore: Wiley, 1987 (Wiley-Teubner series in computer science) ISBN 3 519 02107 2 (Teubner) ISBN 0 471 91555 6 (Wiley) Printed and bound in Great Britain On this version of the “Blue Book” This version of “The Complexity of Boolean Functions,” for some people simply the “Blue Book” due to the color of the cover of the orig- inal from 1987, is not a print-out of the original sources.
    [Show full text]
  • Layering Genetic Circuits to Build a Single Cell, Bacterial Half Adder Adison Wong1,2,3, Huijuan Wang1, Chueh Loo Poh1* and Richard I
    Wong et al. BMC Biology (2015) 13:40 DOI 10.1186/s12915-015-0146-0 RESEARCH ARTICLE Open Access Layering genetic circuits to build a single cell, bacterial half adder Adison Wong1,2,3, Huijuan Wang1, Chueh Loo Poh1* and Richard I. Kitney2* Abstract Background: Gene regulation in biological systems is impacted by the cellular and genetic context-dependent effects of the biological parts which comprise the circuit. Here, we have sought to elucidate the limitations of engineering biology from an architectural point of view, with the aim of compiling a set of engineering solutions for overcoming failure modes during the development of complex, synthetic genetic circuits. Results: Using a synthetic biology approach that is supported by computational modelling and rigorous characterisation, AND, OR and NOT biological logic gates were layered in both parallel and serial arrangements to generate a repertoire of Boolean operations that include NIMPLY, XOR, half adder and half subtractor logics in a single cell. Subsequent evaluation of these near-digital biological systems revealed critical design pitfalls that triggered genetic context-dependent effects, including 5′ UTR interferences and uncontrolled switch-on behaviour of the supercoiled σ54 promoter. In particular, the presence of seven consecutive hairpins immediately downstream of the promoter transcription start site severely impeded gene expression. Conclusions: As synthetic biology moves forward with greater focus on scaling the complexity of engineered genetic circuits, studies which thoroughly evaluate failure modes and engineering solutions will serve as important references for future design and development of synthetic biological systems. This work describes a representative case study for the debugging of genetic context-dependent effects through principles elucidated herein, thereby providing a rational design framework to integrate multiple genetic circuits in a single prokaryotic cell.
    [Show full text]
  • Nonlinear Delay Differential Equations and Their Application to Modeling Biological Network Motifs
    bioRxiv preprint doi: https://doi.org/10.1101/2020.08.02.233619; this version posted August 4, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Nonlinear delay differential equations and their application to modeling biological network motifs David S.Glass1, Xiaofan Jin2, Ingmar H. Riedel-Kruse3,* 1 Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot 76100, Israel 2 Gladstone Institutes, San Francisco, CA 94158, USA 3 Department of Molecular and Cellular Biology, and (by courtesy) Departments of Applied Mathematics and Biomedical Engineering, University of Arizona, Tucson, Arizona 85721, USA * [email protected] Abstract Biological regulatory systems, such as transcription factor or kinase networks, consist of complex dynamical interactions among many components. \Network motif" models focus on small sub-networks to provide quantitative insight into overall behavior. However, conventional network motif models often ignore time delays either inherent to biological processes or associated with multi-step interactions. Here we systematically examine explicit-delay versions of the most common network motifs via delay differential equations (DDEs), both analytically and numerically. We find many broadly applicable results, such as the reduction in number of parameters compared to canonical descriptions via ordinary differential equations (ODE), criteria for when delays may be ignored, a complete phase space for autoregulation, explicit dependence of feedforward loops on a difference of delays, a unified framework for Hill-function logic, and conditions for oscillations and chaos.
    [Show full text]
  • Nonlinear Delay Differential Equations and Their Application to Modeling Biological Network Motifs ✉ David S
    ARTICLE https://doi.org/10.1038/s41467-021-21700-8 OPEN Nonlinear delay differential equations and their application to modeling biological network motifs ✉ David S. Glass 1, Xiaofan Jin 2 & Ingmar H. Riedel-Kruse 3 Biological regulatory systems, such as cell signaling networks, nervous systems and ecolo- gical webs, consist of complex dynamical interactions among many components. Network motif models focus on small sub-networks to provide quantitative insight into overall 1234567890():,; behavior. However, such models often overlook time delays either inherent to biological processes or associated with multi-step interactions. Here we systematically examine explicit-delay versions of the most common network motifs via delay differential equation (DDE) models, both analytically and numerically. We find many broadly applicable results, including parameter reduction versus canonical ordinary differential equation (ODE) models, analytical relations for converting between ODE and DDE models, criteria for when delays may be ignored, a complete phase space for autoregulation, universal behaviors of feedforward loops, a unified Hill-function logic framework, and conditions for oscillations and chaos. We conclude that explicit-delay modeling simplifies the phenomenology of many biological networks and may aid in discovering new functional motifs. 1 Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel. 2 Gladstone Institutes, San Francisco, CA, USA. 3 Department of Molecular and Cellular Biology, and (by courtesy) Departments of Applied Mathematics and Biomedical Engineering, University of Arizona, Tucson, AZ, USA. ✉ email: [email protected] NATURE COMMUNICATIONS | (2021) 12:1788 | https://doi.org/10.1038/s41467-021-21700-8 | www.nature.com/naturecommunications 1 ARTICLE NATURE COMMUNICATIONS | https://doi.org/10.1038/s41467-021-21700-8 iological regulation consists of complex networks of is a corresponding DDE, where x(t − τ) represents the value of x dynamical interactions1–5.
    [Show full text]
  • Reasoning in Description Logics Using Resolution and Deductive Databases
    Zur Erlangung des akademischen Grades eines Doktors der Wirtschaftswissenschaften (Dr. rer. pol.) von der Fakult¨atf¨urWirtschaftswissenschaften der Universit¨atFridericiana zu Karlsruhe genehmigte Dissertation. Reasoning in Description Logics using Resolution and Deductive Databases M.Sc. Boris Motik Tag der m¨undlichen Pr¨ufung: 09. Januar 2006 Referent: Prof. Dr. Rudi Studer, Universit¨atKarlsruhe (TH) Korreferentin 1: Dr. habil. Ulrike Sattler, University of Manchester Korreferent 2: Prof. Dr. Karl-Heinz Waldmann, Universit¨atKarlsruhe (TH) 2006 Karlsruhe ii Abstract Description logics (DLs) are knowledge representation formalisms with well-understood model-theoretic semantics and computational properties. The DL SHIQ(D) provides the logical underpinning for the family of Semantic Web ontology languages. Metadata management applications, such as the Semantic Web, often require reasoning with large quantities of assertional information; however, existing algorithms for DL reasoning have been optimized mainly for efficient terminological reasoning. Although techniques for terminological reasoning can be used for assertional reasoning as well, they often do not exhibit good performance in practice. Deductive databases are knowledge representation systems specifically designed to efficiently reason with large data sets. To see if deductive database techniques can be used to optimize assertional reasoning in DLs, we studied the relationship between the two formalisms. Our main result is a novel algorithm that reduces a SHIQ(D) knowledge base KB to a disjunctive datalog program DD(KB) such that KB and DD(KB) entail the same set of ground facts. In this way, we allow DL reasoning to be performed in DD(KB) using known, optimized algorithms. The reduction algorithm is based on several novel results.
    [Show full text]
  • Theorem Proving and Algebra
    SSSS Theorem Proving and Algebra SSSS Theorem Proving and SSSS Algebra Joseph A. Goguen SSSS Joseph A. Goguen SSSS SSSS S Theorem Proving and Algebra Theorem Proving and Algebra Joseph A. Goguen University of California, San Diego Joseph A. Goguen (1941–2006) Department of Computer Science and Engineering University of California, San Diego 9500 Gilman Drive, La Jolla CA 92093-0114 USA April 2006 © Joseph A. Goguen, 1990–2006. Edited by: Kokichi Futatsugi Narciso Martí-Oliet José Meseguer Text reviewed and commented by: Kokichi Futatsugi Daniel Mircea Gaina Narciso Martí-Oliet José Meseguer Masaki Nakamura Miguel Palomino Book designed and typeset by: Alberto Verdejo Contents Foreword by Editors xiii 1 Introduction 1 1.1 What is Theorem Proving? . 1 1.2 Applications of Theorem Proving . 2 1.3 Logic for Foundations versus Logic for Applications . 2 1.4 Why Equational Logic and Algebra? . 3 1.5 What Kind of Algebra? . 4 1.6 Term Rewriting . 5 1.7 Logical Systems and Proof Scores . 5 1.8 Semantics and Soundness . 6 1.9 Loose versus Standard Semantics . 7 1.10 Human Interface Design . 8 1.11 OBJ . 9 1.12 Some History and Related Work . 10 1.13 Using this Text . 12 1.13.1 Synopsis . 13 1.13.2 Novel Features . 13 1.14 Acknowledgements . 14 2 Signature and Algebra 15 2.1 Sets and Arrows . 15 2.2 Sorted Sets . 16 2.3 Signature . 18 2.4 Signatures in OBJ . 20 2.5 Algebras . 22 2.6 Term Algebras . 24 2.7 Unsorted Signature and Algebra . 26 2.8 Literature .
    [Show full text]