Exact Maximal Reduction of Stochastic Reaction Networks by Species

Bioinformatics, 2021, 1–8 doi: 10.1093/bioinformatics/btab081 Advance Access Publication Date: 3 February 2021 Original Paper Downloaded from https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btab081/6126804 by DTU Library user on 06 July 2021 Systems Biology Exact maximal reduction of stochastic reaction networks by species lumping Luca Cardelli,1 Isabel Cristina Perez-Verona,2 Mirco Tribastone ,2,* Max Tschaikowski,3 Andrea Vandin 4 and Tabea Waizmann2 1Department of Computer Science, University of Oxford, Oxford 34127, OX1 3QD, UK, 2IMT School for Advanced Studies, Lucca 55100, Italy, 3Department of Computer Science, University of Aalborg, Aalborg 34127, 9220, Denmark and 4Sant’Anna School of Advanced Studies, Pisa 56127, Italy *To whom correspondence should be addressed. Associtate Editor: Alfonso Valencia Received on October 16, 2020; revised on January 9, 2021; editorial decision on January 27, 2021; accepted on January 28, 2021 Abstract Motivation: Stochastic reaction networks are a widespread model to describe biological systems where the presence of noise is relevant, such as in cell regulatory processes. Unfortunately, in all but simplest models the resulting discrete state-space representation hinders analytical tractability and makes numerical simulations expensive. Reduction methods can lower complexity by computing model projections that preserve dynamics of interest to the user. Results: We present an exact lumping method for stochastic reaction networks with mass-action kinetics. It hinges on an equivalence relation between the species, resulting in a reduced network where the dynamics of each macro- species is stochastically equivalent to the sum of the original species in each equivalence class, for any choice of the initial state of the system. Furthermore, by an appropriate encoding of kinetic parameters as additional species, the method can establish equivalences that do not depend on speciﬁc values of the parameters. The method is sup- ported by an efﬁcient algorithm to compute the largest species equivalence, thus the maximal lumping. The effect- iveness and scalability of our lumping technique, as well as the physical interpretability of resulting reductions, is demonstrated in several models of signaling pathways and epidemic processes on complex networks. Availability and implementation: The algorithms for species equivalence have been implemented in the software tool ERODE, freely available for download from https://www.erode.eu. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online. 1 Introduction on the set of reactions of the network. SE gives rise to a reduced stochastic reaction network where the population of each macro- Stochastic reaction networks are a foundational model to study bio- species tracks the sum of the population levels of all species belong- logical systems where the presence of noise cannot be neglected, for ing to an equivalence class. instance in cell regulatory processes governed by low-abundance As with all reduction methods, SE implies some loss of informa- biochemical species (Guptasarma, 1995), which may introduce sig- tion; namely, the individual dynamical behavior of a species that is nificant variability in gene expression (Elowitz, 2002). Their ana- aggregated into a macro-species cannot be recovered in general. lysis—either by solution of the master equation or by stochastic However, our algorithm for computing SE gives freedom to the simulation—is fundamentally hindered by a discrete representation modeler as to which original variables to preserve in the reduced of the state space (Van Kampen, 2007), which leads to a combina- network. Indeed, building upon a celebrated result in theoretical torial growth in the number of states in the underlying Markov computer science (Paige and Tarjan, 1987), we compute SE as the chain as a function of the abundances of the species. coarsest partition that satisfies the equivalence criteria and that Here, we present a method for exact reduction that preserves the refines a given initial partition of species. Thus, a species of interest stochastic dynamics of mass-action reaction networks, a fundamen- that is isolated in a singleton block is guaranteed to be preserved in tal kinetic model in computational systems biology (Voit et al., the reduced network. Our partition-refinement algorithm is compu- 2015). The method rests on a relation between species, called species tationally efficient, in the sense that the algorithm runs in polyno- equivalence (SE), which can be checked through criteria that depend mial time as function of the number of species and reactions of the VC The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: [email protected] 1 2 L.Cardelli et al. original network. Finally, we can prove the existence of a maximal possible states reachable from r^. We denote with outðrÞ the multiset SE, i.e. the equivalence that leads to the coarsest aggregation of the of outgoing transitions from state r: reaction network. 8 9 < Y = Formally, SE can be seen a lifting to reaction networks of the no- k a rðSÞ outðrÞ¼ r ! r þ p À qjðq ! pÞ2R; k ¼ a : tion of lumpability of Markov chains (Buchholz, 1994; Kemeny and : qðSÞ ; Snell, 1976). That is, the reduced network yields a state space where S2q each macro-state tracks the sum of the probabilities of the states in Downloaded from https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btab081/6126804 by DTU Library user on 06 July 2021 For any two distinct states r and h, we denote by qðr; hÞ the sum the original Markov chain. Ordinary lumpability requires the avail- r h ability of the state space that underlies the master equation (Van of the propensities from to across all reactions, i.e.: X Kampen, 2007); thus, it also requires the initial state of the Markov qðr; hÞ¼ k: chain to be fixed. Instead, SE works at the structural level of the re- k action network, by lumping species instead of states; thus, it involves ðr !hÞ2outðrÞ the analysis of an exponentially smaller mathematical object in gen- Moreover, we set qðr; rÞ to be the negativeP sum of all possible eral. In addition, a practically useful consequence of reasoning at the transitions from state r, i.e. qðr; rÞ¼À h6¼r qðr; hÞ. These values network level is that an SE holds for any initial state. Given that a form the CTMC generator matrix, which characterizes the dynamic- reaction network can be seen as a Petri net where each species is rep- al evolution of the Markov chain by means of the master equation resented as a place (Brijder, 2019), our structural approach is close T p_ ¼ p Q. Each component of its solution, prðtÞ, is the probability in spirit to the notion of place bisimulation (Autant and of being in state r at time t starting from some initial probability dis- Schnoebelen, 1992). However, that induces a bisimulation over tribution (Van Kampen, 2007). markings in the classical, non-quantitative sense (Joyal et al., 1996). Figure 1 shows a simple running example to summarize the main There are several methods for the reduction of the deterministic results of this article using the network in Figure 1a with species S , rate equations of biochemical reaction networks, e.g. Snowden et al. 1 ..., S . The state space from the initial state r^ ¼ S þ 2S is in (2017). However, these reductions do not preserve the stochastic be- 4 1 4 shown Figure 1b. havior in general. For stochastic models in systems biology, lump- Ordinary lumpability is a partition of the state space such that ability has been studied for rule-based formalisms, providing any two states r1, r2 in each partition block H haveP equal aggregate reduction methods based on rule conditions that induce a lumping 0 rates toward states in any block H , i.e. 0 qðr ; rÞ¼ of the underlying Markov chain (Feret et al., 2012, 2013). P r2H 1 0 q r ; r (Buchholz, 1994; Kemeny and Snell, 1976). Given an For mass-action networks, the earlier approach to species lump- r2H ð 2 Þ ing by Cardelli et al. (2017b), called syntactic Markovian bisimula- ordinarily lumpable partition, a lumped CTMC can be constructed tion, suffers from two limitations. First, syntactic Markovian by associating a macro-state to each block; transitions between bisimulation is only a sufficient condition for lumpability. Here, we macro-states are labeled with the overall rate from a state in the prove that SE is the coarsest possible aggregation that yields a source block toward all states in the target. Distinct colored boxes Markov chain lumping according to an equivalence over species. in Figure 1b identify an ordinarily lumpable partition of the sample We show that this yields coarser aggregations than syntactic Markov chain. Ordinary lumpability preserves stochastic equiva- Markovian bisimulation in benchmark models. lence in the sense that the probability of each block/macro-state is The second limitation is that syntactic Markovian bisimulation equal to the sum of the probabilities in each original state belonging only supports networks where reactions involve at most two to that block. reagents. Instead, SE can be applied to arbitrary higher-order reactions. This may appear unnecessary because in models of practical 2.2 Species equivalence relevance reactions are typically of order two at most, following the Verifying the conditions for ordinary lumpability requires the full basic principle that the probability of more than two bodies probabilis- enumeration of the CTMC state space, which grows combinatorial- tically colliding at the same time can be negligible (Gillespie, 1977). ly with the multiplicities of initial state and the number of reactions. However, this generalization enables the identification of ‘qualitative’ Additionally, the presence of interactions such as constitutive tran- relations between species, i.e. equivalences that do not depend on the scription, e.g.

Exact Maximal Reduction of Stochastic Reaction Networks by Species

The Effect of Compression and Expansion on Stochastic Reaction Networks

Type Checking Physical Frames of Reference for Robotic Systems

Dynamic Extension of Typed Functional Languages

Introduction to the Literature on Programming Language Design Gary T

Incremental Type Inference for Software Engineering

Luca Cardelli • Curriculum Vitae • 2014

An Imperative Object Calculus

Two-Domain DNA Strand Displacement

Type Systems.Fm

Linguaggi Di Programmazione Non OOP

The Early History of F

Mobility and Security