CELL SIZE HOMEOSTASIS AND OPTIMAL VIRAL STRATEGIES
FOR HOST EXPLOITATION
by
Cesar Augusto Vargas-Garcia
A dissertation submitted to the Faculty of the University of Delaware in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Electrical and Computer Engineering
Fall 2017
c 2017 Cesar Augusto Vargas-Garcia All Rights Reserved ACKNOWLEDGEMENTS
I want to thank my advisor Abhyudai Singh. He discovered my professional potential and helped me to achieve this important goal in my life. I am grateful to the Dean, the Faculty, and the Staff of the Department of Electrical and Computer Engineering for providing their assistance and support through the years of my Ph.D. program. I want to thank also my co-advisor and friend, Dr. Ryan Zurakowski. He gave me the opportunity to start and enjoy this field. Also he encouraged me to be resilient in pursuing my degree in the hard days. I want to thank to professor and close friend Henry Arguello for all his support and advice through this years. I also want to thank my wife and daughter, Neyla Johanna and Victoria for their support and encouragement during my studies. They provided me the home to rest after every hard day. My students and alma-mater group HDSP gave me the motivation and encour- agement to make the best effort in my research. They have been my friends and part of my family during this time. I appreciate their collaboration and company in this part of my life. Special thanks to my friends and colleagues Mohammad Soltani and Khem Ghusinga for their support, friendship and collaborations in uncountable and exciting projects which are nowadays the core of my research.
iii TABLE OF CONTENTS
ABSTRACT ...... vi
Chapter
1 PART 1: CELL SIZE CONTROL AND GENE EXPRESSION HOMEOSTASIS IN SINGLE-CELLS ...... 1
1.1 Cell-size regulation: going beyond phenomenological models ..... 1 1.2 Why do organisms control size? ...... 4 1.3 Living with size variations: gene expression homeostasis ...... 5
2 CONDITIONS FOR CELL SIZE HOMEOSTASIS: A STOCHASTIC HYBRID SYSTEMS APPROACH ...... 13
2.1 Timer-dependent growth and division ...... 14 2.2 Size-dependent growth rate ...... 16 2.3 Size-dependent division rate ...... 19
3 A MECHANISTIC STOCHASTIC FRAMEWORK FOR REGULATING BACTERIAL CELL DIVISION ...... 25
3.1 Distribution of the cell-division time given newborn cell size ..... 29 3.2 Distribution of the volume added between divisions ...... 31 3.3 Higher order moments of added volume ...... 34
4 PART 2: OPTIMALITY IN HOST-VIRUS SYSTEMS ...... 37
5 OPTIMAL ADSORPTION RATE: IMPLICATIONS OF THE SHIELDING EFFECT ...... 40
5.1 Traditional virus dynamics model ...... 41 5.2 Modeling the shielding effect ...... 43 5.3 Single virus dynamics ...... 44 5.4 Competition dynamics ...... 45
iv 6 THE EFFECT OF MULTIPLICITY OF INFECTION ON THE TEMPERATENESS OF A BACTERIOPHAGE: IMPLICATIONS FOR VIRAL FITNESS ...... 51
6.1 Model ...... 53 6.2 Why do Bacteriophages display Temperateness? ...... 54 6.3 Probability of survival of a lysogen ...... 57 6.4 The Effect of Multiplicity of infection (MOI) ...... 58
7 CONDITIONS FOR INVASION OF SYNAPSE-FORMING HIV VARIANTS...... 62
7.1 HIV Model ...... 64 7.2 Modeling Synaptic Virus ...... 66 7.3 Competition Model ...... 70
8 OPTIMAL MULTI-DRUG APPROACHES FOR REDUCTION OF THE LATENT POOL IN HIV ...... 75
8.1 HIV Model ...... 77 8.2 Optimal Control and Simulations ...... 83
9 DISCUSSION ...... 87
BIBLIOGRAPHY ...... 90
v ABSTRACT
The first part of this thesis address a question formulated more than 80 years ago (and still remains elusive): how does a cell control its size? Growth of a cell and its subsequent division into daughters is a fundamental aspect of all cellular living systems. During these processes, how do individual cells correct size aberrations so that they do not grow abnormally large or small? How do cells ensure that the concentration of essential gene products are maintained at desired levels, in spite of dynamic/stochastic changes in cell size during growth and division? In chapter 1, we introduce the reader to the field of cell size/content homeostasis. We review how advances in singe-cell technologies and measurements are providing unique insights into these questions across organisms from prokaryotes to human cells. More specifically, how diverse strategies based on timing of cell-cycle events, regulating growth, and number of daughters are employed to maintain cell size homeostasis. We further discuss how size-dependent expression or gene-replication timing can buffer concentration of a gene product from cell-to-cell size variations within a population. In chapter 2, we propose the use of stochastic hybrid systems as a framework for studying cell size homeostasis. We assume that cell grows exponentially in size (volume) over time and probabilistic division events are triggered at discrete time intervals. We first consider a scenario, where a timer (i.e., cell-cycle clock) that measures the time since the last division event regulates both the cellular growth and division rates. We also study size-dependent growth / division rate regulation mechanisms. We provide bounds on different statistical indicators (mean, variance, skewness, etc). Additionally, we assess the effect of different physiological parameters (growth rate, partition errors, etc) on cell size distribution.
vi Chapter 3 introduces a mechanistic model that might explain the recently un- covered added principle, i.e., selected species add a fixed size (volume) from birth to division, irrespective of their size at birth. To explain this principle, we consider a timekeeper protein that begins to get stochastically expressed after cell birth at a rate proportional to the volume. Cell-division time is formulated as the first-passage time for protein copy numbers to hit a fixed threshold. Consistent with data, the model predicts that the noise in division timing increases with size at birth. We show that the distribution of the volume added between successive cell-division events is independent of the newborn cell size. This fact is corroborated through experimental data avail- able. The model also suggest that the distribution of the added volume when scaled by its mean become invariant of the growth rate, a fact also verified through available experimental data. In part 2 of this thesis, we study which strategies are implemented by a viral species, ranging from bacteriophages to human immunodeficiency virus (HIV), in order to exploit host resources. In chapter 4, we review the classical theory of viral-host dynamics and describe the key knobs that viruses tweak to exploit a cell population. This theory suggest that viruses might evolved to have infinite infectivity and virulence. In the case of infectivity, chapter 5 gives an alternative to infinite infectivity: virus will evolve to moderate infectivity because of local interactions. As an example, we study a phage attacking a bacterial population. We include the effect of local interactions by assuming that the phage needs to scape from bacterial death remains (debris). Infinite virulence is also challenged as evolutionary alternative for viral propa- gation. In chapter 5 we study environments where availability of susceptible bacteria fluctuates across time. Under such scenarios bacteria behaves contrary to classical ecology theory: phages evolve to a moderate virulence (lysis time). We present this insights through the use of the stochastic hybrid system framework. In chapter 7, we present a mathematical model of HIV transmission including cell-free and cell-cell transmission pathways. A variation of this model is considered including two populations of virus. The first infects cells only by the cell-free virus
vii pathway, and the second infects cells by either the cell-free or the cell-cell pathway (synapse-forming virus). Synapse-forming HIV is shown to provide an evolutionary advantage relative to non synapse-forming virus when the average number of virus transmitted across a synapse is a sufficiently small fraction of the burst size. HIV disease is well-controlled by the use of combination antiviral therapy (cART), but lifelong adherence to the prescribed drug regimens is necessary to prevent viral re- bound and treatment failure. Populations of quiescently infected cells form a “latent pool” which causes rapid recurrence of viremia whenever antiviral treatment is inter- rupted. A “cure” for HIV will require a method by which this latent pool may be eradicated. Current efforts are focused on the development of drugs that force the quiescent cells to become active. Previous research has shown that cell-fate decisions leading to latency are heavily influenced by the concentration of the viral protein Tat. While Tat does not cause quiescent cells to become active, in high concentrations it prevents a newly infected cell from becoming quiescent. In chapter 8, we introduce a model of the effects of two drugs on the latent pool in a patient on background sup- pressive therapy. The first drug is a quiescent pool stimulator, which acts by causing quiescent cells to become active. The second is a Tat analog, which acts by preventing the creation of new quiescently infected cells. We apply optimal control techniques to explore which combination therapies are optimal for different parameter values of the model.
viii Chapter 1
PART 1: CELL SIZE CONTROL AND GENE EXPRESSION HOMEOSTASIS IN SINGLE-CELLS
1.1 Cell-size regulation: going beyond phenomenological models Size plays an important role in cellular processes and functions of a cell [1], and therefore should be actively maintained. Indeed, cell size distribution of proliferating cells is known to be stable through generations, suggesting regulation of growth and division to correct deviations from a desired cell size. Earlier attempts towards under- standing cell size control was based on population-averaged data on model organisms, bacteria and yeast, and led to proposition of phenomenological models of cell size con- trol. In particular, three models were hypothesized: Timer – a constant time between successive divisions, Sizer – cell division upon attainment of a critical size, and Adder – a constant size addition between consecutive generations. However, validation of these in various organisms remained inconclusive. With recent advances in single-cell technologies, high throughput measurements of cell size over several cell-cycles of in- dividual cells can be made, generating correlation data between different parameters such as cell size at birth, growth rate, cell size at division, division time, etc. This data has stimulated reexamination of phenomenological models of cell size homeostasis. For a broad range of microbes, an individual cell grows exponentially over time with a constant growth rate per size [2,3]. A Timer based mechanism to control division can thus be precluded since it would not be homeostatic [4,5]. Consistent with it, analysis of data for several bacterial species reveals a negative correlation between division time and cell size at birth, implying presence of a size control during cell-cycle. Further investigation reveals the phenomenological strategy: size added from birth to division is uncorrelated with cell size at birth, which is inconsistent with a Sizer model
1 and validates an Adder model (Fig. 1.1a) [6–12]. Existence of such phenomenological models, however, only provides limited perspective since it does not specify how other landmark cell-cycle events (e.g., initiation of DNA replication, assembly of division apparatus) are coordinated with division, and whether the size control is applied from birth to division or between two other cell-cycle events. Studies on E. coli have proposed several formulations that couple initiation of DNA replication to division while being consistent with an Adder between birth and division. One model postulates that size control is primarily exerted over the timing of initiation of DNA replication such that a constant volume per origin of replication is added between two consecutive initiation events. The corresponding division is assumed to occur a fixed time (C+D period; C–time to replicate DNA, D–time between end of replication to division) after initiation [13–15]. Another proposition, which suggests that initiation of DNA replication occurs at a constant size per origin and C+D period depends upon the growth rate, shows that the Adder model is valid only for fast growth rates and the size control behaves as a Sizer for slow growth conditions [16]. A third model argues that for slow growing cells, size control is exerted at two sub-periods (the time from birth to initiation, and the D period) whereas the C period resembles a Timer [17]. So far none of these models have been conclusively validated or falsified, and it would be worthwhile to carry out experiments to this end. Similar couplings between important cell-cycle events and division have been explored in other organisms as well. For C. crescentus, pre-constriction and post- constriction periods have been examined for size control, showing that it obeys a mixer model wherein the time until constriction acts as a Timer followed by the post- constriction period regulated via an Adder [18]. Likewise, the cell-cycle of budding yeast has been investigated by dividing it in two distinct periods: time until G1/S transition, and time from the G1/S transition to division. Despite having an overall Adder between birth to division [19, 20], independent control of both these periods is proposed. In particular, the first period is shown to be dependent upon the size of
2 mother cell, and the second period is controlled by the size of the daughter [20]. Col- lectively, dissecting the cell-cycle in biologically relevant periods for various organisms provides key insights in to how division might be coordinated with other events. An important step moving forward is to understand molecular mechanisms that implement size control over timing of cell-cycle events. To this end, two generic themes have been proposed. One approach is to accumulate a protein in size dependent man- ner up to a threshold. Some notable examples of this include FtsZ to control Z-ring formation, [21, 22], DnaA to control timing of initiation in E. coli [13], and Cdc25 to regulate timing of mitotic entry in S. pombe [23]. Another way to implement a size control over timing is to dilute a protein until a critical level as cell grows in size. A prominent example of this strategy is Whi5 for control of G1 duration in S. cerevisiae [19, 20, 24, 25]. Interestingly, an alternative model shows that an Adder-like behavior can also arise from a very different mechanism of maintaining a constant sur- face area to volume ratio [26]. Apparently, the nutrient intake imposes constraints on this ratio by affecting the synthesis of surface material. The candidate molecules that carry out such function have not been identified yet. It is plausible that molecular players underlying important cell-cycle events interact with each other, and therefore an overarching framework may emerge with further research. How is size control implemented in multicellular organisms? Arguably, these organisms operate in a more complex environment than bacteria and budding yeast; thus, size control strategies adopted by their cells are expected to be affected by physical constraints and thereby be relatively more complicated. Recent data indeed suggests that mammalian cells have different size control strategy in the G1 duration than budding yeast. This strategy phenomenologically resembles a Sizer for small cells, but Adder for larger cells [27]. Examining the data further reveals that for mammalian cells, not only the time spent in G1, but also the growth rate are negatively correlated with size at birth [27] (Fig. 1.1b). This observation has been strengthened by recent work showing size-dependent regulation of growth rate [30,31]. The molecular underpinnings of growth rate control are not well understood, although access to nutrients and physical
3 constraints are expected to play an important role. In a stark contrast to size control on timing of cell-cycle events and growth rate, the green alga C. reinhardtii controls the number of daughter cells it produces upon division in a size controlled manner. It has a prolonged G1 phase in which the size of a newborn cell increases by several folds. The cell then undergoes multiple divisions producing 2n daughter cells. The number of divisions n, on average, is large for large mother cells, so that the daughter cells are close to a target size (Fig. 1.1c) [28, 29]. The molecular mechanism for regulating the number of divisions in C. reinhardtii is suggested to rely on concentration dilution. Here, a cyclin dependent kinase CDKG1 is produced towards the end of G1 phase and its production depends on the size of the cell. With each division, this protein is degraded and further divisions stop once its concentration goes below a threshold [28].
1.2 Why do organisms control size? Recent experiments in proliferating animal cells unravel an interesting connec- tion between cell size homeostasis and cellular fitness. By sorting a population of growing cells based on their sizes, it was shown that cellular proliferation (i.e., fitness) is low for small and large cells, but high at intermediate sizes [32, 33] (Fig. 1.2). The decrease in the fitness for large cells could not be attributed to higher cell-death since the apoptosis rates here were similar for large and average–sized cells. Notably, a sim- ilar optimality also exists for mitochondrial activity (metabolism) of cells [33]. Since the metabolism is linked to the growth, it is not too surprising that existence of an optimal size can be theoretically shown to arise from joint regulation of timing and growth [34].
Do unicellular organisms also have an optimal size for a given growth condition? Intuitively, a bigger bacterial cell will divide faster and thus its proliferation would be higher than a smaller cell. However, the fact that the average bacterial cell size grows exponentially with growth rate (per size) imposed by the nutrients suggests that
4 there may be a specific target cell size for the given environment [36], which may be determined by several factors such as the surface to volume ratio [26], availability of fatty acids [37], etc. Observing an optimal size in bacteria via experiment such as [32] is perhaps hard because of a narrow range of cell sizes as compared to mammalian cells. It would be interesting to see results of competition between mutants with artificially large sizes, such as those obtained by expression of useless proteins in [21].
1.3 Living with size variations: gene expression homeostasis Going beyond cell size homeostasis, single-cell approaches are also elucidating mechanisms for gene-product concentration homeostasis. In particular, how is the con- centration of given RNA/protein buffered to random or cell-cycle dependent fluctua- tions in cell size. This problem is especially acute for mammalian cells, where individual synchronized cells from the same population, exposed to identical nutrients, exhibit a six-fold variation in size [38]. Since rates of intercellular biochemical processes typically depend on the concentration of molecular species, cells must maintain concentration levels despite dramatic single-cell size deviations from the population average. Adding to size variations, unsynchronized cells also differ in the number of gene copies, and it is unclear to what degree expression is compensated for gene dosage across organisms.
In principle, one can envision different strategies for maintaining a desired gene- product concentration: size-dependent regulation of synthesis and/or decay rates with perfect gene dosage compensation, or scaling of gene dosage with cell size accompa- nied by size-independent expression per gene copy (Fig. 1.3). Prior investigations on inferring these mechanisms have relied on bulk-assay expression measurement utilizing cell size mutants, or altering size through small-molecules drugs [39, 40]. Instead of artificially changing the average cell size, exploiting natural intracellular size varia- tions within a population provides a more physiologically perturbation-free setting to study concentration homeostasis. Building up on this theme, recent works have used
5 mRNA fluorescence in situ hybridization (RNA FISH) to count mRNAs inside indi- vidual cell along with precise cell size measurement within a population of mammalian cells [38,41]. For most genes, mRNA copy numbers scale linearly with size, implying a gene-specific mRNA concentration set point (Fig. 1.4a). Intriguingly, the data further points to gene dosage compensation - two similarly sized cells in G1 and G2 phase will have same number of mRNAs, on average [38, 42]. While this mRNA count vs. size scaling observed in diverse mammalian cell types seems intuitive and also previously reported in bulk-assay experiments [40], its mechanistic underpinnings and molecular implementations remained elusive. Measurements of promoter activity of individual gene copies in single cells revealed for the first time that concentration homeostasis is orchestrated through modulation of the transcriptional burst size - with increasing cell size more RNA polymerases initiate transcription when the gene randomly switches to an active state (Fig. 1.4b). In contrast, gene dosage compensation occurs through the burst frequency (i.e., how often a gene become active) which is approximately halved upon gene duplication during the S phase. It is interesting to note that sim- ilar size-dependent transcription rates have recently been found in A. thaliana [43], suggesting the homeostasis principles uncovered through single-cell measurements are broadly applicable to animal and plant cells. Studies in budding yeast show similarities and striking differences with their mammalian counterparts. For example, as in mammalian cells, gene-dosage compen- sation occurs during the S-phase of S. cerevisiae, wherein incorporations of acety- lated histones into newly replicated regions leads to suppressed transcription per gene copy [44]. However, unlike mammalian cells, nascent transcription rates for RNA poly- merase II genes have been reported to be size-independent in budding yeast [45]. This implies that a newborn yeast cell growing in G1 will have reduced mRNA synthesis with increasing size, in the sense of mRNAs added per volume per unit time. Intrigu- ingly, this decreased synthesis rate is compensated by a corresponding change in RNA stability to maintain mRNA concentration [45]. Previously, mRNA degradation rate modulation as a function of growth rate/temperature (which should indirectly affect the
6 cell-size) has also been observed in yeast cells [46–49]. In this case, the transcription- degradation cross talk for concentration homeostasis is hypothesized to occur through molecular factors that continuously shuttle between the nucleus/cytoplasm to coordi- nate and participate in both transcription and decay processes [50]. While transcription of RNA polymerase II genes is size-independent, transcription of ribosomal RNAs via RNA polymerase I is size-dependent [45], and this latter mechanism maybe critical in maintaining a fixed concentration of ribosomes.
In contrast to eukaryotic cells, the bacterial cells do not seem to compensate for DNA dosage [51]. In this case, gene-product concentrations decrease with increase in size for fixed ploidy, suggesting that a strong coupling between gene-dosage and cell size can lead to near-constant gene product concentrations. In agreement with this, [52] has shown that cyanobacteria cells are able to maintain concentration of proteins by scaling their gene dosage with cell size. (Fig. 1.3c). A related issue is how change in gene dosage affects concentration of its products through the cell-cycle. Intuitively, the gene product concentration is expected to decrease until associated gene duplicates (based on its location on the chromosome) and increase back thereafter [53]. Since genes located at different places on the chromosome are duplicated at different times, such imbalance in gene dosage is utilized by B. subtilis for coordination of sporulation [54]. It is also worth noting that the bacterial cell size distribution is quite narrow, hinting that the scaling DNA dosage with cell size may suffice to maintain near constant gene product concentrations for some genes strategically placed on the chromosome. Recent experiments on E. coli indeed suggest that the fluctuations in protein level are not very large (about 4% ) for several genes and a near constant concentration is maintained during the cell-cycle [55]. These fluctuations may be further suppressed via autoregulation which is a prevalent feedback mechanism in prokaryotes. In the remaining chapters (Chapter 8 and 9), we discuss the conditions for achieving cell size homeostasis, and describe how a specific example explains size reg- ulation mechanisms in E.coli bacterium.
7 iue11 ies taeisbsdo euaigtmn fcell-cycle of timing regulating on based strategies Diverse 1.1: Figure
Added size Cell cycle a) at division time Size Newborn length Newborn Newborn length Newborn Birth Added size Added E. coli Newborn size Newborn nege ai lentn eiso iiin Spae n ioe or mitoses and cycles phases sion 2 (S produce alga divisions to unicellular of S/M) series The alternating c) rapid G Adder undergoes [27]). the the during of to size 5A but in (corresponding exponentially cells, newborns small larger Fig. for for model; size model) size-checkpoint of or independent [27]). is Sizer of a 4C larger to G for (corresponding Fig. to weaker birth from becomes from taken size cells correlation added Leukemia the The area Basophilic and Rat surface on size), (cell (data for size G cells proxy newborn the a with of as correlation timing used negative The strong newborns. a small hibits is to size) compared by as G slower (normalized during Unlike rate b) growth growth size-dependent [7]). the of exhibit 4A contrast, 4C Fig. (Fig. In size from with [7]). taken uncorrelated data of birth model; 2F from Adder Fig. size the fixed and a to sharply add (corresponding duration to division as dur- cell-cycle so to size) the size for level, newborn proxy single-cell increasing a with the decreases as At used cell-cycle. length the (cell ing size in exponentially grows cell agtrcl iei edapoiaeycntn seFg f[29]). of 4 Fig. (see constant approximately held is size cell daughter cell homeostasis mother size per cell daughters maintaining of for exploited number are and rate, growth events, Cell cycle
Growth rate/size Newborn length Newborn Division n )sc htteaverage the that such [ 28]) ( size cell mother with increases b) Added size G1 duration n at G1/S 80 40 12
agtr.A igecl ee,tenme fdivi- of number the level, single-cell At daughters. Size 0 8 4 Mammalian cell Mammalian Birth Fast growth Newborn area Newborn Newborn area Newborn 0 4 180 140 100 0 4 180 140 100 G 8 1 /S 1 /S Growth Slow growth
1 rate/size rniindcesswt ebr size newborn with decreases transition Birth Newborn area Newborn eid(npeec flgt n then and light) of presence (in period G 1 1 /S ihlre ebrsgrowing newborns larger with ,
Daughter Divisions c) .coli E. 1 3 2 size 4 Size it aeG Late Birth C. reinhardtii .reinhardtii C. Mother size Mother size Mother Mother size Mother amla cells mammalian , )An a) . 1 1 Daughter size Daughter hs ex- phase S/M .coli E. grows iue12 notmlcl iemxmzsfins ihnapplto of population a within fitness maximizes size cell optimal An 1.2: Figure Small Small a) b)
Fitness Cell size(FSC) (Relative cell count) Intermediate sdDoohl elln) rgnlcl iedsrbto ssoni grey. in shown is widely distribution (a size Kc167 cell Original and line) line). cell cell (human primary Drosophila HUVEC used a different line), cells; three cell of endothelial for lymphocyte function vein T sorting a umbilical (human of cells as time Jurkat plotted the types: is at cell fitness FSC Measured methods subpopulation b) and average material details. the the further change to for relative referred [32] are the of readers measuring Interested by count. quantified cell 3 in (approximately is hrs fitness 72 and for cultured generations), unsynchro- is cell original different subpopulation an with Each subpopulations sort several sizes. to into used (grey) is population cytometry cell nized flow [32,35], size cell for cells mammalian Cell sorting Large Small Small Cell size(FSC) )Uigfradsatritniy(S)a proxy a as (FSC) intensity scatter forward Using a) . Intermediate Cell size(FSC) 9 Large Cell culture 72 hr
Fitness (Relative cell count) Small Small Cell size(FSC) Intermediate Size distribution Fitness − cell 5 Large iue13 oeta ehnssdiiggn-rdc concentration gene-product driving mechanisms Potential 1.3: Figure mRNA transcription a) Haploid eraewt nraigsz o xdploidy. will fixed concentrations for mRNA size as increasing two perfect first with not the decrease is Unlike homeostasis size homeostasis. here cell concentration between strategies, to coupling leads strong dosage approx- A concentration. by gene and increases replication. fixed and gene a size-independent upon maintain is 2-fold imately transcription to stability of mRNA size rate the cell The and transcription c) with size, The cell decreases and b) exhibit rate) dosage cells (decay gene compensation. (vilot) of dosage independent diploid gene is and rate to violet) due (light scaling haploid similar size-invariant Both a individual with cells, an rate. single of decay in time) size with unit proportionally per increases synthesizes gene mRNAs level of (number RNA scription the at demonstrated homeostasis Diploid b) elsize Cell 10 c) )Tert ftran- of rate The a) .
mRNA half life iue14 amla el ananmN ocnrtosindependent concentrations mRNA maintain cells Mammalian 1.4: Figure a) mRNA count 0.1 0.2 0.3 0 0 1 2 6 4 2 0 Cell size(picoliters) IER2 UBC fsz hog ouaino rncitoa us ieadfre- and size size burst cell transcriptional with of quency modulation through size of .A osqec,tosmlryszdclsi G in of cells similarly-sized fraction dosage two gene the consequence, to G replication, a leading halved As gene approximately [38]. Upon is compensation ON in cells. from promoter synthesized larger the mRNAs time in of number state) size (the with active size rate the burst transcription higher of transcription a scaling novo in- from linear transcriptionally results The de between for states. switching active relatively proxy promoter and are a active a mRNAs of is chosen Schematic Both b) count mRNA [38]). rate. indepen- of hence are 3B and half-lives and unstable, 3A mRNA (Fig. single the size across while size of population, with dent same linearly the scale IER2) from and cells (UBC genes two for FISH 2 ilhv h aemN count. mRNA same the have will 8 0 0.5 1.0 0 2
mRNA decay constant b) )TemN oynmesmaue i RNA via measured numbers copy mRNA The a) . 11 Gene off Gene on Time RNA polymerase 1 mRNA and Chapter 2
CONDITIONS FOR CELL SIZE HOMEOSTASIS: A STOCHASTIC HYBRID SYSTEMS APPROACH
Stochastic hybrid systems (SHS) constitute an important mathematical mod- eling framework that combines continuous dynamics with discrete stochastic events. Here we use SHS to model a universal feature of all living cells: growth in cell size (volume) over time and division into two viable progenies (daughters). A key ques- tion is how cells regulate their growth and timing of division to ensure that they do not get abnormally large (or small). This problem has ben referred to literature as size homeostasis and is a vigorous area of current experimental research in diverse organisms [3,7,8,11,18,59–69]. We investigate if phenomenological models of cell size dynamics based on SHS can provide insights into the control mechanisms needed for size homeostasis. The proposed model consists of two non-negative state variables: v(t), the size of an individual cell at time t, and a timer τ that measures the time elapsed from when the cell was born (i.e., last cell division event). This timer can be biologically interpreted as an internal clock that regulates cell-cycle processes. Time evolution of these variables is governed by the following ordinary differential equations
v˙ = α(v, τ )v, τ˙ = 1, (2.1) where the growth rate α(v, τ ) ≥ 0 can depend on both state variables and is such that (2.1) has a unique and well-defined solution ∀t ≥ 0 (i.e., cell size does not blow up in finite time). A constant α implies exponential growth over time. As the cell grows in size, the probability of cell division occurring in the next infinitesimal time interval (t, t + dt] is given by f(v, τ )dt, where f(v, τ ) can be inter- preted as the division rate. Whenever a division event is triggered, the timer is reset to
12 zero and the size is reduced to βv, where random variable β ∈ (0, 1) is drawn from a beta distribution. Assuming symmetric division, β is on average half, and its coefficient of variation (CVβ) quantifies the error in partitioning of volume between daughters. To be biologically meaningful, α(v, τ ) is a non-increasing function, while f(v, τ ) is a non-decreasing function of its arguments. The SHS model is illustrated in Fig. 2.1 and incorporates two key noise sources: randomness in partitioning and timing of division. Next, we explore conditions for size homeostasis, in the sense that, the mean cell size does not converge to zero, and all statistical moments of v remain bounded.
Figure 2.1: SHS model for capturing time evolution of cell size. The size of an individual cell v(t) grows exponentially with growth rate α(v, τ ), where τ represents a timer that measures the time since the last division event. The arrow represents cell division events that occur with rate f(v, τ ), which resets τ to zero and divide the size by approximately half. A sample trajectory of v(t) is shown with cycles of growth and division.
2.1 Timer-dependent growth and division We begin by considering a scenario, where both the growth and division rates are functions of τ , but do not depend on v. The SHS can be compactly written as
v˙ = α(τ )v, τ˙ = 1, (2.2) with reset maps v 7→ βv, τ 7→ 0 (2.3) that are activated at the time of division. The timer-controlled division rate f(τ ) can be interpreted as a “hazard function” [70]. Let T1, T2, ... denote independent
13 and identically distributed (i.i.d.) random variables that represent the time interval between two successive division events. Then, based on the above formulation, the probability density function (pdf) for Ti is given by
− R x f(y)dy Ti ∼ f(x)e y=0 , ∀ x ≥ 0 (2.4)
[70]. Note that a constant division rate in (2.4) would lead to an exponentially dis- tributed Ti. For this class of models, the steady-state statistics of v is given by the following theorem.
Theorem 1: Consider the SHS (2.2)-(2.3) with timer-dependent growth and division rates. Then
T D R i α(y)dyE 0 e y=0 < 2 lim hv(t)i = (2.5) T t→∞ D R i α(y)dyE ∞ e y=0 > 2, where the symbol h i is used to denote the expected value of a random variable. More- over,
0 < lim hv(t)i < ∞, lim hv2(t)i = ∞ (2.6) t→∞ t→∞
T D R i α(y)dyE when e y=0 = 2.
th Proof of Theorem 1: Let vi−1 denote the cell size just at the start of the i cell cycle. Using (2.2), the size at the time of division in the ith cell cycle is given by
T R i α(y)dy vi−1e y=0 . (2.7)
Thus, the size of the newborn cell in the next cycle is
T R i α(y)dy vi = vi−1xi, xi := βie y=0 , (2.8) where βi ∈ (0, 1) are i.i.d random variables following a beta distribution and xi are i.i.d. random variables that are a function of βi and Ti. From (2.8), the mean cell size at the start of ith cell cycle is given by
i hvii = v0hxii (2.9)
14 and will grow unboundedly over time if hxii > 1, or go to zero if hxii < 1. Using the fact that hβii = 0.5 (symmetric division of a mother cell into daughter cells), βi and Ti are independent, (2.5) is a straightforward consequence of (2.9). It also follows from (2.8) that hv2i = v2hx2ii = v2hx i2i(1 + CV 2 )i (2.10) i 0 i 0 i xi where CV 2 represents the coefficient of variation squared of x . When hx i = 1 then xi i i hvii = v0 and hv2i = v2(1 + CV 2 )i. (2.11) i 0 xi
Note that when the system is completely deterministic, i.e., pdfs for Ti and βi are given by delta functions, CV 2 = 0. However, the slightest noise in these variables will lead xi to CV 2 > 0, in which case (2.11) implies (2.6). xi T D R i α(y)dyE In summary, unless functions α(τ ) and f(τ ) are chosen such that e y=0 = 2, the mean cell size would either grow unboundedly or go extinct. Moreover, even if the mean cell size converges to a non-zero value, the statistical fluctuations in size would grow unboundedly. Hence, size-based regulation of growth/division rates is a necessary condition for size homeostasis.
2.2 Size-dependent growth rate Recent work measuring sizes of single mammalian cells over time has reported lowering of growth rates as cells become bigger [71–73]. To explore the effects of such regulation, we consider a growth rate α(v, τ ) that now depends on size. As in the previous section, timer-controlled division events occur with rate f(τ ) resulting in inter-division times Ti given by (2.4). The following result shows that size homeostasis is possible if growth rate is appropriately bounded from below and above. Theorem 2: Let the growth rate be bounded by
α(v, τ )v ≤ k(τ )vp, p ∈ [0, 1), ∀v ≥ 0 (2.12)
15 for some non-increasing function k(τ ). Moreover, the growth rate of a small cell is large enough such that
D R Ti E α0(y)dy e y=0 > 2, α0(τ ) := lim α(v, τ ). (2.13) v→0
Then 1 lhk(τ )ihT i 1−p 0 < lim hvl(t)i < i (2.14) t→∞ h1 − βli where l ∈ {1, 2,... }, hTii is the mean cell-cycle duration, and β ∈ (0, 1) is a random variable quantifying the error in partitioning of volume between daughters.
Proof of Theorem 2: Consider a newborn cell with a sufficiently small size born at time t = 0. Then, the mean cell size will grow in successive generation iff the second inequality in (2.5) is true for α0(τ ), which results in (2.13). Based on the Dynkin’s formula for the SHS (2.1) and (2.3), the time evolution of moments is given by
dhvli = f(τ )vl hβli − 1 + l α(v, τ )vl , (2.15) dt for l ∈ {1, 2,... } [74]. Using (2.12),
dhvli ≤ f(τ )vl hβli − 1 + l k(τ )vl−1+p . (2.16) dt Note that f(τ )vl = f(τ )hvl|τ i (2.17) where hvl|τ i is the expected value of vl conditioned on τ . Based on the time evolution of cell size in (2.1), hvl|τ i is an increasing function of τ (cells further along in the cell cycle, have on average, larger sizes). Since hvl|τ i and f(τ ) are monotone non- decreasing function of τ f(τ )vl ≥ hf(τ )ihvli. (2.18)
Similarly, since k(τ ) is a non-increasing function,
k(τ )vl−1+p ≤ hk(τ )ihvl−1+pi. (2.19)
16 Finally, using the fact that l − 1 + p ≤ l as p ∈ [0, 1)
l−1+p l−1+p hvl−1+pi = vl l ≤ vl l (2.20)
Using (2.18)-(2.20), (2.16) reduces to the following inequality
dhvli l−1+p ≤hf(τ )ihvli hβli − 1 + lhk(τ )i vl l . (2.21) dt Since at steady state 1 hf(τ )i = , (2.22) hTii [75], (2.21) implies (2.14).
An extreme example of size-dependent growth is k α(v, τ ) = , k > 0 (2.23) v which corresponds to cells growing linearly in size, as experimentally reported for some organisms [76]. For this case, the result below provides exact closed-form expressions for the first and second-order statistical moments of v.
Theorem 3: Consider the growth rate (2.23) that results in the following SHS contin- uous dynamics v˙ = k, τ˙ = 1. (2.24)
Then, the steady-state mean and coefficient of variation squared of cell size is given by 2 khTii 3 + CV lim hv(t)i = Ti , (2.25) t→∞ 2 3 hTi i 2 4 4 9 3 − 9 − 6CVT − 7CVT 2 1 hTii i i CVv = + 27 27 3 + CV 2 2 Ti 16CV 2 + β , (2.26) 3(3 − CV 2)(3 + CV 2 ) β Ti where CV 2 and CV 2 denote randomness in the inter-division times (T ) and parti- Ti β i tioning errors (β), respectively, as quantified by their coefficient of variation squared.
17 The proof of Theorem 3 can be found in the Appendix. Interestingly, the mean cell size in (2.25) not only depends on the mean inter-division times hTii, but also on its second-order moment CV 2 . Thus, making the cell division times more random (i.e., Ti increasing CV 2 ) will also lead to larger cells on average. Similar effects of CV 2 on Ti Ti mean gene expression levels have recently been reported in literature [77, 78]. More-
2 over, (2.26) shows that the magnitude of fluctuations in cell size (CVv ) depend on 2 Ti through its moments up to order three. Note that if CVβ = 0 (no partitioning errors) and Ti = hTii with probability one (deterministic inter-division times), then 2 2 CVv = 1/27. This non-zero value for CVv in the limit of vanishing noise sources repre- sent variability in size from cells being in different stages of the deterministic cell cycle.
2 Theorem 3 decomposes CVv into terms representing contributions from different noise 2 sources. The terms from left to right in (2.26) represent contributions to CVv from i) Deterministic cell-cycle and ii) Random timing of division events and iii) Partitioning errors at the time of division. Assuming lognormally distributed Ti,
hT 3i/hT i3 = 1 + CV 2 3 . (2.27) i i Ti
Substituting (2.27) in (2.26) and plotting CV 2 as a function of CV 2 and CV 2 , re- v β Ti veals that stochastic variations in cell size are more sensitive to partitioning errors as compared to noise in the inter-division times. In summary, our result show that appropriate regulation of growth rate by size (as seen in mammalian cells) can be an effective mechanism for achieving size homeostasis. We next consider a different class of models where size-based regulation is at the level division rather than growth.
2.3 Size-dependent division rate In contrast to growth rate control, many organisms rely on size-dependent reg- ulation of division rate for size homeostasis [2,3, 79–81]. To analyze this strategy, we consider the SHS continuous dynamics (2.2) with a timer-dependent growth rate α(τ ), and a division rate f(v, τ ) that now depends on size. The theorem below provides
18 Cell cycle time
Partitioning in cell size Mean cell size Stochastic variability
Noise in partitioning and cell cycle time
Figure 2.2: Stochastic variation in cell size (blue) and mean cell size (green) as a func- tion of CV 2 (noise in inter-division time) and CV 2 (error in partitioning Ti β of volume among daughters) for linear cell growth and a timer-based division mechanism. The mean cell size is dependent on CV 2 but in- Ti 2 dependent of CVβ . Fluctuations in cell size increase more rapidly with CV 2 than with CV 2 . β Ti
19 sufficient conditions on f(v, τ ) for size homeostasis.
Theorem 4: Let there exist a non-decreasing function g(τ ) and p > 0 such that
f(v, τ ) ≥ g(τ )vp. (2.28)
Moreover, the division rate for a sufficiently small cell size f0(τ ) := limv→0 f(v, τ ) satisfies
D R Ti E R x α0(y)dy − f(0y)dy e y=0 > 2,Ti ∼ f0(x)e y=0 . (2.29)
Then, for the SHS given by (2.2) and (2.3)
l lhα(τ )i p 0 < lim hvl(t)i < , (2.30) t→∞ hg(τ )i(1 − hβli) for l ∈ {1, 2,... }.
Proof of Theorem 4: Consider a newborn cell with a sufficiently small size at time t = 0. Then, based on Theorem 1, the mean size will grow over successive generations (and not go extinct) iff (2.29) holds. Based on the Dynkin’s formula for (2.2)-(2.3), the time evolution of moments is given by
dhvli = lα(τ )vl − f(v, τ )vl 1 − βl (2.31) dt Using (2.28), the fact that α(τ ) is a non-increasing function, while g(τ ) is a non- decreasing function,
dhvli ≤l hα(τ )i vl − hg(τ )i vl+p 1 − βl (2.32) dt
l+p l+p l l Finally, using v ≥ v in (2.32) result in (2.30) at steady state.
Next, we show that different known strategies for size-dependent regulating of inter-division times are consistent with Theorem 4. A common example of size- dependent division is the “sizer strategy”, where a cell senses its size, and divides when
20 a critical size threshold is reached [24,82–84]. Such as strategy can be implemented by
v p f(v, τ ) = (2.33) v¯ wherev ¯ and p are positive constant. A large enough p corresponds to division events occurring when size reachesv ¯. In contrast to the sizer strategy, many bacterial species use an “adder strategy”, where a cell divides after adding a fixed size from birth [6,9, 10, 19]. In the case of exponential growth (constant growth rate α), the adder strategy can be implemented by
v (1 − e−ατ )p f(v, τ ) = . (2.34) v¯
A large enough p would correspond to cells adding a fixed sizev ¯ between cell birth and division [22]. Both these division rates are consistent with the form of f required for size homeostasis in Theorem 4. We investigate the first two moments of v in more detail for the sizer strategy. Using (2.31) for a constant growth rate α and division rate (2.33) results in the following moment dynamics
d vl = lα vl − v¯−p vl+p 1 − βl . (2.35) dt
Let µ = hvi , hv2i · · · vL T be a vector of moments up to order L, where L is the order of truncation. Using (2.35), the time evolution of µ can be compactly written as
dµ T = a + Aµ + Cµ,¯ µ¯ = vL+1 ··· vL+p (2.36) dt for some vector a, matrices A and C, andµ ¯ is the vector of higher order moments. Note that nonlinearities in the division rate lead to the well known problem of moment closure, where time evolution of µ depends on higher-order momentsµ ¯. Moment closure techniques that expressµ ¯ ≈ θ (µ) are typically used to solve equations of the form (2.36). Here, we use closure schemes based on the derivative-matching technique [85– 87], that yield analytical expressions for the steady-state moments. For example, L = 2
21 in (2.36) (second order of truncation) results in the following steady-state mean and coefficient of variation squared of cell size
p+1 1 2 2p ! p 1 1 3 − CVβ 4 p p 2 hvi ≈ 2 α v¯ ,CVv ≈ 2 − 1, (2.37) 4 3 − CVβ respectively. Intriguingly, (2.37) shows that the mean cell size decreases with increasing
2 magnitude of partitioning error CVβ . While the results from (2.37) are qualitatively consistent with moments obtained via Monte Carlo simulations, a much higher order of truncation is needed in (2.36) to get an exact quantitative match (Fig. 3).
Approximation: 2nd order 20th order Simulations
Figure 2.3: Stochastic variation in cell size (blue) and mean cell size (green) as a 2 function of CVβ (error in partitioning of volume among daughters) for exponential cell growth and sizer-based division mechanism. The mean 2 cell size decreases with increasing CVβ , while noise in cell size increases with it. Results are shown for a 2nd (dashed) and a 20th (solid) order moment closure truncation, and compared with moments obtained by running a large number of Monte Carlo simulations. Errors bars show 95% confidence estimates.
Here we have used a phenomenological SHS framework to model time evolution of cell size (Fig. 8.1). The model is defined by three features: a growth rate α(v, τ ), a
22 division rate f(v, τ ), and a random variable β ∈ (0, 1) that determines the reduction in size when division occurs. A key assumption was that α and f are monotone functions: with increasing size and cell-cycle progression, the growth rate decreases, and propensity to divide increases. Our main contribution was to identify sufficient conditions on α and f that prevent size extinction and also lead to bounded moments (Theorems 2 and 4). In essence, these conditions require the growth (division) rate to decrease (increase) with cell size in a polynomial fashion. We also analyzed two strategies for size homeostasis: i) Linear growth in size with timer-controlled divisions and ii) Exponential growth in size with size-controlled divisions. Analysis reveals that in the former strategy, the mean cell size is independent of volume partitioning errors at the time of mitosis. In contrast, the mean cell size decreases with increasing partitioning errors for size-controlled divisions. Moreover, stochastic variations in cell size are found to be highly sensitive to partitioning errors for both strategies (Fig. 2 and 3). This suggests that cells may use mechanisms to minimize volume mismatch among daughter cells. In summary, theoretical tools for SHS can provide fundamental understanding of regulation needed for size homeostasis. Future work will focus on coupling cell size to gene expression, and understanding how concentration of a given protein is maintained in growing cells [38,40,88,89].
23 Chapter 3
A MECHANISTIC STOCHASTIC FRAMEWORK FOR REGULATING BACTERIAL CELL DIVISION
Recurring cycles of growth and division of a cell is a ubiquitous theme across all organisms. How an isogenic population of exponentially growing cells maintains a nar- row distribution of cell size, a property known as size homeostasis, has been extensively studied, e.g., see [1,3, 83, 90] and references therein. From a phenomenological stand- point, recent experiments reveal that diverse microorganisms achieve size homeostasis via an adder principle [7–10]. As per this strategy, cells add a constant size from birth to division regardless of their size at birth [6,91]. Interestingly, the size accumulated by a single cell between birth and division exhibits considerable cell-to-cell differences, and these differences follow unique statistical properties. For example, in a given growth condition, the added size is drawn from a fixed probability distribution independent of the newborn cell size. Moreover, the distribution of the added size normalized by its mean is invariant across growth conditions [8]. Here, we explore biophysical mod- els that lead to the adder principle of cell size control and provide insights into its statistical properties. To realize the adder principle mechanistically, a cell needs to somehow track the size it has accumulated since the previous division and trigger the next division upon addition of the desired size. One biophysical model proposed to achieve this assumes a protein which begins to get expressed right after cell birth at a rate proportional to instantaneous volume (size). The cell grows exponentially over time and division is triggered when protein copy numbers reach a critical threshold after which the protein is assumed to degrade (Fig. 3.1a) [6,9, 21]. Such copy number dependent triggering of cell division could potentially be implemented via the localization of protein into
24 compartments whose volume does not change appreciably with the cell volume [62]. Moreover, the synthesis and the degradation of the protein in this model are used in broad sense; they could as well be activation of timekeeper proteins in size dependent manner, and deactivation after triggering of division. While this deterministic model results in a constant size added from cell birth to division [6, 21], it remains to be seen how noise mechanisms can be incorporated in this model to explain statistical fluctuations in cell size. A plausible source of noise could be the inherent stochastic nature of protein expression that has been universally observed in prokaryotes and eukaryotes [92–96]. Such stochasticity in protein synthesis is amplified at the level of individual cells, where gene products are often present at low molecular counts. Considering noisy expression of the timekeeper protein, one can formulate cell- division time as a first-passage time problem: an event (division) occurs when a stochas- tic process (protein copy numbers) hits a threshold for the first time (Fig. 3.1b). Ex- ploiting this first-passage time framework, we derive an exact analytical formula for the cell-division time distribution for a given newborn cell size. Consistent with data, these results predict that the mean cell-division time decreases with increasing cell size at birth, and the randomness (quantified by coefficient of variation squared) in the cell-division time increases with newborn cell size. Intriguingly, analysis of the model further shows that the distribution of the volume added from cell birth to division is always independent of the newborn cell size. Finally, we find that the distributions of added volume and cell division time have scale invariant forms: distributions in different growth conditions collapse upon each other after rescaling them with their respective means. We discuss potential candidates for the timekeeper protein and deliberate upon model modifications that result in deviations from the adder principle.
Consider a newborn cell with volume Vb at time t = 0. Its volume at a time t after birth is given by V (t) = Vb exp(αt), where α > 0 represents the growth rate. After cell birth, the timekeeper protein begins to get transcribed at a rate r(t) = kmV (t), where km is the transcription rate in the concentration sense. Note that this scaling of protein synthesis with instantaneous cell volume is essential for preserving gene
25 product concentrations in growing cells. In the stochastic formulation, the probability of a transcription event occurring in an infinitesimal time interval (t, t + dt] is given by r(t)dt. Assuming short-lived mRNAs, each transcript degrades instantaneously after producing a burst of protein molecules [97–102]. Stochastic expression of the timekeeper protein is compactly represented by the following biochemical reaction:
r(t) ∅ −−→ Bi × P rotein, (3.1) where r(t) = kmV (t) can be interpreted as the burst arrival rate and Bi, i ∈ {1, 2, ···}, are identical and independent random variables denoting the size of protein bursts with mean b := hBii. The burst size represents the number of protein molecules synthesized in a single mRNA lifetime and typically follows a geometric distribution [98, 100, 102–105]. However, to allow a wide range of protein accumulation processes to be covered by equation (3.1), we assume that Bi follows an arbitrary non-negative integer-valued distribution. One example of such a mechanism could be to consider a protein A whose concentration is constant throughout the cell cycle. This protein is stochastically converted to an active form A∗ at a rate proportional to the number of molecules of A. In essence, this can be thought of as production of A∗ in bursts which takes place at a rate proportional to the cell volume. Let x(t) denote the number of timekeeper molecules in the cell at time t af- ter birth. Assuming a stable protein with no active proteolysis, we have x(t) = Pn i=1 Bi, x(0) = 0, where n is the number of bursts (transcription events) in [0, t]. Cell division occurs when x(t) reaches a threshold X and the protein is degraded (or deactivated) thereafter. Given this timing mechanism, cell-division time can be math- ematically represented as the first-passage time (FPT )
FPT := inf {t : x(t) ≥ X | x(0) = 0} . (3.2)
This first-passage time framework assumes that cell division occurs upon precise attainment of X protein molecules. In principle, one could generalize equation (3.2) by defining a monotonically increasing function h(x) that defines a probabilistic rate
26 Figure 3.1: Proposed molecular mechanism to realize adder principle of cell size con- trol. (a) An exponentially growing rod-shaped cell starts synthesizing a timekeeper protein after its birth. The production rate of the protein scales with the cell size (volume). When the protein’s copy number at- tains a certain level, the cell divides and the protein is degraded. (b) Stochastic evolution of the protein copy numbers is shown for cells of three different sizes at birth. The threshold for triggering cell division is assumed to be 50 molecules. The distribution of the first-passage time (generated via 1, 000 Monte Carlo realizations) for each newborn cell volume is shown above the three corresponding trajectories. The first- passage time distribution depends upon the newborn cell size: on average, the protein in a smaller cell takes more time to reach the threshold as compared to the protein in a larger cell. of cell division at time t given x(t) molecules. Interestingly, analysis reveals that the average size added from birth to division is invariant of the newborn cell size Vb iff
h(x) = 0 for x < X, h(x) = ∞ for x > X (3.3)
(see Supplementary Information (SI), section S1). Thus, a sharp threshold, where cell division cannot be triggered before attainment of a precise number of molecules seems to be a necessary ingredient of the adder principle.
27 3.1 Distribution of the cell-division time given newborn cell size Here we derive the distribution of the cell-division time (FPT ) for a given newborn cell size Vb and investigate how its statistical moments depend on Vb. We begin by finding the distribution of the minimum number of burst events N required for x(t) to reach the threshold X. In particular,
( n ) n ! X X N := inf n : Bi ≥ X =⇒ P rob (N ≤ n) = P rob Bi ≥ X . (3.4) i=1 i=1
Given a specific form for the distribution of Bi, the corresponding distribution for N can be obtained using equation (3.4). For example, if Bi is geometrically distributed, then the probability mass function of N is given by
n + X − 2 1 n−1 b X f (n) := P rob (N = n) = , n ∈ {1, 2,...}, N n − 1 b + 1 b + 1 (3.5) where b represents the mean burst size [106,107]. Having determined the number of bursts needed for cell division, we next focus
th on the timing of burst events. Let Tn represent the time at which n burst event takes place. If the burst arrival rate in equation (3.1) were constant, then the time intervals between bursts would be exponentially distributed, resulting in an Erlang distribution for Tn. However, in our case this rate is time varying (due to dependence on cell volume), the arrival of bursts is an inhomogeneous Poisson process. Employing the distribution for the timing of the nth event, and using the fact that FPT is same as the time at which the N th burst event occurs, the probability density function of FPT is obtained as
∞ ∞ n−1 X X (R(t)) f (t) = f (t) f (n) = r(t) exp(−R(t))f (n), (3.6) FPT Tn N (n − 1)! N n=1 n=1 Z t k V R(t) := r(s)ds = m b eαt − 1 , (3.7) 0 α
(see SI, section S2). One can note that fFPT (t) is dependent on the newborn cell size
Vb through the function R(t).
28 Figure 3.2: Both model prediction and data show increase in the noise in timing as newborn cell size increases. (a) Model prediction for noise (coefficient of variation squared, CV 2) in division time as computed numerically using equation (3.7) . The model parameters used are: transcription −1 rate km = 0.13 min , threshold X = 65 molecules, growth rate α = 0.03 min−1, and mean burst size b = 5 molecules. The distribution of protein burst size Bi is assumed to be geometric. For details on how these parameter values were estimated, see SI, section S6. (b) Experimental data from [90] for Escherichia coli MG1655 also shows increase in cell division time noise as newborn cell size increases. Single-cell data was categorized in one of the four bins (1−2.8 µm, 2.8−4.5 µm, 4.5−6.3 µm, and 6.3−8 µm) depending upon newborn cell sizes. CV 2 of division time with 95% confidence interval (using bootstrapping) for each bin is shown (more details in SI, section S6).
This FPT distribution qualitatively emulates the experimental observations that the mean cell division time decreases with increasing cell size at birth (see SI, section S6). Intuitively, a larger newborn cell expresses the protein at a higher rate as compared to a smaller cell. Hence, the time taken by the protein to reach the prescribed molecular threshold is shorter in larger cells. Analysis of equation (3.7) also predicts that the noise (quantified using the coefficient of variation squared, CV 2) in cell-division timing increases with increasing Vb, and we confirmed this behavior from published data (Fig. 3.2). The noise behavior can be understood from the fact that a small newborn cell requires more time for cell division. This allows for efficient time averaging of the underlying bursty process resulting in lower stochasticity in FPT .
29 3.2 Distribution of the volume added between divisions
(a) (b) Simulations Size
Time (minutes) (c) (d) Experimental data Mean size added Mean size added
Time (minutes)
Figure 3.3: The proposed mechanism results in added cell size distribution being inde- pendent of the newborn cell size. (a) The cell volume grows exponentially (shown for three different newborn cell sizes) until the timekeeper pro- tein reaches a critical threshold. (b) The size added to the newborn cell size also grows exponentially until division takes place. For three differ- ent newborn cell sizes, the distribution of the the added volume comes out to be same. (c) The added size generated via simulations is plotted against the newborn cell size in range 2 − 3.5 µm for 10, 000 cells. The cells are further binned in 13 uniformly spaced bins (number of cells per bin > 100). The dashed line shows the mean of the added volume, which is independent of the newborn cell size. (d) Data from [8] showing the added size versus newborn cell size for Escherichia coli NCM3722 grown in Glucose as carbon source. Cells were categorized into bins according to their newborn cell size (number of cells per bin > 100). For each bin, the circle shows mean of the added size whereas the error bar represents the standard deviation of the added size. It can be seen that the mean added cell size (shown by dashed line) is independent of the newborn cell size (also see Fig. 2D in [8]).
Having derived the distribution for the cell-division time (FPT ), we determine
30 the volume added by a single cell from birth to division (denoted by ∆V ). Since