Supplementary Materials for A mathematical model of ctDNA shedding predicts tumor detection size
Stefano Avanzini, David M. Kurtz, Jacob J. Chabon, Sharon Seiko Hori, Sanjiv Sam Gambhir, Ash A. Alizadeh, Maximilian Diehn, Johannes G. Reiter
Supplementary Text S1: Mathematical modeling We developed a stochastic model of cancer evolution and biomarker shedding to study the potential of cancer detection based on the biomarker level in liquid biopsies. Here we focused on somatic point mutations present in circulating tumor DNA (ctDNA). Nonetheless, our framework applies to any biomarker that is shed by cancer cells at some rate. These biomarkers can be released at different stages of a cell cycle, for example during proliferation, apoptosis, or per unit of time.
We model the dynamics of tumor growth and biomarker shedding through a continuous-time multi-type branching process (Athreya and Ney, 2004; Kimmel and Axelrod, 2015; Beerenwinkel et al., 2015; Altrock et al., 2015; Durrett, 2015b). The primary tumor grows stochastically from a single malignant cell at time t = 0. The
tumor size at time t, denoted At and expressed in number of cells, is modeled by a supercritical branching birth death process with net growth rate r = b − d > 0, where b and d denote the birth and death rate per day, respectively (Fig. 1B). This process has a probability of δ = d/b to go extinct (Athreya and Ney, 2004). Since
we are only interested in primary tumors that do not go extinct, we condition At on eventual survival, i.e. on the event Ω := {At > 0 for all t}. Cancer cells shed a biomarker into the bloodstream at a rate λ per day. In our case, the biomarker is ctDNA measured in diploid genome equivalents (GE). For now, we assume that the biomarker is exclusively shed by cells undergoing apoptosis and hence the shedding rate is given by λ = d · qd, where qd denotes the shedding probability per cell death (Fig. 1B).
1 Normal or benign tumor cells also shed cell-free DNA (cfDNA) into the bloodstream. Cells in benign tumors and expanded subclones often harbor the same cancer-associated mutations as cancer cells (Martincorena and Campbell, 2015; Makohon-Moore et al., 2018; Teixeira et al., 2019; Yokoyama et al., 2019; Martincorena et al., 2018). Hence, ctDNA shed from benign tumors can be difficult
to distinguish from ctDNA shed from malignant tumors. We define Bt as the size of the benign population of cells at time t that shed the biomarker in the bloodstream,
and denote their shedding rate as λbn . We assume that benign lesions roughly replicate at a constant size. Hence, benign cells divide and die at the same rate
bbn = dbn, and their population size Bt remains constant over time (Bt = B0 for all t). Considering again that the biomarker is exclusively shed by cells undergoing apoptosis, the shedding rate of benign cells is λbn = dbn · qd,bn per day. Although we conservatively assume that apoptotic benign cells shed ctDNA with
the same probability as malignant cells (qd = qd,bn), the shedding rate λbn of benign cells is lower than the shedding rate λ of malignant cells because benign cells typically replicate at a lower rate than cancer cells (Rew and Wilson, 2000).
The circulating biomarker is eliminated from the bloodstream at an elimination
rate ε which can be calculated from the biomarker half-life time t1/2 as A B ε = log(2)/t1/2. We denote as Ct and Ct the amount of biomarker circulating in the bloodstream at time t shed by malignant and benign cells, respectively. The A B total amount of the biomarker circulating at time t is thus Ct = Ct + Ct .
Because malignant and benign cells shed the biomarker independently from each A B other, the processes (At, Ct ) and (Bt, Ct ) can be studied separately. The stochastic A process (At, Ct ) is a two-type branching process governed by the following transitions
A −→ AA rate b
A −→ C rate d · qd
A −→ ∅ rate d (1 − qd) C −→ ∅ rate ε .
The process is initialized at time t = 0 with a single cancer cell and no circulating A biomarkers, that is A0, C0 = (1, 0).
Because we assume that the benign cell population Bt remains constant over time and biomarker units are eliminated from the bloodstream independently at the same B rate, the process Ct is a branching pure-death process with constant immigration
2 (Allen, 2011) (or equivalently a M/M/∞ queue system (Kulkarni, 2016; Steutel and B Van Harn, 2003)). We assume that Ct is at equilibrium at time t = 0. We present B additional details about this process, including the exact form of C0 , in the next section.
Biomarker Size Distributions
The processes At, Bt and Ct are indexed by the time parameter t, counted from the primary tumor onset. Clinically, this time is impossible to measure, and it is thus convenient to express the previous processes in terms of other indexing parameters. In our case, a suitable choice is to use the first passage time when the tumor reaches a size of m cells
τm := inf{t > 0 : At = m} .
A B By employing this random variable, the processes Cτm and Cτm express the biomarker levels in terms of an observable quantity, i.e. the primary tumor size. We are especially interested in our model predictions for relatively large primary tumor sizes. When m is large and the shedding rate λ is small, we are able to apply A B asymptotic results for the distributions of the processes Cτm and Cτm that greatly simplify computations and help getting insights into the main evolutionary features of the model. Hence, we first focus on these asymptotic results, while later we will show how to generalize our framework to include multiple biomarker shedding dynamics and to derive the probability distribution of the processes At, Bt and Ct indexed by time. Unless otherwise specified, the presented results are conditioned on eventual survival of At.
Note that we will employ the same sign ’∼’ to indicate ”distributed as” and ”asymptotically converges to”. Since only the former usage applies to random variables, this does not cause any notational ambiguity. Finally, whenever asymptotic probability distributions are derived, formulas including the ’∼’ sign will be accompanied by text highlighting the relevant asymptotic limit.
A Process Ct
A Here we provide a sketch derivation for the asymptotic distribution of Cτm in the small shedding rate - large primary tumor size limit (λ 1, m 1). The derivation follows from the results presented in Cheek et al.(2018).
3 The biomarker is shed by cancer cells as a Cox process (or doubly stochastic
Poisson process (D.J. Daley, 2006)) with intensity (λ · At)t≥0. Furthermore, for large times the branching birth death process At satisfies the classic result (Athreya and Ney, 2004) rt lim At = W e (1) t→∞ where W is a non-negative random variable such that W = 0 if and only if the process At goes extinct. Let us assume m 1 and consider the process At at time τm, by definition, that is when the primary tumor is made of m cells. Conditioning on Ω ≡ {W > 0} and looking backwards in time, we see that the size of the primary −rs tumor at s days before τm is given by m·e . Hence, in the small λ - large m limit, the R ∞ −rs biomarker is shed as a Poisson process with mean λ 0 me ds = λ·m/r. Moreover, by the same argument we see that the unordered times of biomarker shedding are asymptotically distributed as i.i.d. Exponential(r) random variables. Given that the elimination times of shed biomarkers are also i.i.d. Exponential(ε), the probability that a randomly selected biomarker fragment is still circulating in the bloodstream
at time τm is r/(ε + r). Due to the thinning property of Poisson processes, we find that for large primary tumor sizes m and small shedding rates λ
m · λ CA ∼ Poisson . (2) τm ε + r
B Process Ct
B The process Ct is a branching pure-death process with immigration, with death rate ε and immigration rate B0 · λbn = B0 · dbn · qd, because the population size Bt is constant over time. The probability generating function for such a process follows from Allen(2011)
B λbn −εt B −εtC0 B0(y−1)(1−e ) C (y, t) = 1 + (y − 1)e e ε . (3)
Conditioning on At survival, we have that limm→∞ τm = ∞ almost surely. Therefore, B B since the process (Bt, Ct ) is independent of At, the large m limit of Cτm coincides B with the large time limit of Ct . When t is large, equation (3) converges to
λ B bn B (y−1) lim C (y, t) = e ε 0 . (4) t→∞
independently of the initial biomarker level. The right hand side of equation (4) is the
probability generating function of a Poisson random variable with mean B0 · λbn/ε,
4 and so for large m we have
B · λ CB ∼ Poisson 0 bn . (5) τm ε
B B Since we assume that Ct is in equilibrium initially, we set C0 = B0 · dbn · qd/ε.
Process Ct and sampling
By combining the previous results, we find the asymptotic limit for the distribution of the total biomarker amount present in the bloodstream when the primary tumor
is made of a large number of cells. Conditioned on At survival, this limit is given by
m d q B d q C = CA + CB −→d Poisson d + 0 bn d (6) τm τm τm ε + r ε
Suppose that a sample of blood with volume Vs is drawn from a patient with a total blood volume equal to Vtot. Assuming that the circulating biomarker units are well mixed in the bloodstream, we sample the biomarker units from the bloodstream with probability p = Vs/Vtot. We define Xt as the total biomarker amount (i.e., number of diploid genome equivalents) present in a sample drawn at time t. Then, by applying again the thinning property of Poisson processes, we find that for large m the biomarker amount in the sample when the primary tumor is made of m cells is asymptotically distributed as
m d q B d q X ∼ Poisson p d + 0 bn d . (7) τm ε + r ε
General Framework
For the stochastic analysis of ctDNA shedding by malignant and benign cells we employed the asymptotic results derived above. We further extended these results in several ways. First, so far we focused on the case where biomarker shedding occurs exclusively at cell apoptosis. We generalize our derivations to additionally include shedding at cell necrosis (per unit of time) and proliferation. Second, we presented size distributions in the asymptotic limit of large primary tumor sizes - small shedding rates. For other shedding processes, We obtain similar distributions in the asymptotic limit for large times - small shedding rates. Finally, we compute exact
probability distributions for the processes At, Bt and Ct at any given time t for all
5 three shedding processes and their mixtures. Below we present these three extensions in greater detail.
In order to include the possibility of biomarker shedding by cancerous cells also at necrosis and proliferations in our model, we consider the following set of transitions
for the process (At, Ct)
A −→ AAC rate b qb rate λ2
A −→ AA rate b (1 − qb) rate b − λ2
A −→ AC rate qn rate λ1 (8) A −→ C rate d qd rate λ0
A −→ ∅ rate d (1 − qd) rate d − λ0 C −→ ∅ rate ε rate ε .
Here, qd and qb denote the shedding probabilities at apoptosis and proliferation, respectively. Similarly, λ0 = d qd, λ1 = qn and λ2 = b qb denote the shedding rates of malignant cells at apoptosis, necrosis, and proliferation, respectively. The total shedding rate is then defined by λ = λ0 + λ1 + λ2.
Asymptotic results for multiple shedding processes
With the generalized definition of the total shedding rate λ, we find that the biomarker is shed by cancer cells as a Cox process with intensity (λAt)t≥0. Hence, the derivation A of Cτm distribution is identical to the one above. In particular, we show again that for a large primary tumor size m and a small total shedding rate λ, the biomarker amount present in the bloodstream when At = m asymptotically follows a Poisson λ·m distribution with mean ε+r .
Asymptotic results for large times
A A similar derivation provides the size distribution for Ct in the asymptotic limit for large time and small total shedding rate. Under this limit and conditioning on At survival, the total biomarker amount shed up to time t converges in finite dimensional W λert distributions to a Poisson random variable with mean r , where W is the same as in equation (1) (for details see Cheek et al.(2018)). We recall that W > 0 if and only r if At eventually survives, and conditioned on this event, W follows an Exponential b distribution. The biomarker amount still present in the bloodstream at time t is thus
6 a compound Poisson process with probability generating function
rt h A i W λe E zCt = E exp (µ(z) − 1) . r
Here µ(z) denotes the probability generating function of the process initiated by a randomly selected biomarker unit, but because the biomarker cannot reproduce, this process is a branching pure-death process whose size can only be 1 or 0. The ε+rz probability generating function of such a process is thus µ(z) = ε+r . Using this A expression and the results for W , the probability generating function for Ct becomes
rt −1 h A i bλe (z − 1) E zCt = 1 − . (9) r(ε + r)
Notice that the same steps can be repeated to formally derive the asymptotic results for large primary tumor sizes. The last expression represents the probability generating function of a geometric random variable, and so in the asymptotic limit
for large times and small total shedding rates, conditioned on At survival, we find that r(ε + r) CA ∼ Geometric . (10) t bλert + r(ε + r)
First moments
In this section, we summarize the expected values and variances of the processes
At and Ct in the considered asymptotic limits. These moments follow from the probability distributions we previously derived. The process At is a supercritical branching birth death process with net growth rate r and extinction probability δ =
d/b. In particular, recalling that Ω denotes the event of At eventual survival, we have P(Ω) = 1 − δ. If A0 = 1, we have
1 + δ E[A ] = ert , Var(A ) ∼ e2rt t t 1 − δ and ert 1 + 2δ E[A | Ω] ∼ , Var(A | Ω) ∼ e2rt . t 1 − δ t (1 − δ)2 A A For the process Ct , given that (A0, C0 ) = (1, 0), equations (2) and (10) yield m λ m λ E CA | Ω ∼ , Var CA | Ω ∼ (11) τm ε + r τm ε + r
7 and bλ bλ bλ E CA | Ω ∼ ert , Var CA | Ω ∼ ert ert + 1 (12) t r(ε + r) t r(ε + r) r(ε + r)
asymptotically for large primary tumor sizes - small shedding rates and large times - small shedding rates, respectively.
B In the same two limits, the process Ct exhibits the same asypmtotic behaviour. B Conditional on C0 = B0 · λbn/ε we find
B B λbn B B λbn lim E Cτ = lim E[Ct ] = B0 , lim Var Cτ = lim Var Ct = B0 . m→∞ m t→∞ ε m→∞ m t→∞ ε Combining these results and applying the same initial conditions, we then get
m λ B λ m λ B λ E[C | Ω] ∼ + 0 bn , Var (C | Ω) ∼ + 0 bn (13) τm ε + r ε τm ε + r ε for large M - small λ and
b λ B λ E[C | Ω] ∼ ert + 0 bn , t r(ε + r) ε (14) b λ b λ B λ Var (C | Ω) ∼ ert ert + 1 + 0 bn t r(ε + r) r(ε + r) ε
for large t - small λ.
Exact results
The asymptotic results discussed above perform extremely well for realistic parameter values and perfectly match the exact simulation results (see Figure S3). In principle, however, our mathematical framework can be used for other biomarkers - or entirely different applications - where the parameter values may not fall in the asymptotic regimes considered before. For this reason, we derive the exact distributions of the processes At, Bt and Ct, for any given time t and combinations of non-zero shedding rates.
Probability generating functions
In order to compute the exact distributions of At, Bt and Ct, we first derive the joint A B probability generating functions of the processes (At, Ct ) and (Bt, Ct ). The marginal
8 generating functions then follow directly, and provide the probability distributions of the individual processes by simple analytical or numerical inversion.
A Process (At, Ct )
A Our derivation for the process (At, Ct ) is based on previous results for a two-type branching process presented in Antal and Krapivsky(2010, 2011). These studies assumed that cells of the second type also have the ability to reproduce, but shedding (or mutations) can happen only at wild-type cell death.
A We consider instead the two-type branching process (At, Ct ) fully defined by the set of transitions in equation (8). For any given initial condition, we denote ∗ A A Pm,n(t) = P ((At, Ct ) = (m, n) | (A0, C0 ) = ∗). The corresponding probability generating function is then defined as
∗ X m n ∗ P (x, y, t) = x y Pm,n(t). m,n≥0
The backward Kolmogorov equations for this process read
dP (1,0) m,n = (b − λ )P (2,0) + (d − λ )δ δ dt 2 m,n 0 m,0 n,0 (0,1) (1,1) (2,1) (1,0) + λ0Pm,n + λ1Pm,n + λ2Pm,n − (b + d + λ1)Pm,n (15) dP (0,1) m,n = ε δ δ − εP (0,1) . dt m,0 n,0 m,n Hence, multiplying both sides of equation (15) by xmyn and summing over all non- negative m, n we find
(1,0) (2,0) ∂tP = (b − λ2)P + (d − λ0) (0,1) (1,1) (2,1) (1,0) + λ0P + λ1P + λ2P − (b + d + λ1)P (0,1) (0,1) ∂tP = ε − εP .
Next, we observe that 2 P(2,0)(x, y, t) = P(1,0)(x, y, t) because of the independence of the progenies of the two initial cells. By applying this property and introducing the notation
A(x, y, t) = P(1,0)(x, y, t), C(x, y, t) = P(0,1)(x, y, t)
9 we reduce to the following system of first order differential equations
2 2 ∂tA = (b − λ2)A + (d − λ0) + λ0C + λ1AC + λ2A C − (b + d + λ1)A (16)
∂tC = ε − εC (17) with initial conditions
A(x, y, 0) = x (18) C(x, y, 0) = y . (19)
Equation (17), subject to the initial condition (19), has solution
C(x, y, t) = 1 + (y − 1)e−εt . (20)
Plugging this back into equation (16), we get
−εt 2 −εt −εt ∂tA = b + λ2(y − 1)e A + λ1(y − 1)e − b − d A+ d + λ0(y − 1)e . (21)
We apply the change of variables s = e−εt and find
[b + λ (y − 1)s] [λ (y − 1)s − b − d] [d + λ (y − 1)s] ∂ A = − 2 A2 − 1 A − 0 . s εs εs εs To ease the notation, we rewrite the equation above as
α + β s α + β s α + β s ∂ A = 2 2 A2 + 1 1 A + 0 0 (22) s s s s where b b + d d α = − , α = , α = − 2 ε 1 ε 0 ε λ (y − 1) λ (y − 1) λ (y − 1) β = − 2 , β = − 1 , β = − 0 . 2 ε 1 ε 0 ε We are interested in the probability generating function A(x, y, t) for t ≥ 0, that corresponds to 0 < s ≤ 1. Equation (22) is a Riccati equation, which can be reduced to a second order ODE. To do so, we define
α + β s X(x, y, s) = 2 2 A(x, y, s) s
10 which yields
2 (α2 + β2s)(α1 + β1s) − α2 (α2 + β2s)(α0 + β0s) ∂sX = X + X + 2 . (23) s(α2 + β2s) s
Next, we set ∂ Y (∂ Y )2 ∂2 Y X ≡ − s =⇒ ∂ X = s − s Y s Y 2 Y so that equation (23) transforms into
2 (α2 + β2s)(α1 + β1s) − α2 (α2 + β2s)(α0 + β0s) ∂s Y − ∂sY + 2 Y = 0 . (24) s(α2 + β2s) s
Equation (24) is a second order linear differential equation with rational coefficients. It has three singular points: s = 0 and s = − α2 being regular and s = ∞ being β2 irregular with rank 1. Hence, equation (24) is a single confluent Heun equation (Ronveaux, 1995). To bring it into standard form, we first move the non-zero regular singularity to 1 through the change of variable z = − β2 s. This leads to α2 α2β1 α2β0 (z − 1) β z − α1 − 1 α2(z − 1) β z − α0 ∂2 Y + 2 ∂ Y + 2 Y = 0 . (25) z z(z − 1) z z2
We now look for a solution to equation (25) of the form
Y (z) = zmenzf(z) (26) which implies
m−1 nz 0 ∂zY = z e [(m + nz)f + zf ] 2 m−2 nz 2 0 2 00 ∂z Y = z e (m + nz) − m f + 2z(m + nz)f + z f .
Substituting these expressions into equation (25) and rearranging, we obtain the following differential equation for f(z)
P (z) Q(z) f 00 + f 0 + f = 0 (27) z(z − 1) z2(z − 1) where P (z) and Q(z) are two polynomials of second and third degree in z, respectively. However, it is easily checked that by taking
q α2 2 m = −α2, n = − β1 + β1 − 4β0β2 (28) 2β2
11 the first and last coefficients of the polynomial Q(z) become zero. Hence, for these values equation (27) can be written as
γ δ ωz + ρ f 00 + + + η f 0 + f = 0 . (29) z z − 1 z(z − 1)
Here, the parameters γ, δ and η follow from the decomposition in partial fractions of P (z) z(z−1) while ω and ρ correspond to the coefficients of second and first degree terms in Q(z), respectively - when m and n are set as in equation (28). Explicit expressions for all these parameters in terms of the original rates are given by
b b q 2 m = , n = λ1 − λ1 − 4λ0λ2 ε 2λ2ε b − d b q γ = + 1 , δ = −1 , η = λ2 − 4λ λ ε λ ε 1 0 2 2 (30) b q 2 ω = − 2 (b + d)λ1 − (b − d) λ1 − 4λ0λ2 + 2(bλ0 + dλ2) 2λ2ε b q 2 ρ = 2 (b + d − ε)λ1 − (b − d + ε) λ1 − 4λ0λ2 + 2[bλ0 + (d − ε)λ2] . 2λ2ε
Now, equation (29) is the standard form of the single confluent Heun equation. Solutions to such a second order differential equation are called confluent Heun functions and depend on six arguments - the five parameters of the equation and the independent variable z. A standardized package for numerical and symbolical computations involving Heun functions is currently provided only by Maple (Motygin, 2018), which includes in particular the procedures HeunC and HeunCPrime for the evaluation of confluent Heun functions and their z derivative, respectively. Constistently with these implementations the general solution of equation (29) can be written as 1−γ f(z) = h1(z) + Dz h2(z) (31) where η(γ + δ) 1 − γ(δ − η) h (z) = HeunC η, 1 − γ, δ − 1, ω − , ρ + , z 1 2 2 η(γ + δ) 1 − γ(δ − η) h (z) = HeunC η, γ − 1, δ − 1, ω − , ρ + , z 2 2 2 are the confluent Heun functions uniquely determined by the following initial
12 conditions ρ h (0) = 1, h0 (0) = 1 1 γ (32) (γ − 1)(δ − η) − ρ h (0) = 1, h0 (0) = . 2 2 γ − 2 0 Here hi denotes the z derivative of the confluent Heun functions hi. As mentioned before these functions are implemented by the Maple procedure HeunCPrime and 00 uniquely determined by an additional condition on hi (0), too cumbersome to be reported here. Also notice that the expression in equation (31) contains only one integrating constant, D, as our derivation spawns from a first order differential equation. By plugging such expression back into equation (26), the solution to equation (25) becomes
m nz m nz 1+m−γ nz Y (z) = z e f(z) = z e h1(z) + Dz e h2(z).
Next, we denote the derivative of f(z) as
0 0 −γ 0 g(z) = f (z) = h1(z) + Dz [(1 − γ)h2(z) + zh2(z)] . (33)
Recalling that z = − β2 s = ks and that X(x, y, s) = − ∂sY , we find α2 Y
(m + nks)f(ks) + ksg(ks) X(x, y, s) = − . (34) sf(ks)
Notice that in terms of the original parameters of our model the coefficient k is given
by k = qb(y − 1) and thus depends on y. We can now apply the initial condition for X(x, y, s) to find the value of the constant D. Since s = e−εt and X(x, y, s) = α2+β2s s A(x, y, s), the initial condition A(x, y, t = 0) = x translates to X(x, y, s = 1) = (α2 + β2)x = α2(1 − k)x. Therefore, equation (34) at s = 1 implies
(m + nk)f(k) + kg(k) α (1 − k)x = − . 2 f(k)
Substituting the expressions for f and its derivative (equations (31) and (33), respectively) and solving for D, we find
0 (m + nk + α2x − kα2x)h1(k) + kh1(k) D = − 1−γ 0 . k [(m + nk + α2x − kα2x + 1 − γ)h2(k) + kh2(k)]
Plugging this value back into equation (34), we find an expression for X in terms of 0 s the functions hi(s) and h (s). Multiplying this expression by and substituting i α2(1−ks)
13 s = e−εt, we obtain
rt e K1(x, y)φ2(y, t) − K2(x, y)φ1(y, t) A(x, y, t) = rt (35) e K1(x, y)ψ1(y, t) − K2(x, y)ψ2(y, t)
where b − x[b + λ (y − 1)] K (x, y) = 2 + nk h (k) + kh0 (k) 1 ε 1 1 d − x[b + λ (y − 1)] K (x, y) = 2 + nk h (k) + kh0 (k) 2 ε 2 2 bnke−εt εk φ (y, t) = + 1 h (ke−εt) + e−εth0 (ke−εt) 1 ε 1 b 1 (36) bnke−εt d εk φ (y, t) = + h (ke−εt) + e−εth0 (ke−εt) 2 ε b 2 b 2 −εt −εt ψ1(y, t) = (1 − ke )h1(ke ) −εt −εt ψ2(y, t) = (1 − ke )h2(ke ).
A This is the joint probability generating function of the processes At and Ct , starting from one A cell at time t = 0.
To find the marginal generating function
∞ X A m A (x, t) = P(At = m | (A0, C0 ) = (1, 0))x m=0
we take the limit for y → 1 in equation (35). Given the conditions in equation (32), this yields e−rt(x − δ) + δ(1 − x) A (x, t) = . (37) e−rt(x − δ) + (1 − x) As expected, this coincides with the probability generating function of a supercritical birth death process with net growth rate r = b − d > 0 and extinction probability
δ = d/b < 1. The exact distribution for the process At then follows by inverting A (x, t) and is given by m δ 1 ∂ A (x, t) S(t) if m = 0 P(At = m) = = (38) m! ∂xm x=0 δ −m 1 − S(t) (S(t) − 1)S(t) if m ≥ 1
14 1−δe−rt where S(t) = 1−e−rt . Similarly, the generating function
∞ A X A A n C (y, t) = P(Ct = n | (A0, C0 ) = (1, 0))y n=0
is derived by taking the limit for x → 1 in equation (35). The expression for C A slightly simplifies to
ertK¯ (y)φ (y, t) − K¯ (y)φ (y, t) A(y, t) = 1 2 2 1 (39) C rt ¯ ¯ e K1(y)ψ1(y, t) − K2(y)ψ2(y, t)
where λ (y − 1) K¯ (y) = nk − 2 h (k) + kh0 (k) 1 ε 1 1 b − d + λ (y − 1) K¯ (y) = nk − 2 h (k) + kh0 (k) 2 ε 2 2
and the functions φ1, φ2, ψ1, ψ2 are defined as in equation (36). The function C A(y, t) can be inverted numerically (see discussion below on inversion techniques) A to find the distribution of the process Ct .
We present the particular form that these results assume when one or more of the
parameters qd, λ1, qb are zero. In general, among the nonzero shedding rates, the one with the highest index determines the special functions involved in the generating
function A(x, y, t). As shown above, these are confluent Heun functions when λ2 > 0, while they become confluent hypergeometric functions when λ2 = 0 and λ1 > 0 and Bessel functions of the first kind when λ2 = 0, λ1 = 0 and λ0 > 0. The remaining cases follow by straightforward substitution from these three. We provide a sketch
of how the previous derivation can be adapted to the cases λ2 = 0, λ1 > 0 and λ2 = 0, λ1 = 0, λ0 > 0. The steps up until equation (24) are valid for all scenarios, which will be the starting point of the adapted derivations. Also notice that for all these cases the marginal probability generating function A (y, t) remains unchanged, as the evolution of the process At is not affected by its shedding activity.
qb = 0, λ1 > 0
When qb = 0, we have β2 = 0. Hence, equation (24) becomes
β s + α − 1 α (α + β s) ∂2 Y − 1 1 ∂ Y + 2 0 0 Y = 0 s s s s2
15 By seeking directly a solution of the form Y (s) = s−α2 f(s) and then applying the
change of variables z = β1s, the equation above reduces to the standard form of Kummer’s equation (Abramowitz and Stegun, 1964)
zf 00(z) + (γ − z)f 0(z) − ωf(z) = 0 (40)
where
γ = α0 − α2 + 1
c = β1 β0 ω = −α2 + 1 . β1
Including again only one integrating constant D, its solution can be expressed in
terms of the confluent hypergeometric function 1F1 as
1−γ f(z) = 1F1(ω, γ, z) + Dz 1F1(ω − γ + 1, 2 − γ, z)
Substituting back we immediately find an expression for the solution Y (s) in terms of
h1(s) = 1F1(ω, γ, cs), h2(s) = 1F1(ω − γ + 1, 2 − γ, cs).
Furthermore, since ∂ a F (a, b, z) = F (a + 1, b + 1, z) ∂z 1 1 b 1 1 the derivative of Y (s) can be computed in terms of the functions h1, h2, g1 and g2, where
g1(s) = 1F1(ω + 1, γ + 1, cs), g2(s) = 1F1(ω − γ + 2, 3 − γ, cs).
Joining these results together, we find an expression for X(s), in terms of the same functions. This expression still depends on the integrating constant D, which is determined by the initial condition X(x, y, s = 1) = α2x. Then, multiplying X(x, y, s) by s and substituting s = e−εt we eventually find α2
rt e K1(x, y)φ2(y, t) − K2(x, y)φ1(y, t) A(x, y, t) = rt (41) e K1(x, y)ψ1(y, t) − K2(x, y)ψ2(y, t)
16 where
K1(x, y) = (b − d + e)(1 − x)h1(y, 0) − (λ1 + λ0)(y − 1)g1(y, 0)
K2(x, y) = (b − d − e)(d − bx)h2(y, 0) + (dλ1 + bλ0)(y − 1)g2(y, 0) −εt φ1(y, t) = (b − d + e)h1(y, t) − (λ1 + λ0)(y − 1)e g1(y, t) −εt φ2(y, t) = d(b − d − e)h2(y, t) + (dλ1 + bλ0)(y − 1)e g2(y, t)
ψ1(y, t) = (b − d + e)h1(y, t)
ψ2(y, t) = b(b − d − e)h2(y, t).
A The marginal probability generating function for the process Ct becomes
ertK¯ (y)φ (y, t) − K¯ (y)φ (y, t) A(y, t) = 1 2 2 1 (42) C rt ¯ ¯ e K1(y)ψ1(y, t) − K2(y)ψ2(y, t) where
¯ K1(y) = −(λ1 + λ0)(y − 1)g1(y, 0) ¯ K2(y) = (b − d − e)(d − b)h2(y, 0) + (dλ1 + bλ0)(y − 1)g2(y, 0) .
qb = 0, λ1 = 0
If we additionally have λ1 = 0, equation (24) becomes
α − 1 α (α + β s) ∂2Y − 1 ∂ Y + 2 0 0 Y = 0 . s s s s2 √ √ Through the change of variables z = 2 α2β0s = c s and by seeking a solution of the form Y (z) = zα1 f(z), we reduce to the Bessel equation (Abramowitz and Stegun, 1964) 2 00 0 2 2 z f + zf + [z − (α2 − α0) ]f = 0 . (43) The solution to equation (43) can be expressed as a Bessel function of the first kind
Jν(z) as
f(z) = Jα2−α0 (z) + DJα0−α2 (z). √ √ This allows us to write Y (s) in terms of h1(s) = Jα0−α2 (c s) and h2(s) = Jα2−α0 (c s) and since w ∂ J (z) = J (z) − J (z), z w z w w+1
17 we can compute Y (s) derivative in terms of the functions h1, h2, g1 and g2, where √ √ g1(s) = Jα0−α2+1(c s), g2(s) = Jα2−α0+1(c s).
By combining these expressions, we first derive X(s), and then find the integrating s constant D by applying the initial condition X(x, y, s = 1) = α2x. Multiplying by α2 and substituting s = e−εt, we find
K (x, y)φ (y, t) − K (x, y)φ (y, t) A(x, y, t) = 1 2 2 1 (44) K1(x, y)ψ2(y, t) − K2(x, y)ψ1(y, t) where p K1(x, y) = (bx − b)h1(y, 0) + bλ0(y − 1)g1(y, 0) p K2(x, y) = (bx − d)h2(y, 0) + bλ0(y − 1)g2(y, 0) p − ε t φ1(y, t) = bh1(y, t) − bλ0(y − 1)e 2 g1(y, t) p − ε t φ2(y, t) = dh2(y, t) − bλ0(y − 1)e 2 g2(y, t)
ψ1(y, t) = bh1(t)
ψ2(y, t) = bh2(t).
From here we can recover the usual marginal probability generating function for the
process At by considering the expansions of Bessel functions Jw(z) around z = 0. The A marginal probability generating function for the process Ct is instead given by ¯ ¯ A K1(y)φ2(y, t) − K2(y)φ1(y, t) C (y, t) = ¯ ¯ (45) K1(y)ψ1(y, t) − K2(y)ψ2(y, t)
where
¯ p K1(y) = bλ0(y − 1)g1(y, 0) ¯ p K2(y) = (b − d)h2(y, 0) + bλ0(y − 1)g2(y, 0) .
Finally, we remark that in all the derivations above, there are special cases of the equation f(z) that require extra attention. Namely, if some of the equation coefficients are integers, then its general solution assumes a slightly different expression than the one reported. However, it is unlikely that real estimates lead to such special cases, which we therefore do not discuss in detail here.
18 B Process (Bt, Ct )
B We derive exact results for the size distributions of the processes Bt and Ct . So far, we assumed a constant population of benign cells and studied the asymptotic behaviour of the biomarker amount they shed. Here, we first present additional details about B this case, providing the exact generating function of Ct at a given time t. Later, we A show how a derivation similar to that employed for the process (At, Ct ) can be used when Bt is modeled by a critical branching birth-death process.
Constant growth
In the derivation of asymptotic results for our model, we already mentioned that if B Bt is constant then Ct is a branching pure-death process with immigration and its exact probability generating function is given by equation (3). In our setup, we also B assume that Ct is originally at equilibrium, i.e. that it starts at the expected value B of its large time asymptotic distribution. Since we observed that in this limit Ct λbn B λbn converges to a Poisson random variable with mean ε B0, we set C0 = ε B0. With this initial condition, equation (3) becomes
λ bn B B n −εt (y−1)(1−e−εt)o ε 0 C (y, t) = 1 + (y − 1)e e .
Numerical inversion of this function then provides the exact distribution of the process B Ct at any given time t.
Critical growth
In our model Bt denotes the number of benign cells at time t with the ability to shed the biomarker in the bloodstream. So far we have assumed that this number stays constant over time, but more generally this number can fluctuate around a constant
average value. For this reason, Bt can be modeled as a critical branching birth death process, i.e. as a process defined like At but with the same birth and death rates bbn = dbn. Under this assumption, and assuming that the shedding probabilities at cell apoptosis and proliferation are the same for benign and malignant cells, the set
19 B of transitions characterizing the two-type process (Bt, Ct ) are
B −→ BBC rate bbn qb rate λbn,2
B −→ BB rate bbn (1 − qb) rate bbn − λbn,2
B −→ BC rate λbn,1 rate λbn,1
B −→ C rate bbn qd rate λbn,0
B −→ ∅ rate bbn (1 − qd) rate bbn − λbn,1 C −→ ∅ rate ε rate ε .
The exact probability generating functions for this process can be computed through A ∗ a derivation similar to that shown for (At, Ct ). Briefly, we can redefine P as
∗ X m n B B P (x, y, t) = x y P Bt, Ct = (m, n) | B0, C0 = ∗ m,n≥0 and set B(x, y, t) = P(1,0)(x, y, t), C0(x, y, t) = P(0,1)(x, y, t). As biomarker units are eliminated from the bloodstream at the same rate regardless of the cell type that shed them, we have C0(x, y, t) = C(x, y, t), where C(x, y, t) is given by equation (20). However, the function B(x, y, t) does not follow straightforwardly from
A(x, y, t) by simply substituting the dashed rates and taking the limit as dbn → bbn. The reason is that in this limit the two linearly independent solutions h1(z) and h2(z) of the reduced equation become the same and so a different set of solutions has to be chosen. Once these are found, the subsequent steps can be repeated and an explicit expression for B(x, y, t) can be derived. The exact probability generating function for B the process (Bt, Ct ) is given by
(B ,CB ) B CB P 0 0 (x, y, t) = B(x, y, t) 0 C(x, y, t) 0 .
The marginal generating functions for the two single processes follow from the y → 1 and x → 1 limits, respectively. In particular, for any combination of non-zero shedding rates the former is equal to
x − 1 B0 B(x, t) = 1 + 1 − dbnt(x − 1)
which coincides with the probability generating function of a critical branching
birth death process with death rate dbn and B0 initial individuals. This function is analytically invertible, even though the terms of the probability mass function of Bt
20 can only be expressed as finite sums
B0 dbnt m−1 B0 dbnt+1 X B0 − 1 m P(B = m | B ) = (d t)2j . t 0 m(d t)m(1 + d t)m B − m + j j bn bn bn j=0 0
B The marginal generating function of Ct can instead be inverted numerically to derive the exact probability distribution of the process. However, while the above described computations are feasible, modeling Bt as a critical branching birth death process leads to tedious complications and does not practically change the dynamics of the model. The complications are related to the fact that a critical branching process eventually gets extinct with probability one. We are generally interested in the large time limit distribution of Bt, but this limit converges to a point mass at zero and conditioning on Bt eventual survival does not make sense either. Moreover, since Bt starts with a very large number of cells, the time required by Bt to become extinct would be unrealistically long. Indeed, the expected time to extinction for a critical branching process is infinite, and using the expression above with dbn = 0.1, we find 7 −12 that P(B10000 yrs = 0 | B0 = 10 ) ≈ 10 . For the same principle, the probability that Bt exhibits significant deviations from B0 within human lifetime is very small. To quantify the probability of such fluctuations, we exploit Chebyshev inequality: 7 7 6 using again dbn = 0.1 and B0 = 10 , we find that P(|B55 yrs − 10 | ≥ 2 × 10 ) ≤ 0.01, which says that even after waiting 55 years, the probability of observing a deviation of at least 20% from the original population size would still be lower than 1%.
Process Ct and numerical inversion
Given the independence of biomarker shedding from benign and malignant cells, the A B probability generating function of the process Ct = Ct + Ct is equal to
C (y, t) = C A(y, t)C B(y, t) where the two functions on the right hand side are given by equation (39) (see also equations (42) and (45) for special cases) and equation (3) (assuming Bt is constant), respectively. As for many other functions derived before, C (y, t) can be numerically inverted to find the exact distribution of the process Ct.
We briefly discuss some inversion techniques. The goal of the inversion procedure is to derive the probability mass function of a stochastic process χt from its generating
21 P∞ k function G(z, t) = k=0 P(χt = k)z as
1 ∂kG(z, t) P(χt = k) = . (46) k! ∂zk z=0
In a few cases, the k-th derivative of the generating function is explicitly computable, thus allowing us to find an analytic formula for the probability mass function of χt. The expression for A (x, t) given by equation (37) is an example of such a generating function and from it we directly derived the probabilities
P(At = m) (see equation (38)). For most generating functions, however, this analytical inversion is not feasible and we have to compute the probability mass function numerically. Algorithms for this numerical procedure are based on techniques for series expansions of analytical functions, which are implemented as Series in both Mathematica (see also NSeries) and Maple. For more details on the numerical inversion of probability generating functions, we refer to refs. Abate and Whitt(1992); Choudhury et al.(1994).
Conditional distributions
In our model summary we pointed out that we are interested in the probability distributions of the processes At, Bt and Ct conditional on At non-extinction. Therefore, the asymptotic results derived before are conditioned on the event of At eventual survival, which we denoted as Ω. The same results were also expressed in terms of the primary tumor size, by computing the asymptotic distributions of the processes involved at the time when At = m, for m large. Here we show how the same kind of conditioning can be applied to the exact probability mass functions derived above.
We note that all the probabilities involved in the following derivations are conditioned on the usual initial values of the processes At, Bt and Ct. For clarity, we do not introduce a new notation for such conditioning, and hereafter we implicitly denote λ P( · ) = P · | A , B , CA, CB = 1, B , 0, bn B . 0 0 0 0 0 ε 0
Conditioning on At survival
As we are dealing with exact distributions at a given time, we condition on the event
22 of At survival up to time t, that is
Ωt = {At > 0} .
c We denote δt = P(Ωt ) = P(At = 0). This probability is equal to A (0, t), so we get δ − δe−rt δ = . t 1 − δe−rt Bayes theorem yields
P(At = m) P(At = m | Ωt) = · P (Ωt | At = m) P (Ωt)
for any m ≥ 0. The term P (Ωt | At = m), however, is equal to 0 for m = 0 and to 1 for every m ≥ 1. Hence, we get
P(At = m) P(At = m | Ωt) = 1 − δt for every m ≥ 1, and 0 otherwise.