Survival Analysis

Survival Analysis

1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time • We will introduce various statistical methods for analyzing survival outcomes • What is the survival outcomes? the time to a clinical event of interest: terminal and non-terminal events. 1. the time from diagnosis of cancer to death 2. the time between administration of a vaccine and infection date 3. the time from the initiation of a treatment to the time of the disease progression. 3 Survival time T • Let T be a nonnegative random variable denoting the time to event of interest (survival time/event time/failure time). • The distribution of T could be discrete, continuous or a mixture of both. We will focus on the continuous distribution. 4 Survival time T • The distribution of T ≥ 0 can be characterized by its probability density function (pdf) and cumulative distribution function (CDF). However, in survival analysis, we often focus on 1. Survival function: S(t) = pr(T > t): If T is time to death, then S(t) is the probability that a subject can survive beyond time t: 2. Hazard function: h(t) = lim pr(T 2 [t; t + dt]jT ≥ t)=dt dt#0 R t 3. Cumulative hazard function: H(t) = 0 h(u)du 5 Relationships between survival and hazard functions • f(t) Hazard function: h(t) = S(t) : • Cumulative hazard function Z Z Z t t f(u) t df1 − S(u)g H(t) = h(u)du = du = = − logfS(t)g 0 0 S(u) 0 S(u) • S(t) = e−H(t) and f(t) = h(t)S(t) = h(t)e−H(t) 6 Additional properties of hazard functions • If H(t) is the cumulative hazard function of T , then H(T ) ∼ EXP (1); the unit exponential distribution. • If T1 and T2 are two independent survival times with hazard functions h1(t) and h2(t), respectively, then T = min(T1;T2) has a hazard function hT (t) = h1(t) + h2(t): 7 Hazard functions • The hazard function h(t) is NOT the probability that the event (such as death) occurs at time t or before time t • h(t)dt is approximately the conditional probability that the event occurs within the interval [t; t + dt] given that the event has not occurred before time t: • If the hazard function h(t) increases xxx% at [0; τ], the probability of failure before τ in general does not increase xxx%: 8 Why hazard • Interpretability. • Analytic simplification. • Modeling sensibility. • In general, it could be fairly straightforward to understand how the hazard changes with time, e.g., think about the hazard (of death) for a person since his/her birth. 9 Exponential distribution • In survival analysis the exponential distribution somewhat plays the role of the Gaussian distribution. • Denote the exponential distribution by EXP (λ): • f(t) = λe−λt • F (t) = 1 − e−λt • h(t) = λ; constant hazard • H(t) = λt 10 Exponential distribution • E(T ) = λ. The higher the hazard, the smaller the expected survival time. • Var(T ) = λ−2: • Memoryless property: pr(T > t) = pr(T > t + sjT > s): • c0 × EXP (λ) ∼ EXP (λ/c0); for c0 > 0: • The log-transformed exponential distribution is the so called extreme value distribution. 11 Gamma distribution • Gamma distribution is a generalization of the simple exponential distribution. • Be careful about the parametrization G(α; λ); α; γ > 0 : 1. The density function λαtα−1e−λt f(t) = / tα−1e−λt; Γ(α) where Z 1 Γ(α) = tα−1e−tdt 0 is the Gamma function. For integer α; Γ(α) = (α − 1)!: 2. There is no close formulae for survival or hazard function. 12 Gamma distribution • E(T ) = α/λ. • Var(T ) = α/λ2: • If Ti ∼ G(αi; λ); i = 1; ··· ;K and Ti; i = 1; ··· ;K are independent, then XK XK Ti ∼ G( αi; λ): i=1 i=1 • G(1; λ) ∼ EXP (λ): • ∼ 2 ··· 2λG(K=2; α) χK ; for K = 1; 2; : 13 Gamma distribution Figure 1: increasing hazard α > 1; constant hazard α = 1; decreasing hazard 0 < α < 1 Gamma h(t) t 14 Weibull distribution • Weibull distribution is also a generalization of the simple exponential distribution. • Be careful about the parametrization W (λ, p); λ > 0(scale parameter) and p > 0(shape parameter): p 1. S(t) = e−(λt) p p 2. f(t) = pλ(λt)p−1e−(λt) / tp−1e−(λt) : 3. h(t) = pλ(λt)p−1 / tp−1 4. H(t) = (λt)p: 15 Weibull distribution • E(T ) = λ−1Γ(1 + 1=p). h i • Var(T ) = λ−2 Γ(1 + 2=p) − fΓ(1 + 1=p)g2 • W (λ, 1) ∼ EXP (λ): • W (λ, p) ∼ fEXP (λp)g1=p 16 Hazard function of the Weibull distribution Weibull distribution p=1 p=0.5 p=1.5 p=2.5 h(t) 0 2 4 6 0.0 0.5 1.0 1.5 time 17 Log-normal distribution • The log-normal distribution is another commonly used parametric distribution for characterizing the survival time. • LN(µ, σ2) ∼ expfN(µ, σ2)g 2 • E(T ) = eµ+σ =2 2 2 • Var(T ) = e2µ+σ (eσ − 1) 18 The hazard function of the log-normal distribution log−normal (mu, sigma)=(0, 1) (mu, sigma)=(1, 1) (mu, sigma)=(−1, 1) h(t) 0.0 0.5 1.0 1.5 2.0 2.5 0.0 0.2 0.4 0.6 0.8 1.0 time 19 Generalized gamma distribution • The generalized gamma distribution becomes more popular due to its flexibility. • Again be careful about its parametrization GG(α; λ, p): p p • f(t) = pλ(λt)α−1e−(λt) =Γ(α=p) / tα−1e−(λt) • S(t) = 1 − γfα=p; (λt)pg=Γ(α=p); where Z x γ(s; x) = ts−1e−tdt 0 is the incomplete gamma function. 20 Generalized gamma distribution • For k = 1; 2; ··· Γ((α + k)=p) E(T k) = λkΓ(α=p) • If p = 1, GG(α; λ, 1) ∼ G(α; λ) • if α = p, GG(p; λ, p) ∼ W (λ, p) • if α = p = 1, GG(p; λ, p) ∼ EXP (λ) • The generalized gamma distribution can be used to test the adequacy of commonly used Gamma, Weibull and Exponential distributions, since they are all nested within the generalized gamma distribution family. 21 Homogeneous Poisson Process • N(t) =# events occurring in (0; t) • T1 denotes the time to the first event; T2 denotes the time from the first to the second event T3 denotes the time from the second to the third event et al. • If the gap times T1;T2; ··· are i.i.d EXP (λ); then N(t + dt) − N(t) ∼ P oisson(λdt): The process N(t) is called homogeneous Poisson process. • The interpretation of the intensity function (similar to hazard function) prfN(t + dt) − N(t) > 0g lim = λ dt#0 dt 22 Non-homogeneous Poisson Process R • − ∼ t+dt N(t + dt) N(t) P oisson( t λ(s)ds) for a rate function λ(s); s > 0: • The interpretation of the intensity function pr(N(t + dt) − N(t) > 0) lim = λ(t) dt#0 dt 23 Censoring • A common feature of survival data is the presence of right censoring. • There are different types of censoring. Suppose that T1;T2; ··· ;Tn are i.i.d survival times. 1. Type I censoring: observe only (Ui; δi) = fmin(Ti; c);I(Ti ≤ c)g; i = 1; ··· ; n; i.e,, we only have the survival information up to a fixed time c: 2. Type II censoring: observe only T(1;n);T(2;n); ··· ;T(r;n) where T(i;n) is the ith smallest among n survival times, i.e,., 24 we only observe the first r smallest survival times. 3. Random censoring (The most common type of censoring): C1;C2; ··· ;Cn are potential censoring times for n subjects, observe only (Ui; δi) = fmin(Ti;Ci);I(Ti ≤ Ci)g; i = 1; ··· ; n: We often treat the censoring time Ci as random variables in statistical inferences. 4. Interval censoring: observe only (Li;Ui); i = 1; ··· ; n such that Ti 2 [Li;Ui): 25 Non-informative Censoring • If Ti and Ci are independent, then censoring is non-informative. • Noninformative censoring: pr(T < s + ϵjT ≥ s) ≈ pr(T < s + ϵjT ≥ s; C ≥ s) (Tsiatis, 1975) • Examples of informative and non-informative censoring: not verifiable empirically. • Consequence of informative censoring: non-identifiability 26 Likelihood Construction • In the presence of censoring, we only observe (Ui; δi); i = 1; ··· ; n: • The likelihood construction must be with respect to the bivariate random variable (Ui; δi); i = 1; ··· :n: 1. If (Ui; δi) = (ui; 1), then Ti = ui;Ci > ui 2. If (Ui; δi) = (ui; 0), then Ti ≥ ui;Ci = ui: 27 Likelihood Construction • Ci; 1 ≤ i ≤ n are known constants, i.e., conditional on Ci: 8 < f(ui); if δi = 1 Li(F ) = : S(ui); if δi = 0 Yn Yn { } δi 1−δi ) L(F ) = Li(F ) = f(ui) S(ui) : i=1 i=1 28 Likelihood Construction • Assuming Ci; 1 ≤ i ≤ n are i.i.d random variables with a CDF G(·): 8 < f(ui)(1 − G(ui)); if δi = 1 Li(F; G) = : S(ui)g(ui); if δi = 0 Yn Yn [ ] δi 1−δi ) L(F; G) = Li(F ) = ff(ui)(1 − G(ui))g fS(ui)g(ui)g : i=1 i=1 ( )( ) Yn Yn Yn δi 1−δi 1−δi δi = Li(F; G) = f(ui) S(ui) g(ui) (1 − G(ui)) : i=1 i=1 i=1 29 Likelihood Construction • We have used the noninformative censoring assumption in the likelihood construction.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    44 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us