<<

Math 659: Chapter 2 — Basic and Models (II)

Wenge Guo

July 22, 2011

Wenge Guo Math 659: Survival Analysis Review of Last lecture (1)

I A lifetime or survival time is the time until some specified event occurs. This event may be , the appearance of a tumor, the development of some disease, recurrence of a disease, equipment breakdown, cessation of breast feeding, and so on.

I Survival : A random variable X is a survival random variable if an observed outcome x of X is always positive.

I Several functions characterize the distribution of a survival random variable: density function (pdf) f (x), cumulative distribution function (cdf) F (x) (sf) S(x), function (hf) h(x), cumulative hazard function H(x), and residual lifetime mrl(x) at time x, respectively.

Wenge Guo Chapter 2 Basic Quantities and Models Review of Last lecture (2)

Implication of these functions:

I The survival function S(x) is the probability of an individual surviving to time x.

I The hazard function h(x), sometimes termed risk function, is the chance an individual of time x experiences the event in the next instant in time when he has not experienced the event at x.

I A related quantity to the hazard function is the cumulative hazard function H(x), which describes the overall risk rate from the onset to time x.

I The mean residual lifetime at age x, mrl(x), is the mean time to the event of interest, given the event has not occurred at x.

Wenge Guo Chapter 2 Basic Quantities and Models Relationship Summary

We only need to know one of these functions, the rest can be derived. In particular,

R x R ∞ I S(x) = 1 − F (x), F (x) = 0 f (t)dt, S(x) = x f (t)dt dF (x) dS(x) I f (x) = dx = − dx f (x) R x dH(x) I h(x) = S(x) , H(x) = 0 h(t)dt, h(x) = dx R ∞ R ∞ x (t−x)f (t)dt x S(t)dt I mrl(x) = S(x) = S(x) and R x dt mrl(0) − 0 mrl(t) S(x) = mrl(x) e R x I The key relation is S(x) = exp(− 0 h(t)dt) = exp(−H(x)).

Wenge Guo Chapter 2 Basic Quantities and Models Discrete Case

I Suppose that X takes values xj , j = 1, 2, 3, ···

I The probability mass function (pmf) p(xj ) = Pr(X = xj ), j = 1, 2, ··· where x1 < x2 < ··· < xj < ··· I The survival function is defined as S(x) = Pr(X > x) = P p(x ) xj >x j I Example: X with pmf p(xj ) = Pr(X = j) = 1/3, j = 1, 2, 3, then the survival function is  1, 0 ≤ x < 1   X 2/3, 1 ≤ x < 2 S(x) = Pr(X > x) = p(xj ) = 1/3, 2 ≤ x < 3 xj >x   0, x ≥ 3

18/33 Plot of the Survival Function for Discrete Case

19/33 Hazard Function for the Discrete Case

I The hf is defined as p(xj ) h(xj ) = Pr(X = xj |X ≥ xj ) = , j = 1, 2, ··· with S(xj−1) S(x0) = 1

I Because p(xj ) = S(xj−1) − S(xj ), thus S(xj−1)−S(xj ) S(xj ) h(xj ) = = 1 − S(xj−1) S(xj−1) I The survival function can be written as the product of the conditional survival . S(x) = Q S(x )/S(x ) xj ≤x j j−1

20/33 Relationship between h(xj ) and S(x)

I Note that,

S(x) =Pr(X > x) = Pr(X > x|X > xj )Pr(X > xj )

=Pr(X > x|X > xj )Pr(X > xj |X > xj−1)Pr(X > xj−1) = ···

I Because Pr(X > xj |X > xj−1) = S(xj )/S(xj−1) Q I Thus S(x) = [1 − h(x )], which provides the basis for xj ≤x j the Kaplan-Meier estimator of the survival function

I The cumulative hazard function is defined as H(x) = P h(x ) xj ≤x j I Note that S(x) = exp[−H(x)], which could be used to provide an alternative estimator for the survival function

21/33 Relationship Summary for Discrete Lifetimes

Interrelationships between the various quantities for discrete lifetimes X may be summarized as P Q I S(x) = p(x ) = [1 − h(x )], xj >x j xj ≤x j

I p(xj ) = S(xj ) − S(xj−1) = h(xj )S(xj−1), j = 1, 2,...,

p(xj ) I h(x) = , S(xj−1) P (xi+1−x)S(xi )+ j≥i+1(xj+1−xj )S(xj ) I mrl(x) = S(x) , for xi ≤ x < xi+1.

Wenge Guo Chapter 2 Basic Quantities and Models Parametric Distributions

I We mainly use nonparametric and semiparametric models and methods in this class

I Parametric models are useful and widely used in some area, such as reliability analysis, engineer

I The main purpose of survival analysis is to interpret data, for example, what is the population life distribution, what is treatment effect on the survival distribution, based on data collected

I In reliability, often it is needed to make prediction for the fraction failing of a product after three year based on one-year data. Thus extrapolation is needed and parametric models are used

I In reliability, the use of parametric models can be justified by physical/chemical principles or engineering knowledge, but it is hard to make this justification for human body 22/33 The Exponential

I The sf is S(x) = exp(−λx), λ > 0, x ≥ 0

I The pdf is f (x) = λ exp(−λx)

I The hf is λ

I The mean and sd is 1/λ

I The lack of memory property, which Pr(X ≥ x + z|x ≥ x) = Pr(X ≥ z)

23/33 The Exponential (2)

Property: An important feature of the is the ‘memoryless property’, P(X > x + z|X > x) = P(X > z). That is, on reaching any age, the probability of surviving z more units of time is the same as it was at age zero.

Example: For a component with an exponential distributed lifetime, the probability that a one-year-old component lasts 3 more months in operation is the same as the probability that a ten-year-old component lasts 3 more months in operations.

Implication: It indicates that the component’s lifetime does not pass through a period of ‘old ages’, where there is an increased risk of mortality.

Wenge Guo Chapter 2 Basic Quantities and Models The Exponential (3)

Application: The exponential probability model is one of the most commonly used probability models for modeling lifetimes of components. However, its constant hazard rate appears too restrictive in health and some industrial applications.

Parameter: The parameter β is an important , we have the following conclusion: If X ∼ exp(λ), then λX ∼ exp(1).

Wenge Guo Chapter 2 Basic Quantities and Models The

I Widely used in reliability α I The sf is S(x) = exp(−λx )

I λ is a scale parameter and α is a α−1 I The hf is λαx , which is more flexible, allows different shapes of the hazard I In particular, α determines the shape of the hazard function

I α > 1, increasing I α = 1, constant I α < 1, decreasing α I Let Z = ln(λX ), then sf of Z is exp[− exp(z)], which is called the standard smallest extreme value distribution

24/33 The Weibull Hazard Function

25/33 Several Comments on Weibull Model

I The Weibull model has a very simple hazard function and survival function.

I It is a very useful model in many engineering context. Its two parameters make the Weibull a very flexible model in a wide variety of situations: increasing , decreasing hazards, and constant hazards.

Wenge Guo Chapter 2 Basic Quantities and Models Several Comments on Weibull Model (2)

I An understanding of the hazard rate may provide insight as to what is causing the failures:

I A decreasing hazard rate would suggest ”infant mortality”. That is, defective items fail early and the decreases over time as they fall out of the population. I A constant hazard rate suggests that items are failing from random events. I An increasing hazard rate suggests ”wear out” - parts are more likely to fail as time goes on.

I However, the mean and of the distribution are more difficult to determine.

Wenge Guo Chapter 2 Basic Quantities and Models The Log

Since many lifetimes are measured on the logarithm scale, and such transformations often increase the symmetry in data, it is important to examine the log normal distribution where the transformed data are normally distributed.

A positive valued survival variable X has a log normal distribution and we write X ∼ lognormal(µ, σ) if Y = ln(X ) ∼ N(µ, σ).

Suppose Φ is the cumulative distribution function of the standard normal random variable, the survival function of X ∼ lognormal(µ, σ) is

ln(x) − µ S(x) = P(X > x) = P(ln(X ) > ln(x)) = 1 − Φ( ), σ from which the density and hazard may be obtained by differentiating S(x).

Wenge Guo Chapter 2 Basic Quantities and Models The Log Normal Distribution (2)

The features of the hazard rate: the hazard function of the lognormal is hump-shaped. It initially increases, reaches a maximum and then decreases toward 0 as lifetimes become larger and larger.

I The model is not suitable for lifetime modeling where hazards increase with old age.

I By using a part of the distribution, we can model the onset of some disease.

For the log normal distribution, the mean lifetime is given by exp(µ + σ2) and the variance by [exp(σ2) − 1]exp(2µ + σ2).

Wenge Guo Chapter 2 Basic Quantities and Models Lognormal Hazard Function

27/33 Log Logistic Distribution

Log-logistic distribution: mimic lognormal distribution properties, but it has closed-form hazard and survival functions.

A positive valued survival variable X has a log logistic distribution and we write X ∼ loglogistic(µ, σ) with two parameters µ and σ if Y = ln(X ) follows a logistic distribution with survival function

1 SY (y) = 1 − y−µ . 1 + exp[−( σ )]

Wenge Guo Chapter 2 Basic Quantities and Models Log Logistic Distribution (2)

The survival function of the log logistic distribution is 1 1 S (x) = 1 − = , X ln(x)−µ 1 + λxα 1 + exp[−( σ )]

1 µ where α = σ > 0 and λ = exp(− σ ). The hazard function is αλxα h (x) = , X (1 + λxα)2

which is similar in shape to the log normal hazard, but it is considerably easier to manipulate.

Wenge Guo Chapter 2 Basic Quantities and Models Regression Models

I To adjust the survival function to account for the covariate/explanatory variables I Examples:

I Quantitative variables: blood pressure, temperature, age, weight I Qualitative variables: gender, race, treatments, disease status t I The explanatory variable is denoted z = (z1, ··· , zp) I Two approaches in regression:

I Modeling Y = ln(X), accelerated failure time model I Modeling the hazard function h(x)

I Multiplicative hazard rate models I Additive hazard rate models

28/33 Accelerated Failure Time Model

I The first approach is analogous to the classical approach t I Y = ln(X) = µ + γ z + σW where γ is the regression coefficient vector, W is the error distribution

I If the error distribution is normal, then the regression model is a lognormal

I If the error distribution is the smallest extreme value distribution, then the regression model is the Weibull

I This model is called accelerated failure time model

I Let S0(x) denote the sf of X when z = 0, then t S(x|z) = S0[x exp(−γ z)] t I If exp(−γ z) > 1 the time scale is accelerated t I If exp(−γ z) < 1 the time scale is decelerated

29/33 Example

I Assume X is a Weibull distribution, Y = ln(X) t I The regression model is Y = γ z + σW α I The sf under baseline is S0(x) = exp(−x ) I The sf under z is t α t S(x|z) = exp{−[x exp(−γ z)] } = S0[x exp(−γ z)]

30/33 Multiplicative Hazard Model

I The Second approach is to model the hazard function as a function of covariate

I The first class of model in this approach is called the multiplicative hazard rate model

I In particular, the conditional hazard rate of an individual with covariate z is a product of the baseline hazard and a non-negative function of the covariate c(βt z), that is t h(x|z) = h0(x)c(β z) t t I For Cox model, c(β z) = exp(β z)

I A key feature of this class of model is proportional hazard t t h(x|z1) h0(x)c(β z1) c(β z1) when z is fixed = t = t h(x|z2) h0(x)c(β z2) c(β z2) c(βt z) I The sf under z is S(x|z) = S0(x)

31/33 Example

I Consider the baseline hazard to be a Weibull, that is α−1 h0(x) = αλx α−1 t I h(x|z) = αλx c(β z) α−1 t I If Cox model is used h(x|z) = αλx exp(β z)

I The sf under z is S(x|z) = exp[−λxα]exp(βt z) = exp[−λ(x exp[βt z/α])α]

I This is in the form of the accelerated failure time model

I The Weibull is the only continuous distribution which has the property of being both an accelerated failure time and a multiplicative hazards model

32/33 Additive Hazard Model

I The second class of models for the hazard rate is the family of additive hazard rate model

I The condition hazard function is modeled by

p X h(x|z) = h0(x) + zj (x)βj (x) j=1

I More details are in Chapter 10 (will not cover)

33/33