Median Survival Time: Median Survival Time 푚 Is Defined As the Quantity 푚 Satisfying 푆 푚 = 0.5

Total Page:16

File Type:pdf, Size:1020Kb

Median Survival Time: Median Survival Time 푚 Is Defined As the Quantity 푚 Satisfying 푆 푚 = 0.5 AMS 588 – Failure and Survival Data Analysis Instructor: Prof. Wei Zhu 1. Introduction Some Basics • Binomial distribution The Binomial distribution is based on the idea of a Bernoulli trial, which is an experiment with two, and only two, possible outcomes. 푌1,…푌푛 iid Bernoulli(p), 푝(푌 = 1) = 휋 = 1 − p(푌 = 0) 푌 = 1 is often termed as a “success” and 휋 is referred to as the success probability. 퐸 푌 = 휋; 푉푎푟 푌 = 휋(1 − 휋) The total number of “success” in 푛 trails follows a Binomial distribution: trials pmf: for i=0,1,…,n Binomial Theorem: For any real numbers푛and 푦 and integer 푛 ≥ 0, 푛 푛 (푥 + 푦)푛= ෍ 푥푦푛− =0 Some Basics • Hypergeometric distribution Suppose we have a large urn filled with 푁 balls that are identical in every way except that 푀 are red and N − M are green. We reach in, blindfolded, and select 퐾 balls at random. The number of red balls 푋 in a sample of size 퐾 follows a Hypergeometric 푀 푁−푀 distribution: 푥 퐾−푥 푃 푋 = 푥 푁, 푀, 퐾 = 푁 , 푥 = 0,1, … , 퐾 퐾푀 퐾 E X = μ = ; 푁 퐾푀 (푁 − 푀)(푁 − 퐾) 푉푎푟(푋) = 휎2 = 푁 푁(푁 − 1) Example: Let 푋1 and 푋2 be independent observations with 푋1~푏푛표푚푎푙 푛1, 푝1 and 푋2~푏푛표푚푎푙 푛2, 푝2 . Consider testing 퐻0: 푝1 = 푝2 푣푠 퐻1: 푝1 > 푝2. It is easy to show that under 퐻0: 푝1 = 푝2 = 푝, 푋 = 푋1 + 푋2is a sufficient statistics for 푝, and 푋1|푋 = 푥~ℎ푦푝푒푟푔푒표푚푒푡푟푐(푛1 + 푛2, 푛1, 푥) -- Fisher’s exact test The data can also be formulated into a 2x2 table Under 퐻0: 푝1 = 푝2 x1 n1-x1 n1 푛1 푛2 푥 푥−푥 x2 n2-x2 n2 1 1 (hypergeometric) 푓 푥1 푛1, 푛2, 푥 = 푛 x n-x n 푥 Some Basics • Iterative expectation & variance If 푋 and 푌 are any two random variables, then 퐸푋 = 퐸 퐸(푋|푌) 푉푎푟 푋 = 퐸 푉푎푟 푋 푌 + 푉푎푟 퐸 푋|푌 Provided that the expectations exist Example: 푋|푃~binomial(푛, 푃) and 푃~beta(α, β) α 퐸푋 = 퐸 퐸(푋|푃) = 퐸 푛푃 = 푛 α + β 푉푎푟 푋 = 퐸 푉푎푟 푋 푃 + 푉푎푟 퐸 푋|푃 = 퐸 푛푃(1 − 푃) + 푉푎푟 푛푃 αβ αβ = 푛 + 푛2 α + β α + β + 1 α + β 2 α + β + 1 αβ α + β + 푛 = 푛 α + β 2 α + β + 1 Some Basics Central Limit Theorem: n Delta method: Survival Analysis In many biomedical applications the primary endpoint of interest is time to a certain event, such as time to death; time it takes for a patient to respond to a therapy; time from response until disease relapse (i.e., disease returns); time to eradicate an infection after treatment with antibiotics; etc. We may be interested in • characterizing the distribution of “time to event” for a given population ; • comparing this “time to event” among different groups (e.g., treatment vs. control in a clinical trial or an observational study); • modeling the relationship of “time to event” to other covariates (sometimes called prognostic factors or predictors). Difficulties: • Censoring: Since the data are collected over a finite period of time, the “time to event” may not be observed for all the individuals in our study population (sample). This results in what is called censored data. • Differential follow-up: It is also common that the amount of follow-up for the individuals in a sample vary from subject to subject. The standard statistical methods cannot properly handle these difficulties in the analysis of the “time to event” data. A new research area in statistics has emerged which is called Survival Analysis or Censored Survival Analysis. “Time To Event” Let the random variable T denote time to the event of our interest. Of course, T is a positive random variable which has to be unambiguously defined; that is, we must be very specific about the start and end with the length of the time period in-between corresponding to T. EX: Survival time of a treatment for a population with certain disease: measured from the time of treatment initiation until death. cdf of T: 퐹 푡 = 푃 푇 ≤ 푡 , 푡 ≥ 0 Right continuous: lim 퐹(푢) = 퐹 푡 푢→푡+ When 푇 is a survival time, 퐹 푡 is the probability that a randomly selected subject from the population will die before time 푡. 푑퐹 푡 푡 pdf of T: 푓 푡 = , 퐹 푡 = ׬ 푓 푢 푑푢 푑푡 0 In biomedical applications, it is often common to use the survival function 푆 푡 = 푃 푇 ≥ 푡 = 1 − 퐹(푡−) 푤ℎ푒푟푒, 퐹 푡− = lim 퐹(푢) 푢→푡− When T is a survival time, S(t) is the probability that a randomly selected individual will survive to time t or beyond. Note: Sometimes, a survival function may be defined as 푆 푡 = 푃 푇 > 푡 = 1 − 퐹 푡 . This definition will be identical to the above one if T is a continuous random variable, which is the case we will focus on in this course. 푆 푡 • Non-increasing; • 푆 0 = 1 and 푆 ∞ = 0 for a proper random variable, which means that everyone will eventually experience the event, e.g. death. • However, we may also allow the possibility that 푆 ∞ > 0. This corresponds to a situation where there is a positive probability of not “dying” or not experiencing the event. For example, if the event of interest is the time from response until disease relapse and the disease has a cure for some proportion of individuals in the population, then we have 푆 ∞ > 0, where 푆 ∞ corresponds to the proportion of cured individuals. • If 푇 is continuous r.v., ∞ 푑푆 푡 푆 푡 = න 푓 푢 푑푢 , 푓 푡 = − 푡 푑푡 That is, there is a one-to-one correspondence between 푓 푡 and 푆 푡 . Here f(t) is the usual probability density function (pdf), where f(t) = dF(t)/dt Definitions Mean Survival Time: μ = 퐸(푇). Due to censoring, sample mean of observed survival times is no longer an unbiased estimate of μ = 퐸(푇). If we can estimate 푆 푡 well, then we can estimate μ = 퐸(푇) using the following fact: ∞ ∞ ∞ 퐸 푇 = න 푡푑퐹 푡 = − න 푡푑푆 푡 = න 푆 푡 푑푡 0 0 0 Median Survival Time: Median survival time 푚 is defined as the quantity 푚 satisfying 푆 푚 = 0.5. Sometimes denoted by 푡0.5. If 푆 푡 is not strictly decreasing, 푚 is the smallest one such that 푆 푚 ≤ 0.5. pth quantile of Survival Time (100pth percentile): 푡푝 such that 푆 푡푝 = 1 − 푝 0 < 푝 < 1 . If 푆 푡 is not strictly decreasing, 푡푝 is the smallest one such that 푆 푡푝 ≤ 1 − 푝 Mean Residual Life Time(푚푟푙): 푚푟푙 푡0 = 퐸[푇 − 푡0|푇 ≥ 푡0] i.e., 푚푟푙 푡0 = average remaining survival time given the population has survived beyond 푡0. It can be shown that ∞ ׬ 푆 푡 푑푡 푡0 푚푟푙 푡0 = 푆 푡0 EX: Figure 1.1: The survival function for a hypothetical population In this population: • 70% of the individuals will survive at least 2 years (i.e., 푡0.3 = 2) and • the median survival time is 2.8 years (i.e., 50% of the population will survive at least 2.8 years). is stochastically larger than Figure 1.2: We say that the survival distribution for group 1 is stochastically larger than the survival distribution for group 2 if 푆1 푡 ≥ 푆2 푡 , for all 푡 ≥ 0, where 푆 푡 is the survival function for group . If 푇 is the corresponding survival time for groups , we also say that 푇1 is stochastically (not deterministically) larger than 푇2. Note that 푇1 being stochastically larger than 푇1 does NOT necessarily imply that 푇1 ≥ 푇2. Mortality Rate The mortality rate at time 푡, where 푡 is generally taken to be an integer in terms of some unit of time (e.g., years, months, days, etc), is the proportion of the population who fail (die) between times 푡 and 푡 + 1among individuals alive at time 푡, , i.e., 푚 푡 = 푃[푡 ≤ 푇 < 푡 + 1|푇 ≥ 푡] Figure 1.3: A typical mortality pattern for human Hazard Rate The hazard rate is the limit of a mortality rate if the interval of time is taken to be small (rather than one unit). The hazard rate is the instantaneous rate of failure (experiencing the event) at time 푡 given that an individual is alive at time 푡. 푃[푡 ≤ 푇 < 푡 + ℎ|푇 ≥ 푡] 휆 푡 = lim ℎ→0 ℎ 푃[푡 ≤ 푇 < 푡 + ℎ] lim ℎ = ℎ→0 푃[푇 ≥ 푡] 푓(푡) 푆′ 푡 푑푙표푔{푆 푡 } = = − = − 푆(푡) 푆 푡 푑푡 • A useful way of describing the distribution of “time to event” because it has a natural interpretation that relates to the aging of a population; • Very popular in biomedical community. • If ℎ is very small, we have 푃 푡 ≤ 푇 < 푡 + ℎ 푇 ≥ 푡 ≈휆 푡 ℎ 푡 Λ 푡 = ׬0 휆 푢 푑푢 = −푙표푔{푆 푡 }, where Λ 푡 is referred to as the cumulative • hazard function. Here we used the fact that S(0) = 1. Hence, 푡 ׬ 휆 푢 푑푢 − 푆 푡 = 푒−Λ 푡 = 푒 0 Note: 1. There is a one-to-one relationship between hazard rate 휆 푡 , 푡 ≥ 0, and survival function 푆 푡 , namely, 푡 { ׬ 휆 푢 푑푢 푑푙표푔{푆 푡 − 푆 푡 = 푒 0 , 휆 푡 = − 푑푡 2. The hazard rate is NOT a probability, but a probability rate. Therefore it is possible that a hazard rate can exceed one in the same fashion as a density function푓 푡 may exceed one.
Recommended publications
  • Summary Notes for Survival Analysis
    Summary Notes for Survival Analysis Instructor: Mei-Cheng Wang Department of Biostatistics Johns Hopkins University Spring, 2006 1 1 Introduction 1.1 Introduction De¯nition: A failure time (survival time, lifetime), T , is a nonnegative-valued random vari- able. For most of the applications, the value of T is the time from a certain event to a failure event. For example, a) in a clinical trial, time from start of treatment to a failure event b) time from birth to death = age at death c) to study an infectious disease, time from onset of infection to onset of disease d) to study a genetic disease, time from birth to onset of a disease = onset age 1.2 De¯nitions De¯nition. Cumulative distribution function F (t). F (t) = Pr(T · t) De¯nition. Survial function S(t). S(t) = Pr(T > t) = 1 ¡ Pr(T · t) Characteristics of S(t): a) S(t) = 1 if t < 0 b) S(1) = limt!1 S(t) = 0 c) S(t) is non-increasing in t In general, the survival function S(t) provides useful summary information, such as the me- dian survival time, t-year survival rate, etc. De¯nition. Density function f(t). 2 a) If T is a discrete random variable, f(t) = Pr(T = t) b) If T is (absolutely) continuous, the density function is Pr (Failure occurring in [t; t + ¢t)) f(t) = lim ¢t!0+ ¢t = Rate of occurrence of failure at t: Note that dF (t) dS(t) f(t) = = ¡ : dt dt De¯nition. Hazard function ¸(t).
    [Show full text]
  • SOLUTION for HOMEWORK 1, STAT 3372 Welcome to Your First
    SOLUTION FOR HOMEWORK 1, STAT 3372 Welcome to your first homework. Remember that you are always allowed to use Tables allowed on SOA/CAS exam 4. You can find them on my webpage. Another remark: 6 minutes per problem is your “speed”. Now you probably will not be able to solve your problems so fast — but this is the goal. Try to find mistakes (and get extra points) in my solutions. Typically they are silly arithmetic mistakes (not methodological ones). They allow me to check that you did your HW on your own. Please do not e-mail me about your findings — just mention them on the first page of your solution and count extra points. You can use them to compensate for wrongly solved problems in this homework. They cannot be counted beyond 10 maximum points for each homework. Now let us look at your problems. General Remark: It is always prudent to begin with writing down what is given and what you need to establish. This step can help you to understand/guess a possible solution. Also, your solution should be written neatly so, if you have extra time, it is easier to check your solution. 1. Problem 2.4 Given: The hazard function is h(x)=(A + e2x)I(x ≥ 0) and S(0.4) = 0.5. Find: A Solution: I need to relate h(x) and S(x) to find A. There is a formula on page 17, also discussed in class, which is helpful here. Namely, x − h(u)du S(x)= e 0 . R It allows us to find A.
    [Show full text]
  • Study of Generalized Lomax Distribution and Change Point Problem
    STUDY OF GENERALIZED LOMAX DISTRIBUTION AND CHANGE POINT PROBLEM Amani Alghamdi A Dissertation Submitted to the Graduate College of Bowling Green State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY August 2018 Committee: Arjun K. Gupta, Committee Co-Chair Wei Ning, Committee Co-Chair Jane Chang, Graduate Faculty Representative John Chen Copyright c 2018 Amani Alghamdi All rights reserved iii ABSTRACT Arjun K. Gupta and Wei Ning, Committee Co-Chair Generalizations of univariate distributions are often of interest to serve for real life phenomena. These generalized distributions are very useful in many fields such as medicine, physics, engineer- ing and biology. Lomax distribution (Pareto-II) is one of the well known univariate distributions that is considered as an alternative to the exponential, gamma, and Weibull distributions for heavy tailed data. However, this distribution does not grant great flexibility in modeling data. In this dissertation, we introduce a generalization of the Lomax distribution called Rayleigh Lo- max (RL) distribution using the form obtained by El-Bassiouny et al. (2015). This distribution provides great fit in modeling wide range of real data sets. It is a very flexible distribution that is related to some of the useful univariate distributions such as exponential, Weibull and Rayleigh dis- tributions. Moreover, this new distribution can also be transformed to a lifetime distribution which is applicable in many situations. For example, we obtain the inverse estimation and confidence intervals in the case of progressively Type-II right censored situation. We also apply Schwartz information approach (SIC) and modified information approach (MIC) to detect the changes in parameters of the RL distribution.
    [Show full text]
  • Review of Multivariate Survival Data
    Review of Multivariate Survival Data Guadalupe G´omez(1), M.Luz Calle(2), Anna Espinal(3) and Carles Serrat(1) (1) Universitat Polit`ecnica de Catalunya (2) Universitat de Vic (3) Universitat Aut`onoma de Barcelona DR2004/15 Review of Multivariate Survival Data ∗ Guadalupe G´omez [email protected] Departament d'Estad´ıstica i I.O. Universitat Polit`ecnica de Catalunya M.Luz Calle [email protected] Departament de Matem`atica i Inform`atica Universitat de Vic Carles Serrat [email protected] Departament de Matem`atica Aplicada I Universitat Polit`ecnica de Catalunya Anna Espinal [email protected] Departament de Matem`atiques Universitat Aut`onoma de Barcelona September 2004 ∗Partially supported by grant BFM2002-0027 PB98{0919 from the Ministerio de Cien- cia y Tecnologia Contents 1 Introduction 1 2 Review of bivariate nonparametric approaches 3 2.1 Introduction and notation . 3 2.2 Nonparametric estimation of the bivariate distribution func- tion under independent censoring . 5 2.3 Nonparametric estimation of the bivariate distribution func- tion under dependent right-censoring . 9 2.4 Average relative risk dependence measure . 18 3 Extensions of the bivariate survival estimator 20 3.1 Estimation of the bivariate survival function for two not con- secutive gap times. A discrete approach . 20 3.2 Burke's extension to two consecutive gap times . 25 4 Multivariate Survival Data 27 4.1 Notation . 27 4.2 Likelihood Function . 29 4.3 Modelling the time dependencies via Partial Likelihood . 31 5 Bayesian approach for modelling trends and time dependen- cies 35 5.1 Modelling time dependencies via indicators .
    [Show full text]
  • Survival and Reliability Analysis with an Epsilon-Positive Family of Distributions with Applications
    S S symmetry Article Survival and Reliability Analysis with an Epsilon-Positive Family of Distributions with Applications Perla Celis 1,*, Rolando de la Cruz 1 , Claudio Fuentes 2 and Héctor W. Gómez 3 1 Facultad de Ingeniería y Ciencias, Universidad Adolfo Ibáñez, Diagonal Las Torres 2640, Peñalolén, Santiago 7941169, Chile; [email protected] 2 Department of Statistics, Oregon State University, 217 Weniger Hall, Corvallis, OR 97331, USA; [email protected] 3 Departamento de Matemática, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1240000, Chile; [email protected] * Correspondence: [email protected] Abstract: We introduce a new class of distributions called the epsilon–positive family, which can be viewed as generalization of the distributions with positive support. The construction of the epsilon– positive family is motivated by the ideas behind the generation of skew distributions using symmetric kernels. This new class of distributions has as special cases the exponential, Weibull, log–normal, log–logistic and gamma distributions, and it provides an alternative for analyzing reliability and survival data. An interesting feature of the epsilon–positive family is that it can viewed as a finite scale mixture of positive distributions, facilitating the derivation and implementation of EM–type algorithms to obtain maximum likelihood estimates (MLE) with (un)censored data. We illustrate the flexibility of this family to analyze censored and uncensored data using two real examples. One of them was previously discussed in the literature; the second one consists of a new application to model recidivism data of a group of inmates released from the Chilean prisons during 2007.
    [Show full text]
  • Bp, REGRESSION on MEDIAN RESIDUAL LIFE FUNCTION FOR
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by D-Scholarship@Pitt REGRESSION ON MEDIAN RESIDUAL LIFE FUNCTION FOR CENSORED SURVIVAL DATA by Hanna Bandos M.S., V.N. Karazin Kharkiv National University, 2000 Submitted to the Graduate Faculty of The Department of Biostatistics Graduate School of Public Health in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Pittsburgh 2007 Bp, UNIVERSITY OF PITTSBURGH Graduate School of Public Health This dissertation was presented by Hanna Bandos It was defended on July 26, 2007 and approved by Dissertation Advisor: Jong-Hyeon Jeong, PhD Associate Professor Biostatistics Graduate School of Public Health University of Pittsburgh Dissertation Co-Advisor: Joseph P. Costantino, DrPH Professor Biostatistics Graduate School of Public Health University of Pittsburgh Janice S. Dorman, MS, PhD Professor Health Promotion & Development School of Nursing University of Pittsburgh Howard E. Rockette, PhD Professor Biostatistics Graduate School of Public Health University of Pittsburgh ii Copyright © by Hanna Bandos 2007 iii REGRESSION ON MEDIAN RESIDUAL LIFE FUNCTION FOR CENSORED SURVIVAL DATA Hanna Bandos, PhD University of Pittsburgh, 2007 In the analysis of time-to-event data, the median residual life (MERL) function has been promoted by many researchers as a practically relevant summary of the residual life distribution. Formally the MERL function at a time point is defined as the median of the remaining lifetimes among survivors beyond that particular time point. Despite its widely recognized usefulness, there is no commonly accepted approach to model the median residual life function. In this dissertation we introduce two novel regression techniques that model the relationship between the MERL function and covariates of interest at multiple time points simultaneously; proportional median residual life model and accelerated median residual life model.
    [Show full text]
  • Log-Epsilon-S Ew Normal Distribution and Applications
    Log-Epsilon-Skew Normal Distribution and Applications Terry L. Mashtare Jr. Department of Biostatistics University at Bu®alo, 249 Farber Hall, 3435 Main Street, Bu®alo, NY 14214-3000, U.S.A. Alan D. Hutson Department of Biostatistics University at Bu®alo, 249 Farber Hall, 3435 Main Street, Bu®alo, NY 14214-3000, U.S.A. Roswell Park Cancer Institute, Elm & Carlton Streets, Bu®alo, NY 14263, U.S.A. Govind S. Mudholkar Department of Statistics University of Rochester, Rochester, NY 14627, U.S.A. August 20, 2009 Abstract In this note we introduce a new family of distributions called the log- epsilon-skew-normal (LESN) distribution. This family of distributions can trace its roots back to the epsilon-skew-normal (ESN) distribution developed by Mudholkar and Hutson (2000). A key advantage of our model is that the 1 well known lognormal distribution is a subclass of the LESN distribution. We study its main properties, hazard function, moments, skewness and kurtosis coe±cients, and discuss maximum likelihood estimation of model parameters. We summarize the results of a simulation study to examine the behavior of the maximum likelihood estimates, and we illustrate the maximum likelihood estimation of the LESN distribution parameters to a real world data set. Keywords: Lognormal distribution, maximum likelihood estimation. 1 Introduction In this note we introduce a new family of distributions called the log-epsilon-skew- normal (LESN) distribution. This family of distributions can trace its roots back to the epsilon-skew-normal (ESN) distribution developed by Mudholkar and Hutson (2000) [14]. A key advantage of our model is that the well known lognormal distribu- tion is a subclass of the LESN distribution.
    [Show full text]
  • Introduction to Survival Analysis in Practice
    machine learning & knowledge extraction Review Introduction to Survival Analysis in Practice Frank Emmert-Streib 1,2,∗ and Matthias Dehmer 3,4,5 1 Predictive Society and Data Analytics Lab, Faculty of Information Technolgy and Communication Sciences, Tampere University, FI-33101 Tampere, Finland 2 Institute of Biosciences and Medical Technology, FI-33101 Tampere, Finland 3 Steyr School of Management, University of Applied Sciences Upper Austria, 4400 Steyr Campus, Austria 4 Department of Biomedical Computer Science and Mechatronics, UMIT- The Health and Life Science University, 6060 Hall in Tyrol, Austria 5 College of Artificial Intelligence, Nankai University, Tianjin 300350, China * Correspondence: [email protected]; Tel.: +358-50-301-5353 Received: 31 July 2019; Accepted: 2 September 2019; Published: 8 September 2019 Abstract: The modeling of time to event data is an important topic with many applications in diverse areas. The collective of methods to analyze such data are called survival analysis, event history analysis or duration analysis. Survival analysis is widely applicable because the definition of an ’event’ can be manifold and examples include death, graduation, purchase or bankruptcy. Hence, application areas range from medicine and sociology to marketing and economics. In this paper, we review the theoretical basics of survival analysis including estimators for survival and hazard functions. We discuss the Cox Proportional Hazard Model in detail and also approaches for testing the proportional hazard (PH) assumption. Furthermore, we discuss stratified Cox models for cases when the PH assumption does not hold. Our discussion is complemented with a worked example using the statistical programming language R to enable the practical application of the methodology.
    [Show full text]
  • Estimating the Survival Function
    S-Plus Commands for Survival Estimation Estimating the Survival Function > t_c(1,1,2,2,3,4,4,5,5,8,8,8,8,11,11,12,12,15,17,22,23) One-sample nonparametric methods: > surv.fit(t,status=rep(1,21)) 95 percent confidence interval is of type "log" We will consider three methods for estimating a survivorship time n.risk n.event survival std.dev lower 95% CI upper 95% CI function 1 21 2 0.90476190 0.06405645 0.78753505 1.0000000 2 19 2 0.80952381 0.08568909 0.65785306 0.9961629 S(t)=Pr(T ≥ t) 3 17 1 0.76190476 0.09294286 0.59988048 0.9676909 4 16 2 0.66666667 0.10286890 0.49268063 0.9020944 5 14 2 0.57142857 0.10798985 0.39454812 0.8276066 without resorting to parametric methods: 8 12 4 0.38095238 0.10597117 0.22084536 0.6571327 11 8 2 0.28571429 0.09858079 0.14529127 0.5618552 12 6 2 0.19047619 0.08568909 0.07887014 0.4600116 15 4 1 0.14285714 0.07636035 0.05010898 0.4072755 17 3 1 0.09523810 0.06405645 0.02548583 0.3558956 22 2 1 0.04761905 0.04647143 0.00703223 0.3224544 (1) Kaplan-Meier 23 1 1 0.00000000 NA NA NA (2) Life-table (Actuarial Estimator) (3) via the Cumulative hazard estimator 1 2 (1) The Kaplan-Meier Estimator Motivation: First, consider an example where there is no censoring. The Kaplan-Meier (or KM) estimator is probably The following are times of remission (weeks) for 21 leukemia the most popular approach.
    [Show full text]
  • University of California Santa Cruz Bayesian Inference
    UNIVERSITY OF CALIFORNIA SANTA CRUZ BAYESIAN INFERENCE FOR MEAN RESIDUAL LIFE FUNCTIONS IN SURVIVAL ANALYSIS A project document submitted in partial satisfaction of the requirements for the degree of MASTER OF SCIENCE in STATISTICS AND APPLIED MATHEMATICS by Valerie A. Poynor December 2010 This document is approved: Associate Professor Athanasios Kottas, Chair Associate Professor Raquel Prado Copyright c by Valerie A. Poynor 2010 Table of Contents List of Figures v List of Tables vi Abstract vii Dedication viii Acknowledgments ix 1 Introduction 1 2 Mean Residual Life Functions: Theory and Properties 4 2.1 Probabilistic Properties . 5 2.1.1 Elementary Identities . 5 2.1.2 Bounds for MRL Functions . 7 2.1.3 Properties of MRL (Inversion Formula) . 8 2.2 MRL Functions for Specific Distributions . 10 2.2.1 Linear MRL . 10 2.2.2 The Form of the Mean Residual Life for Some Common Distribu- tions . 13 3 Bayesian Inference for MRL Functions 26 3.1 Literature Review . 26 3.2 Exponentiated Weibull Model . 28 3.3 Nonparametric Lognormal Mixture Model . 32 3.3.1 Model Formulation . 32 3.3.2 Posterior Inference . 36 3.4 Example . 40 3.4.1 Results . 41 3.4.2 Model Comparison . 46 iii 4 Discussion and Conclusion 49 Bibliography 52 A Proofs 56 A.1 Equalities and Bounds of MRL . 56 A.2 properties of MRL . 59 iv List of Figures 2.1 (left) Linear mrl for X with A = 4 (slope) and B = 1 (intercept). (right) Corresponding survival function of X.................... 11 2.2 (left) Linear mrl for X with A = −0:2 (slope) and B = 1 (intercept).
    [Show full text]
  • Modeling Absolute Differences in Life Expectancy with a Censored Skew-Normal Regression Approach
    Modeling absolute diVerences in life expectancy with a censored skew-normal regression approach Andre´ Moser1,2 , Kerri Clough-Gorr2 and Marcel Zwahlen2, for the SNC study group 1 Department of Geriatrics, Bern University Hospital, and Spital Netz Bern Ziegler, and University of Bern, Bern, Switzerland 2 Institute of Social and Preventive Medicine (ISPM), University of Bern, Bern, Switzerland ABSTRACT Parameter estimates from commonly used multivariable parametric survival regression models do not directly quantify diVerences in years of life expectancy. Gaussian linear regression models give results in terms of absolute mean diVerences, but are not appropriate in modeling life expectancy, because in many situations time to death has a negative skewed distribution. A regression approach using a skew-normal distribution would be an alternative to parametric survival models in the modeling of life expectancy, because parameter estimates can be interpreted in terms of survival time diVerences while allowing for skewness of the distribution. In this paper we show how to use the skew-normal regression so that censored and left-truncated observations are accounted for. With this we model diVerences in life expectancy using data from the Swiss National Cohort Study and from oYcial life expectancy estimates and compare the results with those derived from commonly used survival regression models. We conclude that a censored skew-normal survival regression approach for left-truncated observations can be used to model diVerences in life expectancy
    [Show full text]
  • LIFE CONGINGENCY MODELS I Contents 1 Survival Models
    LIFE CONGINGENCY MODELS I VLADISLAV KARGIN Contents 1 Survival Models . 3 1.1 Survival function . 3 1.2 Expectation and survival function . 4 1.3 Actuarial notation . 8 1.4 Force of mortality . 13 1.5 Expectation of Life . 15 1.6 Future curtate lifetime. 18 1.7 Common analytical survival models . 20 2 Using Life Tables . 25 2.1 Life Tables. 25 2.2 Continuous Calculations Using Life Tables . 30 2.3 Select and ultimate tables . 35 3 Life Insurance . 38 3.1 Introduction . 38 3.2 Whole life insurance . 40 3.3 Term Life Insurance. 48 3.4 Deferred Life Insurance . 51 3.5 Pure Endowment Life Insurance . 55 3.6 Endowment Life Insurance. 56 3.7 Deferred Term Life Insurance . 58 3.8 Non-level payments paid at the end of the year . 59 3.9 Life insurance paid m times a year . 61 3.10 Level benefit insurance in the continuous case . 62 3.11 Calculating APV from life tibles. 64 3.12 Insurance for joint life and last survivor. 67 1 2 VLADISLAV KARGIN 4 Annuities . 72 4.1 Introduction . 72 4.2 Whole life annuities. 74 4.3 Annuities - Woolhouse approximation . 81 4.4 Annuities and UDD assumption . 84 5 Benefit Premiums . 86 5.1 Funding a liability. 86 5.2 Fully discrete benefit premiums: Pricing by equiva- lence principle . 87 5.3 Compensation for risk. 91 5.4 Premiums paid m times a year . 95 5.5 Benefit Premiums - Adjusting for Expenses . 96 VLADISLAV KARGIN 3 ~ SECTION 1 Survival Models The primary responsibility of the life insurance actuary is to maintain the solvency and profitability of the insurer.
    [Show full text]