Lecture 8 Models for Censored and Truncated Data - Tobit Model

Total Page:16

File Type:pdf, Size:1020Kb

Lecture 8 Models for Censored and Truncated Data - Tobit Model RS – Lecture 17 Lecture 8 Models for Censored and Truncated Data - Tobit Model 1 Censored and Truncated Data • In some data sets we do not observe values above or below a certain magnitude, due to a censoring or truncation mechanism. Examples : - A central bank intervenes to stop an exchange rate falling below or going above certain levels. - Dividends paid by a company may remain zero until earnings reach some threshold value. - A government imposes price controls on some goods. - A survey of only working women, ignoring non-working women. In these situations, the observed data consists of a combination of measurements of some underlying latent variable and observations that arise when the censoring/truncation mechanism is applied. 2 RS – Lecture 17 Censored and Truncated Data: Definitions • Y is censored when we observe X for all observations, but we only know the true value of Y for a restricted range of observations. Values of Y in a certain range are reported as a single value or there is significant clustering around a value, say 0. - If Y = k or Y > k for all Y, then Y is censored from below or left-censored. - If Y = k or Y < k for all Y, then Y is censored from above or right-censored. We usually think of an uncensored Y, Y*, the true value of Y when the censoring mechanism is not applied. We typically have all the observations for { Y,X }, but not {Y*,X }. • Y is truncated when we only observe X for observations where Y would not be censored. We do not have a full sample for { Y,X }, we exclude observations based on characteristics of Y. Censored and Truncated Data: Example 1 No censoring or truncation 10 8 6 y 4 2 0 0 2 4 6 x • We observe the full range of Y and the full range of X. RS – Lecture 17 Censored and Truncated Data: Example 2 Censored from above 10 8 6 y 4 2 0 0 2 4 6 x • If Y ≥ 6, we do not know its exact value. Example : A Central Bank intervenes if the exchange rate hits the band’s upper limit. => If S t ≥ Ē => S t= Ē . Censored and Truncated Data: Example 2 0.45 Pdf(S t) 0.40 0.35 Prob(S t≤ Ē) 0.30 observed part Prob(S t> Ē) 0.25 0.20 0.15 0.10 0.05 0.00 Ē St • The pdf of the exchange rate, S t, is a mixture of discrete (mass at St=Ē) and continuous (Prob[S t<Ē]) distributions. RS – Lecture 17 Censored and Truncated Data: Example 3 Censored from below 10 8 6 y 4 2 0 0 2 4 6 x • If Y ≤ 5, we do not know its exact value. Example : A Central Bank intervenes if the exchange rate hits the band’s lower limit. => If S t ≤ Ē => S t= Ē. Censored and Truncated Data: Example 3 PDF(y *) f(y*) Prob(y*<5) Prob(y*>5) 5 y* • The pdf of the observable variable, y, is a mixture of discrete (prob. mass at Y=5) and continuous (Prob[Y*>5]) distributions. 8 RS – Lecture 17 Censored and Truncated Data: Example 3 PDF(y *) Prob(y*<5) Prob(y*>5) 5 Y • Under censoring we assign the full probability in the censored region to the censoring point, 5. 9 Censored and Truncated Data: Example 4 Truncated 10 8 6 y 4 2 0 0 2 4 6 x • If Y < 3, the value of X (or Y) is unknown. (Truncation from below.) Example : If a family’s income is below certain level, we have no information about the family’s characteristics. RS – Lecture 17 Censored and Truncated Data: Example 4 PDF(Y) 0.45 0.40 0.35 Prob[Y>3] < 1.0 0.30 0.25 0.20 0.15 0.10 0.05 0.00 1 5 93 13 17 21 25 29 33 37 41 45Y 49 • Under data censoring, the censored distribution is a combination of a pmf plus a pdf. They add up to 1. We have a different situation under truncation. To create a pdf for Y we will use a conditional pdf. Censored Normal • Moments: Let y*~ N(µ*, σ2) and α = ( c – µ*)/ σ. Then - Data: yi = yi* if yi* ≥ c = c if yi* ≤ c - Prob(y = c |x) = Prob(y* ≤ c |x) = Prob[(y*- µ*)/σ ≤ (c - µ*)/σ|x] = Prob[z ≤ (c - µ*)/σ|x] = Φ(α) - Prob(y > c |x) = Prob(y* > 0|x) = 1- Φ(α) - First Moment - Conditional: E[y|y*> c] = µ*+ σ λ(α) - Unconditional: E[y] = Φ(α) c + (1- Φ(α)) [µ*+ σ λ(α)] 12 RS – Lecture 17 Censored Normal • To get the first moment, we use a useful result –proven later- for a truncated normal distribution. If v is a standard normal variable and the truncation is from below at c, a constant, then φ(c) E(v | v > c) = F −1 f = 1− Φ(c) - In our conditional model, c = -(xi’β). (Note that the expectation is also conditioned on x, thus x is treated as a constant.). -1 - Note : The ratio Fi fi (a pdf dividided by a CDF) is called Inverse Mill’s ratio , usually denoted by λ(.) –the hazard function . If the truncation is from above: φ(c) λ(c) = − 1− Φ(c) 13 Censored Normal • To get the first moment, we use a useful result –proven later- for a truncated normal distribution. If v is a standard normal variable and the truncation is from below at c, a constant, then φ(c) φ(−c) E(v | v > c) = F −1 f = = 1− Φ(c) Φ(−c) - In our conditional model, c = -(xi’β). (Note that the expectation is also conditioned on x, thus x is treated as a constant.). -1 - Note : The ratio Fi fi (a pdf dividided by a CDF) is called Inverse Mill’s ratio , usually denoted by λ(.) –the hazard function . If the 0.45 truncation is from above: 0.40 0.35 0.30 φ(c) 0.25 λ(c) = − 0.20 1− Φ(c) 0.15 0.10 0.05 0.00 14 RS – Lecture 17 Censored Normal • Moments (continuation): - Unconditional first moment: E[y] = Φ(α) c + (1- Φ(α)) [µ*+ σ λ(α)] If c =0 => E[y] = (1- Φ(–µ*/ σ)) [µ*+ σ λ(α)] = Φ(µ*/ σ)) [µ*+ σ λ(α)] - Second Moment (1 − δ ) Vary2 1 ()()=σ() −Φ α 2 +()() α−λ Φ α whereδ=λ2 −λα 0 ≤δ≤ 1 15 Censored Normal – Hazard Function • The moments depend on λ(c), the Inverse Mill’s ratio or hazard rate evaluated at c: φ(c) φ(−c) λ(c) = = 1− Φ(c) Φ(−c) It is a monotonic function that begins at zero (when c = -∞) and asymptotes at infinity (when c = -∞). See plot below for (–c). 16 RS – Lecture 17 Truncated Normal • Moments: Let y*~ N(µ*, σ2) and α = ( c – µ*)/ σ. - First moment: E[y*|y> c] = µ* + σ λ(α) <= This is the truncated regression. => If µ>0 and the truncation is from below –i.e., λ(α) >0–, the mean of the truncated variable is greater than the original mean Note : For the standard normal distribution λ(α) is the mean of the truncated distribution. - Second moment: - Var[y*|y> c] = σ2[1 - δ(α)] where δ(α) = λ(α) [λ(α)- α] => Truncation reduces variance! This result is general, it applies to upper or lower truncation given that 0 ≤ δ(α) ≤ 1 17 Truncated Normal * Model: yi = X iβ + εi Data: y = y* | y* > 0 f(y|y *>0,X) F(0|X ) f(y*|X) 0 β Xi Xiβ+σλ • Truncated regression model: * E(y i| yi > 0,X i) = X iβ + σλ i 18 RS – Lecture 17 Censored and Truncated Data: Intuition • To model the relation between the observed y and x, we consider a latent variable y* that is subject to censoring/truncation. A change in x affects y only through the effect of x on y*. - Model the true, latent variable: y* = f(x ) = X β + error - Observed Variable: y = h(y*) • Q: What is this latent variable? Considered the variable “Dividends paid last quarter”? - For all the respondents with y=0 ( left censored from below), we think of a latent variable for “excess cash” that underlies “dividends paid last quarter.” Extremely cash poor would pay negative dividends if that were possible. Censored and Truncated Data: Problems • In the presence of censoring/truncation, we have a dependent variable, y, with special attribute(s): (1) constrained and (2) clustered of observations at the constraint. Examples : - Consumption (1, not 2) - Wage changes (2, not 1) - Labor supply (1 & 2) • These attributes create problems for the linear model. RS – Lecture 17 Censored and Truncated Data: Problems • Censoring in a regression framework (from Ruud). • If y is constrained and if there is clustering – OLS on the complete sample biased and inconsistent. – OLS on the unclustered part biased and inconsistent. Data Censoring and Corner Solutions • The dependent variable is partly continuous, but with positive mas at one or more points. Two different cases: (1) Data censoring (from above or below) - Surveys for wealth (upper category $250,000 or more) - Duration in unemployment - Demand for stadium tickets => Y* is not observed because of constraints, selection technique or measuring method (data problem) - We are interested in the effect of x on y*: E( y*| x). =>If yi* = x i’ β + εi, βj measures the effect of a change of xj on y*.
Recommended publications
  • An Evaluation of Economic Efficiency of Finnish Regions by Dea and Tobit Models* **
    AN EVALUATION OF ECONOMIC EFFICIENCY OF FINNISH REGIONS BY DEA AND TOBIT MODELS* ** Heikki A. Loikkanen Department of Economics FIN-00014 University of Helsinki, Finland [email protected] Ilkka Susiluoto Department of Economics FIN-00014 University of Helsinki, Finland [email protected] Abstract: In the first phase of the study, private sector economic efficiency scores for 83 Finnish regions in 1988-1999 were estimated by the Data Envelopment Analysis (DEA) method. Inputs in the DEA analysis included capital stock, employment by education level, years of schooling and volume of local public sector activity. Outputs were regional value added and personal direct real income from employment. Efficiency differences between regions proved to be considerable and they were correlated with several regional factors. In the second phase the differences in the efficiency scores were explained by using Tobit and logistic regression models. In these cross-section models (1988, 1993, 1999 and average of 1988-1999) the explanatory variables included regional characteristics such as population size, distance from national centres, structure of regional economy (concentration), the existence of university, number of students, accessibility index, innovativity index and the number of patents. * Paper prepared for the 42st Congress of the European Regional Science Association, Dortmund, Germany, 27. – 31.8. 2002 ** Preliminary version, not to be quoted without the permission of the authors. 1. Introduction The purpose of this paper is to present some results concerning economic performance of Finnish regions. More specifically, we study inter-regional differences in private sector efficiency (or productivity). Our data consists of regional input and output variables and other regional characteristics concerning 83 NUTS 4-level regions in Finland during the period 1988-1999.
    [Show full text]
  • A Comparison of Methods for Analyzing Health-Related Quality-Of
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Volume 5 • Number 4 • 2002 VALUE IN HEALTH A Comparison of Methods for Analyzing Health- Related Quality-of-Life Measures Peter C. Austin, PhD Institute for Clinical Evaluative Sciences, North York, Ontario; Department of Public Health Sciences, University of Toronto,Toronto, Canada ABSTRACT Objectives: Self-reported health status is often measured Survey. The results are compared to three models that using psychometric or utility indices that provide a score ignore the presence of a ceiling effect. intended to summarize an individual’s health. Measure- Results: The Censored LAD model produced coefficient ments of health status can be subject to a ceiling effect. estimates that tended to be shrunk toward 0, compared Frequently, researchers want to examine relationships to the other two models. The three models produced con- between determinants of health and measures of health flicting evidence on the effect of gender on health status. status. Regression methods that ignore the censoring in Similarly, the rate of decay in health status with increas- the health status measurement can produce biased coeffi- ing age differed across the three models. The Censored cient estimates. The authors examine the performance LAD model produced results very similar to median of three different models for assessing the relationship regression. Furthermore, the censored LAD model had between demographic characteristics and health status. the lowest prediction error in an independent validation Methods: Three methods that allow one to analyze data dataset. subject to a ceiling effect are compared.
    [Show full text]
  • A Simple Censored Median Regression Estimator
    Statistica Sinica 16(2006), 1043-1058 A SIMPLE CENSORED MEDIAN REGRESSION ESTIMATOR Lingzhi Zhou The Hong Kong University of Science and Technology Abstract: Ying, Jung and Wei (1995) proposed an estimation procedure for the censored median regression model that regresses the median of the survival time, or its transform, on the covariates. The procedure requires solving complicated nonlinear equations and thus can be very difficult to implement in practice, es- pecially when there are multiple covariates. Moreover, the asymptotic covariance matrix of the estimator involves the density of the errors that cannot be esti- mated reliably. In this paper, we propose a new estimator for the censored median regression model. Our estimation procedure involves solving some convex min- imization problems and can be easily implemented through linear programming (Koenker and D'Orey (1987)). In addition, a resampling method is presented for estimating the covariance matrix of the new estimator. Numerical studies indi- cate the superiority of the finite sample performance of our estimator over that in Ying, Jung and Wei (1995). Key words and phrases: Censoring, convexity, LAD, resampling. 1. Introduction The accelerated failure time (AFT) model, which relates the logarithm of the survival time to covariates, is an attractive alternative to the popular Cox (1972) proportional hazards model due to its ease of interpretation. The model assumes that the failure time T , or some monotonic transformation of it, is linearly related to the covariate vector Z 0 Ti = β0Zi + "i; i = 1; : : : ; n: (1.1) Under censoring, we only observe Yi = min(Ti; Ci), where Ci are censoring times, and Ti and Ci are independent conditional on Zi.
    [Show full text]
  • Working Paper No. 572 the Unequal Burden of Poverty on Time Use
    Working Paper No. 572 The Unequal Burden of Poverty on Time Use by Burca Kizilirmak Ankara University and Emel Memis The Levy Economics Institute and Ankara University August 2009 The authors gratefully acknowledge the Gender Equality and the Economy (GEE) program of The Levy Economics Institute for providing the data set used in the empirical analysis. They also wish to extend my sincere appreciation to Rania Antonopoulos, director of the GEE program for steering them in this research area, and for her useful comments throughout. Comments may be sent to the authors at [email protected] and [email protected]. The Levy Economics Institute Working Paper Collection presents research in progress by Levy Institute scholars and conference participants. The purpose of the series is to disseminate ideas to and elicit comments from academics and professionals. The Levy Economics Institute of Bard College, founded in 1986, is a nonprofit, nonpartisan, independently funded research organization devoted to public service. Through scholarship and economic research it generates viable, effective public policy responses to important economic problems that profoundly affect the quality of life in the United States and abroad. The Levy Economics Institute P.O. Box 5000 Annandale-on-Hudson, NY 12504-5000 http://www.levy.org Copyright © The Levy Economics Institute 2009 All rights reserved. ABSTRACT This study uses the first time-use survey carried out in South Africa (2000) to examine women’s and men’s time use, with a focus on the impacts of income poverty. We empirically explore the determinants of time spent on different paid and unpaid work activities, including a variety of household and individual characteristics, using bivariate and multivariate Tobit estimations.
    [Show full text]
  • Inflation Expectations and Choices of Households
    A Service of Leibniz-Informationszentrum econstor Wirtschaft Leibniz Information Centre Make Your Publications Visible. zbw for Economics Vellekoop, Nathanael; Wiederholt, Mirko Working Paper Inflation expectations and choices of households SAFE Working Paper, No. 250 Provided in Cooperation with: Leibniz Institute for Financial Research SAFE Suggested Citation: Vellekoop, Nathanael; Wiederholt, Mirko (2019) : Inflation expectations and choices of households, SAFE Working Paper, No. 250, Goethe University Frankfurt, SAFE - Sustainable Architecture for Finance in Europe, Frankfurt a. M., http://dx.doi.org/10.2139/ssrn.3383452 This Version is available at: http://hdl.handle.net/10419/196146 Standard-Nutzungsbedingungen: Terms of use: Die Dokumente auf EconStor dürfen zu eigenen wissenschaftlichen Documents in EconStor may be saved and copied for your Zwecken und zum Privatgebrauch gespeichert und kopiert werden. personal and scholarly purposes. Sie dürfen die Dokumente nicht für öffentliche oder kommerzielle You are not to copy documents for public or commercial Zwecke vervielfältigen, öffentlich ausstellen, öffentlich zugänglich purposes, to exhibit the documents publicly, to make them machen, vertreiben oder anderweitig nutzen. publicly available on the internet, or to distribute or otherwise use the documents in public. Sofern die Verfasser die Dokumente unter Open-Content-Lizenzen (insbesondere CC-Lizenzen) zur Verfügung gestellt haben sollten, If the documents have been made available under an Open gelten abweichend von diesen Nutzungsbedingungen
    [Show full text]
  • HEDS Discussion Paper 10/08
    HEDS Discussion Paper 10/08 Disclaimer: This is a Discussion Paper produced and published by the Health Economics and Decision Science (HEDS) Section at the School of Health and Related Research (ScHARR), University of Sheffield. HEDS Discussion Papers are intended to provide information and encourage discussion on a topic in advance of formal publication. They represent only the views of the authors, and do not necessarily reflect the views or approval of the sponsors. White Rose Repository URL for this paper: http://eprints.whiterose.ac.uk/11074/ Once a version of Discussion Paper content is published in a peer-reviewed journal, this typically supersedes the Discussion Paper and readers are invited to cite the published version in preference to the original version. Published paper Value Health. 2012 May;15(3):550-61. Epub 2012 Mar 23. White Rose Research Online [email protected] - 1 - - 2 - Tails from the Peak District: Adjusted Censored Mixture Models of EQ-5D Health State Utility Values Monica Hernandez Alava, Allan J. Wailoo, Roberta Ara Health Economics & Decision Science, School of Health and Related Research, University of Sheffield July 20l0 ABSTRACT: Health state utility data generated using the EQ-5D instrument are typically right bounded at one with a substantial gap to the next set of observations, left bounded by some negative value, and are multi modal. These features present challenges to the estimation of the effect of clinical and socioeconomic characteristics on health utilities. We present an adjusted censored model and then use this in a flexible, mixture modelling framework to address these issues.
    [Show full text]
  • HST 190: Introduction to Biostatistics
    HST 190: Introduction to Biostatistics Lecture 8: Analysis of time-to-event data 1 HST 190: Intro to Biostatistics • Survival analysis is studied on its own because survival data has features that distinguish it from other types of data. • Nonetheless, our tour of survival analysis will visit all of the topics we have learned so far: § Estimation § One-sample inference § Two-sample inference § Regression 2 HST 190: Intro to Biostatistics Survival analysis • Survival analysis is the area of statistics that deals with time-to- event data. • Despite the name, “survival” analysis isn’t only for analyzing time until death. § It deals with any situation where the quantity of interest is amount of time until study subject experiences some relevant endpoint. • The terms “failure” and “event” are used interchangeably for the endpoint of interest • For example, say researchers record time from diagnosis to death (in months) for a sample of patients diagnosed with laryngeal cancer. § How would you summarize the survival of laryngeal cancer patients? 3 HST 190: Intro to Biostatistics • A simple idea is to report the mean or median survival time § However, sample mean is not robust to outliers, meaning small increases the mean don’t show if everyone lived a little bit longer or whether a few people lived a lot longer. § The median is also just a single summary measure. • If considering your own prognosis, you’d want to know more! • Another simple idea: report survival rate at � years § How to choose cutoff time �? § Also, just a single summary measure—graphs show same 5-year rate 4 HST 190: Intro to Biostatistics • Summary measures such as means and proportions are highly useful for concisely describing data.
    [Show full text]
  • The Impact of Households' Characteristics on Food at Home
    The Impact of Households’ Characteristics on Food at Home and Food away from Home By GwanSeon Kim Department of Agricultural Economics University of Kentucky ([email protected]) Phone: 662-617-5501 Sayed Saghaian Department of Agricultural Economic University of Kentucky ([email protected]) Phone: 859-257-2356 Selected Paper prepared for presentation at the Southern Agricultural Economics Association’s 2016 Annual Meeting, San Antonio, Texas, February, 6‐9 2016 Copyright 2016 by GwanSeon Kim and Sayed Saghaian. All rights reserved. Readers may make verbatim copies of this document for non‐commercial purposes by any means, provided that this copyright notice appears on all such copies. Introduction Food, clothing, and housing are essential factors for a good life, but food is the most important component due to the fact that it is directly related to human health. According to the Food and Agriculture Organization (FAO) and International Fund for Agricultural Development (IFAD), approximately 805 million people in the world do not have enough food to eat (Food, et al., 2014). More than 200 diseases occur and up to 9,000 deaths are estimated each year in the U.S due to insufficient food (Mead, et al., 1999). Based on the United States Department of Agriculture, Food and Nutrition Service (USDA. FNS, 2008), Food and Nutrition Act of 2008 provides better levels of nutrition for low-income households from federal-state programs such as the Supplemental Nutrition Assistance Program (SNAP), also called the food stamp program. The U.S. Bureau of Labor Statistics (BLS, 2015) reports that the percentage changes on average annual food expenditure between 2013 and 2014 are -0.2% for food at home and 6.2% for food away from home, respectively.
    [Show full text]
  • Tobit Or Not Tobit?
    IZA DP No. 4588 Tobit or Not Tobit? Jay Stewart November 2009 DISCUSSION PAPER SERIES Forschungsinstitut zur Zukunft der Arbeit Institute for the Study of Labor Tobit or Not Tobit? Jay Stewart U.S. Bureau of Labor Statistics and IZA Discussion Paper No. 4588 November 2009 IZA P.O. Box 7240 53072 Bonn Germany Phone: +49-228-3894-0 Fax: +49-228-3894-180 E-mail: [email protected] Any opinions expressed here are those of the author(s) and not those of IZA. Research published in this series may include views on policy, but the institute itself takes no institutional policy positions. The Institute for the Study of Labor (IZA) in Bonn is a local and virtual international research center and a place of communication between science, politics and business. IZA is an independent nonprofit organization supported by Deutsche Post Foundation. The center is associated with the University of Bonn and offers a stimulating research environment through its international network, workshops and conferences, data service, project support, research visits and doctoral program. IZA engages in (i) original and internationally competitive research in all fields of labor economics, (ii) development of policy concepts, and (iii) dissemination of research results and concepts to the interested public. IZA Discussion Papers often represent preliminary work and are circulated to encourage discussion. Citation of such a paper should account for its provisional character. A revised version may be available directly from the author. IZA Discussion Paper No. 4588 November 2009 ABSTRACT Tobit or Not Tobit?* Time-use surveys collect very detailed information about individuals’ activities over a short period of time, typically one day.
    [Show full text]
  • Consumer Demand Analysis When Zero Consumption Occurs: the Case of Cigarettes (TB-1792)
    United States Department of WJ Agriculture Consumer Demand Analysis When Zero Technical Bulletin Number 1792 Consumption Occurs The Case of Cigarettes James R. Blaylock William N. Blisard yjfèè It's Easy To Order Another Copy! Just dial 1-800-999-6779. Toll free in the United States and Canada. Other areas, please call 1-301-725-7937. Ask for Consumer Demand Analysis When Zero Consumption Occurs: The Case of Cigarettes (TB-1792). The cost is $8.00 per copy. For non-U.S. addresses (includes Canada), add 25 percent. Charge your purchase to your VISA or MasterCard, or we can bill you. Or send a check or purchase order (made payable to ERS-NASS) to: ERS-NASS P.O. Box 1608 Rockville, MD 20849-1608. We'll fill your order by first-class mail. Consumer Demand Analysis When Zero Consumption Occurs: The Case of Cigarettes. By James R. Blaylock and William N. Blisard. Commodity Economics Division, Economic Research Service, U.S. Department of Agriculture. Technical Bulletin No. 1792. Abstract Analysts use household survey information when attempting to model the demand for certain commodities. However, many individuals do not purchase or use specific agricultural commodities during a survey period. This presents a problem for analysts because it is unclear what the zero purchase implies. It could mean that the individual never uses the product or that the survey period was too short. Or perhaps the individual would use the product if the price were lower. By focusing on a specific commodity with well-known characteristics and patterns of use, cigarettes, we are able to examine methods of treating zero observations.
    [Show full text]
  • The Tobit Model ML Estimation for the Tobit Model Summary
    Motivation: The Female Labour Supply The Married Women Labor Supply and the Tobit Model ML Estimation for the Tobit Model Summary The Tobit Model Quantitative Microeconomics R. Mora Department of Economics Universidad Carlos III de Madrid R. Mora The Tobit Model Motivation: The Female Labour Supply The Married Women Labor Supply and the Tobit Model ML Estimation for the Tobit Model Summary Outline 1 Motivation: The Female Labour Supply 2 The Married Women Labor Supply and the Tobit Model 3 ML Estimation for the Tobit Model R. Mora The Tobit Model Motivation: The Female Labour Supply The Married Women Labor Supply and the Tobit Model ML Estimation for the Tobit Model Summary The House Allocation of Time in Real Life % Leisure Personal Household Market Married Women 13 47 30 10 Married Men 14 46 16 24 Unmarried Women 14 48 21 17 Unmarried Men 15 45 21 19 Source: University of Michigan, US data R. Mora The Tobit Model Motivation: The Female Labour Supply The Married Women Labor Supply and the Tobit Model ML Estimation for the Tobit Model Summary Time Allocation among Singles Jack is less productive In the Household Market Market Jill Jack Household Household R. Mora The Tobit Model Motivation: The Female Labour Supply The Married Women Labor Supply and the Tobit Model ML Estimation for the Tobit Model Summary Economic Theories on Marriage Principal-Agent Problem: taking care of the kids and the car by people you trust Scale Economies: the marginal kid hardly increases the costs Risk-Sharing: pooling resources reduces risks none of these economic channels explain why married women work mostly in the household R.
    [Show full text]
  • Linear Regression with Type I Interval- and Left-Censored Response Data
    Environmental and Ecological Statistics 10, 221±230, 2003 Linear regression with Type I interval- and left-censored response data MARY LOU THOMPSON1,2 and KERRIE P. NELSON2 1Department of Biostatistics, Box 357232, University of Washington, Seattle, WA98195, USA 2National Research Center for Statistics and the Environment, University of Washington E-mail: [email protected] Received April2001; Revised November 2001 Laboratory analyses in a variety of contexts may result in left- and interval-censored measurements. We develop and evaluate a maximum likelihood approach to linear regression analysis in this setting and compare this approach to commonly used simple substitution methods. We explore via simulation the impact on bias and power of censoring fraction and sample size in a range of settings. The maximum likelihood approach represents only a moderate increase in power, but we show that the bias in substitution estimates may be substantial. Keywords: environmental data, laboratory analysis, maximum likelihood 1352-8505 # 2003 Kluwer Academic Publishers 1. Introduction The statisticalpractices of chemists are designed to protect against mis-identifying a sample compound and falsely reporting a detectable concentration. In environmental assessment, trace amounts of contaminants of concern are thus often reported by the laboratory as ``non-detects'' or ``trace'', in which case the data may be left- and interval- censored respectively. Samples below the ``non-detect'' threshold have contaminant levels which are regarded as being below the limits of detection and ``trace'' samples have levels that are above the limit of detection, but below the quanti®able limit. The type of censoring encountered in this setting is called Type I censoring: the censoring thresholds (``non-detect'' or ``trace'') are ®xed and the number of censored observations is random.
    [Show full text]