Piecewise Linear Regression for Leaf Appearance Rate Data

Total Page:16

File Type:pdf, Size:1020Kb

Piecewise Linear Regression for Leaf Appearance Rate Data Iowa State University Capstones, Theses and Creative Components Dissertations Spring 2021 Piecewise linear regression for leaf appearance rate data Lin Quan Follow this and additional works at: https://lib.dr.iastate.edu/creativecomponents Part of the Biostatistics Commons Recommended Citation Quan, Lin, "Piecewise linear regression for leaf appearance rate data" (2021). Creative Components. 786. https://lib.dr.iastate.edu/creativecomponents/786 This Creative Component is brought to you for free and open access by the Iowa State University Capstones, Theses and Dissertations at Iowa State University Digital Repository. It has been accepted for inclusion in Creative Components by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. Piecewise linear regression for leaf appearance rate data by Lin Quan A Creative Component submitted to the graduate faculty in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE Major: Statistics Program of Study Committee: Dan Nettleton, Major Professor Lily Wang Peng Liu The student author, whose presentation of the scholarship herein was approved by the program of study committee, is solely responsible for the content of this creative component. The Graduate College will ensure this creative component is globally accessible and will not permit alterations after a degree is conferred. Iowa State University Ames, Iowa 2021 Copyright © Lin Quan, 2021. All rights reserved. ii TABLE OF CONTENTS Page LIST OF TABLES . iii LIST OF FIGURES . iv ABSTRACT . vi CHAPTER 1. INTRODUCTION . .1 CHAPTER 2. DATASET AND METHODS . .4 2.1 Dataset . .4 2.2 Methods . .4 2.2.1 Package segmented ................................4 2.2.2 Least-squares . .6 2.2.3 Bootstrap Resampling . .7 CHAPTER 3. RESULTS AND DISCUSSION . 10 3.1 Analysis of the first 396 genotypes . 10 3.2 Analysis of the last 99 genotypes . 15 CHAPTER 4. SUMMARY . 24 CHAPTER 5. ACKNOWLEDGEMENT . 25 BIBLIOGRAPHY . 26 iii LIST OF TABLES Page 3.1 ANOVA table for difference between slopes in two segments for each geno- type ........................................ 21 3.2 ANOVA table for estimated breakpoints for each genotype . 21 iv LIST OF FIGURES Page 3.1 (a) Histogram of p-values of genotypes with one breakpoint. (b) Histogram of p-values of genotypes with two breakpoints. 11 3.2 Plot of genotype C103. The black points are the raw data. The green line is a linear regression model. The blue line is a one breakpoint piecewise linear model, the blue triangle is a breakpoint. The red line is a two breakpoints piecewise linear model, the red circles are the breakpoints. 12 3.3 (a) Plot of comparing the estimators calculated by segmented and least- squares. (b) Plot of comparing the p-value calculated by segmented and residual bootstrap. The red line represents y=x. 13 3.4 (a) Studentized residuals. (b) Histogram of residuals. 14 3.5 (a) Plot of comparing the p-value calculated by segmented and wild boot- strap (Mammen's distribution). (b) Plot of comparing the p-value calculated by residual bootstrap and wild bootstrap (Mammen's distribution). (c) Plot of comparing the p-value calculated by wild bootstrap with the Rademacher distribution and Mammen's distribution. (d) Plot of comparing the p-value calculated by wild bootstrap with the standard normal distribution and Mammen's distribution. The red line represents y=x. 16 3.6 The estimated breakpoints and confidence intervals. (a) The results calcu- lated by segmented. (b) The results calculated by least-squares and boot- strap. Black points represent estimated breakpoints while red lines represent confidence intervals. 17 3.7 Plot of the width of CI against the p-value. (a) segmented method. (b) bootstrap method. 18 3.8 The estimated breakpoints and confidence intervals calculated by segmented. (a) genotypes in block1. (b) genotypes in block2. Black points represent estimated breakpoints while red lines represent confidence intervals. 19 3.9 The estimated breakpoints and confidence intervals calculated by bootstrap. (a) genotypes in block1. (b) genotypes in block2. Black points represent estimated breakpoints while red lines represent confidence intervals. 20 v 3.10 Histogram of residuals and normal Q-Q plot. (a), (b) correspond to differ- ence between slopes in two segments for each genotype. (c), (d) correspond to estimated breakpoints for each genotype. 22 3.11 (a) Plot of residuals versus estimated breakpoints. (b) Plot of residuals versus estimated difference between slopes in two segments. The red line represents y=0. 23 vi ABSTRACT Segmented regression models are generalization of linear and generalized linear models that replace a linear predictor with a piecewise linear predictor. Breakpoints where the piecewise linear predictor changes slope are unknown and estimated from data. We use segmented regression to model the relationship between the number of plant leaves and thermal time for hundreds of maize genotypes. Slope estimates from fitted segmented regression models provide estimates of leaf appearance rate (LAR) for each genotype. Estimates of breakpoints provide insight into developmental time points when changes in LAR occur for each genotype. We compare inferences about slopes and breakpoints obtained from the segmented R package with inferences obtained by bootstrap techniques. Furthermore, we use the estimated difference between slopes in two segments for each genotype and the estimated single breakpoint for each genotype as LAR characteristics of interest. We then use each of these characteristics as a response variable in a linear model to test for genotype effects. 1 CHAPTER 1. INTRODUCTION As an essential crop, maize plays an important role in small farm sustainability (Birch et al., 1998). Models for estimating maize developmental stages can be used to assist growers in management practices, such as choosing the best sowing date, and as an aid for selecting cultivars better adapted to different regions (Alberto et al., 2009). One important plant development parameter is the leaf appearance rate (LAR), which is the number of visible leaf collars per day (Xue et al., 2004). The LAR is used to obtain models of growth and yield of agricultural crops (Streck, 2003). It is an important parameter in the production efficiency of agricultural produce and has been used in ecophysiological studies in plants. Temperature is a major factor driving leaf appearance in maize (Hesketh and Warrington, 1989; White, 2001). One approach to predict individual leaf appearance is the phyllochron, defined as the time interval between the appearances of successive leaves (Klepper et al., 1982; Kirby, 1995). Time is often expressed as thermal time (TT), with units of oC day. The calculation of LAR is an essential part of many crop simulation models (Hodges, 1991), including the maize model (L´opez-Cedr´onet al., 2005). A segmented model is a regression model where the relationship between the response and one or more explanatory variables is piecewise linear, namely represented by two or more straight line segments connected to unknown points: these points are usually referred to as breakpoints, change points or even join points. Segmented regression is a useful tool for assessing the effectiveness of a quantitative covariate X on the response Y when there exists a value within the range Rx of the covariate where the effect of X changes. Piecewise linear regression models are widely applied in many fields, including climatology, environmental epidemiology, occupational medicine, toxicology, and ecology (Shao and Campbell, 2002; Toms and Lesperance, 2003; Betts et al., 2007; Muggeo, 2008a; Rhodes et al., 2008). In applications, it is common to see the effect of some key factors on the response change before and after some threshold value. For instance, in 2 mortality studies, the relationship between death and temperature is V-shaped, so it may be of interest to estimate the optimal temperature where the mortality reaches its minimum (Kunst et al., 1993). There is evidence that the risk of preterm delivery depends on the mother's stress only when it is above a specific threshold (Whitehead, 2002). Likewise, fertility patterns for cohabitant women in Italy have been modeled by exploiting segmented relationships of the hazard with respect to time since the start of cohabitation (Muggeo et al., 2009). In the literature, several methods have been employed in regression models with unknown breakpoints. Some methods separately estimate the breakpoints. If the breakpoint is fixed, the models are usually linear, without any problems of estimation and inference. The breakpoint can be estimated by a simple inspection of the scatter smooth plot (Kunst et al., 1993; Vermont et al., 1991) or by specific algorithms, e.g., exact- or grid-search-type algorithms (Stasinopoulos and Rigby, 1992; Ulm, 1991; Ertel and Fowlkes, 1976; Hawkins, 1976). Some other methods approximate the segmented relationship through a continuous differentiable function on the overall range of the explanatory variable (Tishler and Zang, 1981a; Muggeo, 2003; Bacon and Watts, 1971; Griffiths and Miller, 1973) or in a neighborhood of the unknown breakpoint (Tishler and Zang, 1981b). In addition, regression splines are used to estimate breakpoints (knots in spline terminology) by Molinari et al.(2001). Bayesian MCMC methods are an alternative for the same purpose. In a Bayesian approach, no differentiability assumption is needed, but on the other hand, the computational effort might
Recommended publications
  • How to Assess the Impact of Quality and Patient Safety Interventions with Routinely Collected Longitudinal Data
    How to Assess the Impact of Quality and Patient Safety Interventions with Routinely Collected Longitudinal Data Diego A. Martinez, PhD, Assistant Professor of Emergency Medicine, Johns Hopkins University School of Medicine, Baltimore, MD Mehdi Jalalpour, PhD, Assistant Professor of Civil and Environmental Engineering, Cleveland State University, Cleveland, Ohio, USA. David T. Efron, MD, FACS, Professor of Surgery, Johns Hopkins University School of Medicine, Baltimore, MD Scott R. Levin, PhD, Associate Professor of Emergency Medicine, Johns Hopkins University School of Medicine, Baltimore, MD ABSTRACT Objectives: Measuring the effect of patient safety improvement efforts is needed to determine their value but is difficult due to the inherent complexities of hospital operations. In this paper, we show by case study how interrupted time series design can be used to isolate and measure the impact of interventions while accounting for confounders often present in complex health delivery systems. Methods: We searched for time-stamped data from electronic medical records and operating room information systems associated with perioperative patient flow in a large, urban, academic hospital in Baltimore, Maryland. We limited the searched to those adult cases performed between January 2015 and March 2017. We used segmented regression and Box-Jenkins methods to measure the effect of perioperative throughput improvement efforts and account for the loss of high volume surgeons, surgical volume, and occupancy. Results: We identified a significant decline of operating room exit delays of about 50%, achieved in 6 months and sustained over 14 months. Conclusions: By longitudinal assessment of intervention effects, rather than cross-sectional comparison, our measurement tool estimated and provided inferences of change-points over time while taking into account the magnitude of other latent systems factors.
    [Show full text]
  • Process Control Charts and ITS Analysis of Epworth MEER Trial
    Supplementary material BMJ Open Qual Process control charts and ITS analysis of Epworth MEER trial Contents Introduction 2 References............................................. 2 Load R packages 3 Useful functions 3 Import .csv data file for standard 5 incident counts 4 Read in the standard 5 data and add field for incident rate: . 4 Display the data . 5 Summarise the data . 7 View the data 8 Plot patient activity levels . 8 Plot RiskMan incident rates for standard 5 . 9 Probablity distributions for the culpcount data 10 Poisson distribution . 10 Negative binomial distribution . 11 CHART AND ANALYSIS FUNCTIONS 12 Process control chart . 12 Segmented regression model function . 13 Segmented regression full analysis function . 13 ANALYSIS & FIGURES 15 ED+4G vs Other . 15 glm analyses . 15 Residual & autocorrelation plots for negative binomial analysis . 17 Generate u-charts and ITS plot . 19 1 Curtin AG, et al. BMJ Open Qual 2020; 9:e000741. doi: 10.1136/bmjoq-2019-000741 Supplementary material BMJ Open Qual Introduction This notebook file provides R code and results for process control charts (u-charts) and Interrupted Time Series (ITS) analyses of an intervention trial at the Epworth Richmond hospital in Melbourne which was applied to two units at the Epworth hospital in Richmond, 4 Gray (4G) and Emergency (ED), from Jan 2018 to Oct 2018. The intervention was the application of a quality improvement approach called MEER. The effectiveness of the MEER intervention was gauged by its influence on reported adverse adverse incidents at the Epworth Richmond hospital as recorded by an electronic reporting system called RiskMan. The RiskMan incidents analysed in this study can be categorised according to which of five National Safety and Quality Health Standards (NSQHS) they pertain to.
    [Show full text]
  • Original Research Article Analysing Interrupted Time Series with A
    Original Research Article Analysing interrupted time series with a control Authors Bottomley C 1,2* , Scott JAG 2,3 , Isham V 4 Institutions 1. MRC Tropical Epidemiology Group, London School of Hygiene & Tropical Medicine, London, UK 2. Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK 3. KEMRI-Wellcome Trust Research Programme, Kilifi, Kenya 4. Department of Statistical Science, University College London, London, UK * Corresponding Author: [email protected] London School of Hygiene & Tropical Medicine, Keppel Street, London, UK Tel: +44 207 927 2533 1 Abstract Interrupted time series are increasingly being used to evaluate the population-wide implementation of public health interventions. However, the resulting estimates of intervention impact can be severely biased if underlying disease trends are not adequately accounted for. Control series offer a potential solution to this problem, but there is little guidance on how to use them to produce trend-adjusted estimates. To address this lack of guidance, we show how interrupted time series can be analysed when the control and intervention series share confounders, i.e., when they share a common trend. We show that the intervention effect can be estimated by subtracting the control series from the intervention series and analysing the difference using linear regression or, if a log-linear model is assumed, by including the control series as an offset in a Poisson regression with robust standard errors. The methods are illustrated with two examples. Key words: interrupted time series, segmented regression, common trend model 2 Introduction Interrupted time series (ITS) analysis is an increasingly popular method for evaluating public health interventions [1].
    [Show full text]
  • An Exact Algorithm for Estimating Breakpoints in Segmented Generalized Linear Models
    Küchenhoff: An exact algorithm for estimating breakpoints in segmented generalized linear models Sonderforschungsbereich 386, Paper 27 (1996) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner An exact algorithm for estimating breakp oints in segmented generalized linear mo dels Helmut Kuc henho University of Munich Institute of Statistics Akademiestrasse D Munc hen Summary We consider the problem of estimating the unknown breakp oints in seg mented generalized linear mo dels Exact algorithms for calculating maxi mum likeliho o d estimators are derived for dierenttyp es of mo dels After discussing the case of a GLM with a single covariate having one breakp oint a new algorithm is presented when further covariates are included in the mo del The essential idea of this approach is then used for the case of more than one breakp oint As further extension an algorithm for the situation of two regressors eachhaving a breakp oint is prop osed These techniques are applied for analysing the data of the Munichrental table It can b e seen that these algorithms are easy to handle without to o much computational eort The algorithms are available as GAUSSprograms Keywords Breakp oint generalized linear mo del segmented regression Intro duction In many practical regressiontyp e problems we cannot t one uniform re gression function to the data since the functional relationship b etween the resp onse Y and the regressor X changes at certain p oints of the domain of X These p oints are usually called breakp oints or changep oints One imp
    [Show full text]
  • Fitting Segmented Regression Curves
    University of Montana ScholarWorks at University of Montana Graduate Student Theses, Dissertations, & Professional Papers Graduate School 1968 Fitting segmented regression curves Kenneth P. Johnson The University of Montana Follow this and additional works at: https://scholarworks.umt.edu/etd Let us know how access to this document benefits ou.y Recommended Citation Johnson, Kenneth P., "Fitting segmented regression curves" (1968). Graduate Student Theses, Dissertations, & Professional Papers. 8324. https://scholarworks.umt.edu/etd/8324 This Thesis is brought to you for free and open access by the Graduate School at ScholarWorks at University of Montana. It has been accepted for inclusion in Graduate Student Theses, Dissertations, & Professional Papers by an authorized administrator of ScholarWorks at University of Montana. For more information, please contact [email protected]. FITTING SEGMENTED REGRESSION CURVES By Kenneth P. Johnson B.A., University of Montana, l965 Presented in partial fulfillment of the requirements for the degree of Master of Arts UNIVERSITY OF MONTANA 1968 Approved by: Chairman, Board of Examiners /C'..,,_^ De ^ , Graduate' School Date Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. UMI Number: EP39125 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. UMT DMSsdation PublisNng UMI EP39125 Published by ProQuest LLC (2013). Copyright in the Dissertation held by the Author. Microform Edition © ProQuest LLC.
    [Show full text]
  • Parameter Estimation in Linear-Linear Segmented Regression
    Brigham Young University BYU ScholarsArchive Theses and Dissertations 2010-04-20 Parameter Estimation in Linear-Linear Segmented Regression Erika Lyn Hernandez Brigham Young University - Provo Follow this and additional works at: https://scholarsarchive.byu.edu/etd Part of the Statistics and Probability Commons BYU ScholarsArchive Citation Hernandez, Erika Lyn, "Parameter Estimation in Linear-Linear Segmented Regression" (2010). Theses and Dissertations. 2113. https://scholarsarchive.byu.edu/etd/2113 This Selected Project is brought to you for free and open access by BYU ScholarsArchive. It has been accepted for inclusion in Theses and Dissertations by an authorized administrator of BYU ScholarsArchive. For more information, please contact [email protected], [email protected]. Parameter Estimation in Linear-Linear Segmented Regression Erika L. Hernandez A project submitted to the faculty of Brigham Young University in partial fulfillment of the requirements for the degree of Master of Science C. Shane Reese, Chair Scott D. Grimshaw Gilbert W. Fellingham Department of Statistics Brigham Young University August 2010 Copyright c 2010 Erika L. Hernandez All Rights Reserved ABSTRACT Parameter Estimation in Linear-Linear Segmented Regression Erika L. Hernandez Department of Statistics Master of Science Segmented regression is a type of nonlinear regression that allows differing functional forms to be fit over different ranges of the explanatory variable. This paper considers the simple segmented regression case of two linear segments that are constrained to meet, often called the linear-linear model. Parameter estimation in the case where the joinpoint between the regimes is unknown can be tricky. Using a simulation study, four estimators for the parameters of the linear-linear model are evaluated.
    [Show full text]
  • Water Quality Trend and Change-Point Analyses Using Integration of Locally Weighted Polynomial Regression and Segmented Regression
    Environ Sci Pollut Res DOI 10.1007/s11356-017-9188-x RESEARCH ARTICLE Water quality trend and change-point analyses using integration of locally weighted polynomial regression and segmented regression Hong Huang1,2 & Zhenfeng Wang1,2 & Fang Xia1,2 & Xu Shang1,2 & YuanYuan Liu 3 & Minghua Zhang1,2 & Randy A. Dahlgren1,2,4 & Kun Mei1,2 Received: 11 January 2017 /Accepted: 2 May 2017 # Springer-Verlag Berlin Heidelberg 2017 Abstract Trend and change-point analyses of water quality The reliability was verified by statistical tests and prac- time series data have important implications for pollution con- tical considerations for Shanxi Reservoir watershed. The trol and environmental decision-making. This paper devel- newly developed integrated LWPR-SegReg approach is oped a new approach to assess trends and change-points of not only limited to the assessment of trends and change-points water quality parameters by integrating locally weighted poly- of water quality parameters but also has a broad application to nomial regression (LWPR) and segmented regression other fields with long-term time series records. (SegReg). Firstly, LWPR was used to pretreat the original water quality data into a smoothed time series to represent Keywords Water quality . Long-term trend assessment . the long-term trend of water quality. Then, SegReg was used Change-point analysis . Locally weighted polynomial to identify the long-term trends and change-points of the regression . Segmented regression smoothed time series. Finally, statistical tests were applied to determine the significance of the long-term trends and change- points. The efficacy of this approach was validated using a 10- Introduction year record of total nitrogen (TN) and chemical oxygen de- mand (CODMn) from Shanxi Reservoir watershed in eastern Many countries and regions in the world suffer from chronic China.
    [Show full text]
  • Fast Algorithms for Segmented Regression
    Fast Algorithms for Segmented Regression Jayadev Acharya Ilias Diakonikolas MIT University of Southern California [email protected] [email protected] Jerry Li Ludwig Schmidt MIT MIT [email protected] [email protected] May 25, 2016 Abstract We study the fixed design segmented regression problem: Given noisy samples from a piece- wise linear function f, we want to recover f up to a desired accuracy in mean-squared error. Previous rigorous approaches for this problem rely on dynamic programming (DP) and, while sample efficient, have running time quadratic in the sample size. As our main contribution, we provide new sample near-linear time algorithms for the problem that – while not being minimax optimal – achieve a significantly better sample-time tradeoff on large datasets compared to the DP approach. Our experimental evaluation shows that, compared with the DP approach, our algorithms provide a convergence rate that is only off by a factor of 2 to 4, while achieving speedups of three orders of magnitude. 1 Introduction We study the regression problem – a fundamental inference task with numerous applications that has received tremendous attention in machine learning and statistics during the past fifty years (see, e.g., [MT77] for a classical textbook). Roughly speaking, in a (fixed design) regression problem, we are given a set of n observations (xi; yi), where the yi’s are the dependent variables and the xi’s are the independent variables, and our goal is to model the relationship between them. The typical assumptions are that (i) there exists a simple function f that (approximately) models the underlying relation, and (ii) the dependent observations are corrupted by random noise.
    [Show full text]
  • Segmented Regression Analysis of Interrupted Time Series Studies in Medication Use Research
    Journal of Clinical Pharmacy and Therapeutics (2002) 27, 299–309 RESEARCH NOTE Segmented regression analysis of interrupted time series studies in medication use research A. K. Wagner* PharmD, MPH,S.B.Soumerai*ScD, F. Zhang* MS and D. Ross-Degnan* ScD *Department of Ambulatory Care and Prevention, Harvard Medical School and Harvard Pilgrim Health Care and Department of Epidemiology, Harvard School of Public Health, Boston, MA, USA ineffective (3) or a drug with unfavourable side- SUMMARY effects can be withdrawn from the market (4). Interrupted time series design is the strongest, Interrupted time series (5, 6) is the strongest, quasi-experimental approach for evaluating lon- quasi-experimental design to evaluate longitudinal gitudinal effects of interventions. Segmented effects of such time-delimited interventions. Seg- regression analysis is a powerful statistical mented regression analysis of interrupted time method for estimating intervention effects in series data allows us to assess, in statistical terms, interrupted time series studies. In this paper, we how much an intervention changed an outcome of show how segmented regression analysis can be interest, immediately and over time; instantly or used to evaluate policy and educational inter- with delay; transiently or long-term; and whether ventions intended to improve the quality of factors other than the intervention could explain medication use and ⁄ or contain costs. the change. Segmented regression analysis is appropriate for Keywords: health policy evaluation, interrupted studying effects of interventions conducted as part time series design, longitudinal analysis, medi- of a randomized trial as well as interventions that cation use research, quasi-experimental design, constitute a natural experiment.
    [Show full text]
  • Package 'Segmented'
    Package ‘segmented’ April 22, 2021 Type Package Title Regression Models with Break-Points / Change-Points Estimation Version 1.3-4 Date 2021-04-28 Maintainer Vito M. R. Muggeo <[email protected]> Description Given a regression model, segmented `updates' it by adding one or more segmented (i.e., piece- wise linear) relationships. Several variables with multiple breakpoints are allowed. The estimation method is dis- cussed in Muggeo (2003, <doi:10.1002/sim.1545>) and illustrated in Muggeo (2008, <https://www.r-project.org/doc/Rnews/Rnews_2008- 1.pdf>). An approach for hypothesis testing is presented in Muggeo (2016, <doi:10.1080/00949655.2016.1149855>), and interval estima- tion for the breakpoint is discussed in Muggeo (2017, <doi:10.1111/anzs.12200>). License GPL NeedsCompilation no Author Vito M. R. Muggeo [aut, cre] (<https://orcid.org/0000-0002-3386-4054>) Repository CRAN Date/Publication 2021-04-22 14:00:02 UTC R topics documented: segmented-package . .2 aapc.............................................4 broken.line . .5 confint.segmented . .7 davies.test . .9 down............................................. 11 draw.history . 12 intercept . 13 lines.segmented . 14 1 2 segmented-package plant . 15 plot.segmented . 16 points.segmented . 19 predict.segmented . 20 print.segmented . 21 pscore.test . 22 seg.control . 24 seg.lm.fit . 27 segmented . 29 selgmented . 35 slope . 37 stagnant . 38 summary.segmented . 39 vcov.segmented . 41 Index 43 segmented-package Segmented relationships in regression models with breakpoints / changepoints estimation Description Estimation and Inference of Regression Models with piecewise linear relationships having a fixed number of break-points. The estimation method is described in Muggeo (2003) <doi: 10.1002/sim.1545>.
    [Show full text]
  • EMPIRICAL LIKELIHOOD METHOD for SEGMENTED LINEAR REGRESSION by Zhihua Liu a Dissertation Submitted to the Faculty of the Charles
    EMPIRICAL LIKELIHOOD METHOD FOR SEGMENTED LINEAR REGRESSION by Zhihua Liu A Dissertation Submitted to the Faculty of The Charles E. Schmidt College of Science in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy Florida Atlantic University Boca Raton, FL December 2011 Copyright by Zhihua Liu 2011 ii ACKNOWLEDGEMENTS First of all, I have to confess that the thought of writing this dissertation was intimidating to me. As much as I wish to express my appreciation to all the people who have been there for me to make this dissertation possible, I know I can never include everyone, and not adequately express my immense gratitude to them in simple words. I would like to express my deepest gratitude to all my committee members, Dr. Lianfen Qian, Dr. Hongwei Long, Dr. Heinrich Niederhausen and Dr. Dragan Radulovic. I appreciate their time, interest, and valuable comments concerning my thesis. For the last few years, I had the opportunity and privilege to work with Dr. Lianfen Qian. She has given me ideas and suggestions that enlightened my under- standing of this research and gave me a better perspective on my own work. I would also want to use this opportunity to thank many faculty members and my colleagues for all their inspiration and encouragement. I am also fortunate to be surrounded by sweet people from PenServ and ERISA Pension Systems. Without their support, I would never have completed this disser- tation. In the course of writing this dissertation, I am running between a full-time job and actuarial exams. Most of the time, I wish I could have 40 hours a day.
    [Show full text]
  • New Robust Estimator of Change Point in Segmented Regression Model for Bed-Load of Rivers
    JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES www.journalimcms.org J. Mech. Cont.& Math. Sci., Vol.-14, No.-6 November-December (2019) pp 384-402 ISSN (Online) : 2454 -7190 Vol.-14, No.-6, November - December (2019) 384-402 ISSN (Print) 0973-8975 NEW ROBUST ESTIMATOR OF CHANGE POINT IN SEGMENTED REGRESSION MODEL FOR BED-LOAD OF RIVERS Omar Abdulmohsin Ali1, Mohammed Ahmed Abbas2 1Department of Statistics, University of Baghdad, Iraq 2Department of Statistics, University of Baghdad, Iraq [email protected], [email protected] Correspondening Author : Omar Abdulmohsin Ali; https://doi.org/10.26782/jmcms.2019.12.00027 Abstract Segmentation has vital employment in regression analysis where data have some change point. Traditional estimation methods such as Hudson, D.J.;(1966) and Muggeo, V. M., (2003)have been reviewed. But these methods do not take into account robustness in the presence of outliers values. However, third method was used as rank-based method, where the analysis will be devoted to the ranks of data instead of the data themselves. Our contribution in this paper is to use M-estimator methodology with three distinct weight functions (Huber, Tukey, and Hampel) which has been combined with Muggeo version approach to gain more robustness, Thus we get robust estimates from the change point and regression parameters simultaneously. We call our new estimator as robust Iterative Rewrighted M- estimator: IRWm-method with respect to its own weight function. Our primary interest is to estimate the change point that joins the segments of regression curve, and our secondary interest is to estimate the parameters of segmented regression model.
    [Show full text]