Variance Targeting for Heavy Tailed Time Series October 2011

Total Page:16

File Type:pdf, Size:1020Kb

Variance Targeting for Heavy Tailed Time Series October 2011 Variance Targeting for Heavy Tailed Time Series June 2012 Conference “Advances in Econometrics”, Yale SOM Jonathan Hill* Eric Renault** *University of North Carolina at Chapel Hill ** Brown University OUTLINE 1. Variance targeting in GARCH models 2. Sample variance with tail trimming 3. QMLE with (tail-trimmed) variance targeting = solution to infinite unconditional kurtosis problem 4. Trimming orthogonality conditions for GMM = solution to infinite conditional kurtosis problem 5. Monte Carlo study 1. Variance Targeting in GARCH Models • Example 1: GARCH(1,1) 1/ 2 yt1 ht1 t1 2 ht1 (yt1) ht , 0, 0, 1 Var(y ) t 1 new parameterization : ( , ), (,) Variance Targeting: • Idea = direct estimation of unconditional variance common practice: 1 T ˆ 2 (i) = estimated by sample variance: T yt T t1 (ii) (Q)MLE = applied to remaining parameters after plugging in sample variance: 2 2 ht1 yt1 ht yt1 (1 )ht 2 2 2 ˆt1,T ( ) ˆT yt1 (1 )ˆt,T ( ), (,). Cost-Benefit of variance targeting • Benefits: (i) Better finite sample performance (especially for estimation of ). (ii) Robustness to misspecification of variance equation. • Costs: (i) Efficiency loss at least if QMLE = MLE (ii) For asymptotic normality, requires a much more restrictive assumption than QMLE 4 Not only E t1 (and Eht1 ) 4 2 4 But also Eyt1 Eht1 t1 Var(ht1) Does unconditional kurtosis exist? • Existence of the unconditional fourth moment of stochastic process generating financial return data maintained interest for researchers: • He and Terasvirta (1999, ET)“Fourth Moment Structure of the GARCH(p,q) Process” R1. Existence “would enable one to see how well the kurtosis and autocorrelation (of squared returns) implied by the estimated model match the estimates obtained directly from the data” R2. Existence = far from being certain in case of volatility persistence GARCH (1,1) case: (see He and Terasvirta for general GARCH(p,q)) 1/ 2 yt1 ht1 t1 2 4 Et ( t1) 0, Et ( t1) 1, Et ( t1) 4 , 2 ht1 (yt1) ht . 4 2 2 E(yt1) 4 2 1. Conditionally normal case: 4 2 2 E(yt1) 2 ( ) 1. Pareto Tail Index • Basrak, Davis and Mikosch, SPA, 2002 “Regular variation of GARCH processes” 1/ 2 yt1 ht1 t1 2 a ht1 (yt1) ht , 0, 0, 2 / 2 P yt c c ,c Et 1 p 1 2, Eyt p 1 Var(y ) t 1 R3. Hill’s tail index estimator for daily log- returns over the period 2001-2011: 90% confidence band: SP 500 and NASDAQ : [2,3] DAX: [2, 3.5] NIKKEI : [1.8, 2.8] Conclusion: (i) Variance should be finite ( not compelling for Nikkei!) (ii)(Unconditional) moment of order 3 may exist (iii)Hard to find a series that appears to have finite unconditional moment of order 4 Hill Plots for daily log-returns on SP500 (2513 days from Jan ,1st, 2001 to Jan 1st, 2011) Multivariate examples: • Example 1: DCC-GARCH(1,1): Univariate GARCH for each asset + dynamic conditional correlations: yi,t i,t 1/ 2 conditional correlation matrix : (hii,t ) Qt conditional variance matrix of t ( i,t )1in correlation targeting in the DCC equation : Qt (1 )Q t t 'Qt1 no additional problem under 4 maintained assumption E i,t ,i. Variance targeting and parsimony • DCC example Once the unconditional individual variances and correlation matrix is estimated : only individual GARCH + two more parameters and saves N = n(n+1)/2 parameters. • Example 2: VEC-GARCH(1,1) 1/ 2 yt Ht t ,ht vech(Ht ), t vech( t t ') h c A Gh , t t1 t1 vech Var(y ) [Id A G]1c. t N Variance targeting still requires unconditional finite (matricial) kurtosis 2. Sample variance with tail trimming • Univariate case: 1 T ˆ y 2 T 0 Var(y ) T t t T t1 0 4 BUT : T ˆT asymptotically normal if E(yt ) . Key idea: Tail-Trimming of returns before computing sample variance Iˆ( y) 0 y is one of the k ( y) largest T ,t t T observations among y , 1,...,T. 1 T ˆ( y) ˆ(tr) 2 ˆ( y) IT ,t 1 otherwise, T yt IT ,t T t1 ( y) kT promotes Gaussian asymptotics ( y) (tr) 0 kT /T 0 ensures unbiasedness : ˆT Var(yt ) Asymptotic distribution of estimator of unconditional variance in GARCH: 1st case: Without trimming (with finite kurtosis): Horvath, Kokoszka,Zitikis, JFEC,2006: 1 T 1 T 1 T ˆ 2 2 T yt ht ( t 1) ht T t1 T t1 T t1 0 T 0 1 1 2 0 ht ( t 1) oP (1/ T ) T t1 2 T 1 0 v2 Var Tˆ Var y 2 T E h2 Var 2 T T t 0 t t t1 2nd case: With trimming: T 2 ˆ(tr) 2 ( y) vT VarT T Var yt I yt cT t1 2 0 2 vT 1 2 2 4 Lim E h Var if E y T 0 t t t T v2 Lim T if Ey 4 , BUT v2 o(T 2 ) T T t T Same asymptotic distribution if finite kurtosis More involved in the general case because, by trimming, we lose the mds property: Pareto tails: ( y) 2 ( y) 2 kT ( y) Pyt cT d(cT ) ,2 4 T 1/ ( y) ( y) 1/ T T kT cT (d) ( y) as 0 kT T Feller (1971) (regularly varying functions): 4 ˆ( y) ( y) 4 ( y) Eyt IT ,t (cT ) LcT , L(x) o(x ), 0 Corollary 1: ( y) 4 ˆ( y) 2 and kT Eyt IT ,t o(T) Corollary 2 (with geometric -mixing): Long term variance matrix = o(T) T 2 2 ˆ( y) 2 vT Var yt IT ,t o(T ),vT o(T). t1 4 BUT : E(yt ) vT / T 4 2 4 E(yt ) vT / T bE( t ) 1 2 1 4 b E(ht ) 1 R. Geometric -mixing for y = implied by +<1 when has an absolutely continuous distribution Asymptotic distribution of tail-trimmed estimators of unconditional variance: Long-term variance matrix = o(T) Promotes asymptotic normality of tail trimmed sample variance: T (tr) 0 d ˆT (0,1) vT ,t T 2 ˆ( y) 0 insofar as : Eyt IT 0. vT, t 2 ˆ( y) 0 Always true if : T Eyt IT 0. Lighter trimming for heavier tail (feasible from estimation of tail index) 1st improvement of tail-trimmed sample variance • Peng (2001) : “Estimating the mean of a heavy tailed distribution”, Statistics & Probability Letters Characterizes the systematic finite sample bias due to trimming: 1 T ˆ(tr) 2 ˆ( y) T yt IT ,t biased by : T t1 ( y) 2 ˆ( y) kT ( y) 2 Eyt 1 IT ,t cT 2 T k ( y) P y c( y) T t T T This bias can be( y )estimated by: ˆ 2 ˆ * kT (a) R y ( y ) T k ˆ 2 T T ( y) ˆ Hill estimator based on kT largest absolute values. 1 k ( y ) 1 (a) 1 T y ˆ i ( y) Log (a) k y ( y ) T i1 k T (a) ( y) y ( y ) component kT in order statistics of absolute values : kT (a) (a) (a) y1 y2 .... yT . Improved estimator of conditional variance based on Peng (2001): * (tr) ˆ * ˆT ˆT RT Pros and cons of Peng’s improvement • Pro 1: Avoids a systematic finite sample under- estimation of unconditional variance • Pro 2: Provides asymptotic normality without the problematic maintained assumption: T 2 ˆ( y) 0 Eyt IT 0. vT Con: The correction term is not asymptotically independent from the main term * (tr) ˆ * ˆT ˆT RT (tr) asymptotic distribution of ˆT We propose a new estimator: by under-correction for the gap, we obtain an asymptotically normal estimator with the same distribution limit as the initial one: ~( y) ˆ 2 ˆ * kT (a) R y~( y ) T k ˆ 2 T T ~( y) ( y) (a) (a) T k / k y~( y ) / y ( y ) 0 T T k k T T ( y) ˆ Hill estimator based on even more kT largest absolute values. Price to pay for this simpler bias- corrected estimator: Need to maintain the assumption: T 2 ˆ( y) ~( y) Eyt IT IT 0. vT we will work under this maintained assumption , or No-bias correction + the corresponding stronger assumption: T 2 ˆ( y) 0 Eyt IT 0. vT 3. QMLE with (tail-trimmed) variance targeting 1/ 2 2 yt1 ht1 t1,ht1 (yt1) ht . 1 , /, (,)', ( ,')' T 1/ 2 0 d T ˆT (0, Id3 ) vT NW of T 1 1 SW of T J K E( 4 ) 1 SE of t J 1 J 1KK' J 1 T 2 vT /T 1 ht ht 1 ht ht J E 2 , K E 2 ht ' ht R1. Trimming has no impact on asymptotic distribution of QML-VT when finite kurtosis (Francq,Horvath,Zakoian,JFEC, 2011) R2. In case of infinite kurtosis, the asymptotic variance of GARCH parameters (α,) (SE block) is smaller: 1 1 * T 1 1 K' J J K R3. Price to pay for variance targeting = now in terms of rate of convergence BUT Two directions in the parameter space with root-T rate of convergence: 1 2 ' J K * , () ()' () 0 T 0 ()'ˆT 0 vT 0 4 1 T ()'ˆT 0,E( t ) 1' J 1 (1,0)' ()' J K 1 1 1 (1,1)' ()' J K 1 J K 2 1 4. Trimming orthogonality conditions for GMM QML score (with re-parameterization for variance targeting): y2 1 h () 2 t t et () 1st () 1 ht () ht () 4 May need tail-trimming if E(t ) since then Gaussian QML ( Hall and Yao, 2003) is not asymptotically normal and has a slow rate of convergence R1.
Recommended publications
  • Autoregressive Conditional Kurtosis
    Autoregressive conditional kurtosis Article Accepted Version Brooks, C., Burke, S. P., Heravi, S. and Persand, G. (2005) Autoregressive conditional kurtosis. Journal of Financial Econometrics, 3 (3). pp. 399-421. ISSN 1479-8417 doi: https://doi.org/10.1093/jjfinec/nbi018 Available at http://centaur.reading.ac.uk/20558/ It is advisable to refer to the publisher’s version if you intend to cite from the work. See Guidance on citing . Published version at: http://dx.doi.org/10.1093/jjfinec/nbi018 To link to this article DOI: http://dx.doi.org/10.1093/jjfinec/nbi018 Publisher: Oxford University Press All outputs in CentAUR are protected by Intellectual Property Rights law, including copyright law. Copyright and IPR is retained by the creators or other copyright holders. Terms and conditions for use of this material are defined in the End User Agreement . www.reading.ac.uk/centaur CentAUR Central Archive at the University of Reading Reading’s research outputs online This is a pre-copyedited, author-produced PDF of an article accepted for publication in the Journal of Financial Econometrics following peer review. The definitive publisher-authenticated version (C. Brooks, S.P. Burke, S. Heravi and G. Persand, ‘Autoregressive Conditional Kurtosis’, Journal of Financial Econometrics, 3.3 (2005)) is available online at: http://jfec.oxfordjournals.org/content/3/3/399 1 Autoregressive Conditional Kurtosis Chris Brooks1, Simon P. Burke2, Saeed Heravi3, Gita Persand4 The authors’ affiliations are 1Corresponding author: Cass Business School, City of London, 106 Bunhill Row, London EC1Y 8TZ, UK, tel: (+44) 20 70 40 51 68; fax: (+44) 20 70 40 88 81 41; e-mail: [email protected] ; 2School of Business, University of Reading 3Cardiff Business School, and 4Management School, University of Southampton.
    [Show full text]
  • Quadratic Versus Linear Estimating Equations
    ST 762 Nonlinear Statistical Models for Univariate and Multivariate Response Quadratic versus Linear Estimating Equations GLS estimating equations 0 1 n fβj 0 2 2 −1 X σ gj 0 Yj − fj @ 2 2 1/σ A 4 4 2 2 2 = 0: 0 2σ gj 0 2σ gj (Yj − fj ) − σ gj j=1 νθj Estimating equations for β are linear in Yj . Estimating equations for β only require specification of the first two moments. GLS is optimal among all linear estimating equations. 1 / 26 Quadratic versus Linear Estimating Equations ST 762 Nonlinear Statistical Models for Univariate and Multivariate Response Gaussian ML estimating equations 0 2 2 1 n fβj 2σ gj νβj 2 2 −1 X σ g 0 Yj − fj j = 0: @ 2 2 1/σ A 4 4 (Y − f )2 − σ2g 2 0 2σ gj 0 2σ gj j j j j=1 νθj Estimating equations for β are quadratic in Yj . Estimating equations for β require specification of the third and fourth moments as well. Specifically, if we let Yj − f (xj ; β) j = ; σg (β; θ; xj ) then we need to know 3 ∗ 2 ∗ E j = ζj and var j = 2 + κj : 2 / 26 Quadratic versus Linear Estimating Equations ST 762 Nonlinear Statistical Models for Univariate and Multivariate Response Questions ∗ ∗ ^ If we know the true values ζj and κj , how much is β improved using the quadratic estimating equations versus using the linear estimating equations? If we use working values (for example ζj = κj = 0, corresponding to ∗ ∗ normality) that are not the true values (i.e., ζj and κj ), is there any improvement in using the quadratic estimating equations? If we use working variance functions that are not the true variance functions, is there
    [Show full text]
  • Inference Based on Estimating Equations and Probability-Linked Data
    University of Wollongong Research Online Centre for Statistical & Survey Methodology Faculty of Engineering and Information Working Paper Series Sciences 2009 Inference Based on Estimating Equations and Probability-Linked Data R. Chambers University of Wollongong, [email protected] J. Chipperfield Australian Bureau of Statistics Walter Davis University of Wollongong, [email protected] M. Kovacevic Statistics Canada Follow this and additional works at: https://ro.uow.edu.au/cssmwp Recommended Citation Chambers, R.; Chipperfield, J.; Davis, Walter; and Kovacevic, M., Inference Based on Estimating Equations and Probability-Linked Data, Centre for Statistical and Survey Methodology, University of Wollongong, Working Paper 18-09, 2009, 36p. https://ro.uow.edu.au/cssmwp/38 Research Online is the open access institutional repository for the University of Wollongong. For further information contact the UOW Library: [email protected] Centre for Statistical and Survey Methodology The University of Wollongong Working Paper 18-09 Inference Based on Estimating Equations and Probability-Linked Data Ray Chambers, James Chipperfield, Walter Davis, Milorad Kovacevic Copyright © 2008 by the Centre for Statistical & Survey Methodology, UOW. Work in progress, no part of this paper may be reproduced without permission from the Centre. Centre for Statistical & Survey Methodology, University of Wollongong, Wollongong NSW 2522. Phone +61 2 4221 5435, Fax +61 2 4221 4845. Email: [email protected] Inference Based on Estimating Equations and Probability-Linked Data Ray Chambers, University of Wollongong James Chipperfield, Australian Bureau of Statistics Walter Davis, Statistics New Zealand Milorad Kovacevic, Statistics Canada Abstract Data obtained after probability linkage of administrative registers will include errors due to the fact that some linked records contain data items sourced from different individuals.
    [Show full text]
  • Generalized Estimating Equations for Mixed Models
    GENERALIZED ESTIMATING EQUATIONS FOR MIXED MODELS Lulah Alnaji A Dissertation Submitted to the Graduate College of Bowling Green State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY August 2018 Committee: Hanfeng Chen, Advisor Robert Dyer, Graduate Faculty Representative Wei Ning Junfeng Shang Copyright c August 2018 Lulah Alnaji All rights reserved iii ABSTRACT Hanfeng Chen, Advisor Most statistical approaches of molding the relationship between the explanatory variables and the responses assume subjects are independent. However, in clinical studies the longitudinal data are quite common. In this type of data, each subject is assessed repeatedly over a period of time. Therefore, the independence assumption is unlikely to be valid with longitudinal data due to the correlated observations of each subject. Generalized estimating equations method is a popular choice for longitudinal studies. It is an efficient method since it takes the within-subjects correla- tion into account by introducing the n n working correlation matrix R(↵) which is fully char- ⇥ acterized by the correlation parameter ↵. Although the generalized estimating equations’ method- ology considers correlation among the repeated observations on the same subject, it ignores the between-subject correlation and assumes subjects are independent. The objective of this dissertation is to provide an extension to the generalized estimating equa- tions to take both within-subject and between-subject correlations into account by incorporating the random effect b to the model. If our interest focuses on the regression coefficients, we regard the correlation parameter ↵ and as nuisance and estimate the fixed effects β using the estimating equations U(β,G,ˆ ↵ˆ).
    [Show full text]
  • Using Generalized Estimating Equations to Estimate Nonlinear
    Using generalized estimating equations to estimate nonlinear models with spatial data ∗ § Cuicui Lu†, Weining Wang ‡, Jeffrey M. Wooldridge Abstract In this paper, we study estimation of nonlinear models with cross sectional data using two-step generalized estimating equations (GEE) in the quasi-maximum likelihood estimation (QMLE) framework. In the interest of improving efficiency, we propose a grouping estimator to account for the potential spatial correlation in the underlying innovations. We use a Poisson model and a Negative Binomial II model for count data and a Probit model for binary response data to demon- strate the GEE procedure. Under mild weak dependency assumptions, results on estimation consistency and asymptotic normality are provided. Monte Carlo simulations show efficiency gain of our approach in comparison of different esti- mation methods for count data and binary response data. Finally we apply the GEE approach to study the determinants of the inflow foreign direct investment (FDI) to China. keywords: quasi-maximum likelihood estimation; generalized estimating equations; nonlinear models; spatial dependence; count data; binary response data; FDI equation JEL Codes: C13, C21, C35, C51 arXiv:1810.05855v1 [econ.EM] 13 Oct 2018 ∗This paper is supported by the National Natural Science Foundation of China, No.71601094 and German Research Foundation. †Department of Economics, Nanjing University Business School, Nanjing, Jiangsu 210093 China; email: [email protected] ‡Department of Economics, City, U of London; Northampton Square, Clerkenwell, London EC1V 0HB. Humboldt-Universität zu Berlin, C.A.S.E. - Center for Applied Statistics and Economics; email: [email protected] §Department of Economics, Michigan State University, East Lansing, MI 48824 USA; email: [email protected] 1 1 Introduction In empirical economic and social studies, there are many examples of discrete data which exhibit spatial or cross-sectional correlations possibly due to the closeness of ge- ographical locations of individuals or agents.
    [Show full text]
  • Stat 8112 Lecture Notes Unbiased Estimating Equations Charles J
    Stat 8112 Lecture Notes Unbiased Estimating Equations Charles J. Geyer September 26, 2020 1 Introduction In this handout we generalize the notion of maximum likelihood estima- tion to solution of unbiased estimating equations. We are much less formal in this handout, merely giving a broad overview. Unlike in Geyer (2013) there is no \no-n" version of these asymptotics (and as far as I can see there cannot be). Thus these asymptotics are based on the law of large numbers (LLN) and the central limit theorem (CLT), and n is sample size. The mathematics we explicitly show in this handout will be for indepen- dent and identically distributed (IID) data. If one has non-IID data, then one must use an LLN or a CLT for such data. Suppose X1, X2, ::: are IID and g(x; θ) is some continuously differen- tiable function of data and the the parameter that satisfies Eθfg(Xi; θ)g = 0; for all θ: (1) Write n 1 X h (θ) = g(X ; θ): n n i i=1 We seek estimators satisfying hn(θ^n) = 0: (2) If g is a vector-to-vector function, then so is hn and thus we say (2) are estimating equations (plural), thinking of each component of (2) as one scalar equation. We say the estimating equations are unbiased if Eθfhn(θ)g = 0; for all θ; (3) which follows from (1). The terminology is a bit different from the usual applications of unbiasedness. Clearly (3) says hn(θ) is an unbiased estimator of zero if θ is the true unknown parameter value, but we usually don't think of random variables containing unknown parameters as \estimators." Nevertheless, this is the accepted terminology for saying that (3) holds.
    [Show full text]
  • Introduction to the Generalized Estimating Equations and Its Applications in Small Cluster Randomized Trials
    Introduction to the Generalized Estimating Equations and its Applications in Small Cluster Randomized Trials Fan Li BIOSTAT 900 Seminar November 11, 2016 1 / 24 Overview Outline Background The Generalized Estimating Equations (GEEs) Improved Small-sample Inference Take-home Message How GEEs work? How to improve small-sample inference, especially in cluster randomized trial (CRT) applications? 2 / 24 Cluster Randomized Trials (CRTs) Randomizing clusters of subjects rather than independent subjects (convenience, ethics, contamination etc.) Intervention administered at the cluster level Outcomes are measured at the individual level Outcomes from subjects within the same cluster exhibit a greater correlation than those from different clusters Intraclass correlation coefficient (ICC) ranges from 0:01 to 0:1 Interest lies in evaluating the intervention effect Small number of clusters with large cluster sizes 3 / 24 The Stop Colorectal Cancer (STOP CRC) Study Study the effect of intervention to improve cancer screening 26 health clinics (n = 26) are allocated to either usual care or intervention (1-to-1 ratio) Usual care - provided opportunistic/occasional screening Intervention - an automated program for mailing testing kits with instructions to patients The clinics contain variable number of patients (mi): Min 461/Med 1426/Max 3302 Primary outcome - The patient-level binary outcome (yij), completion status of screening test within a year of study initiation Baseline clinic- and individual-level covariates Inference { estimand? 4 / 24 The Estimand
    [Show full text]
  • Maximum Likelihood Estimation of Distribution Parameters from Incomplete Data Edwin Joseph Hughes Iowa State University
    Iowa State University Capstones, Theses and Retrospective Theses and Dissertations Dissertations 1962 Maximum likelihood estimation of distribution parameters from incomplete data Edwin Joseph Hughes Iowa State University Follow this and additional works at: https://lib.dr.iastate.edu/rtd Part of the Mathematics Commons Recommended Citation Hughes, Edwin Joseph, "Maximum likelihood estimation of distribution parameters from incomplete data " (1962). Retrospective Theses and Dissertations. 2089. https://lib.dr.iastate.edu/rtd/2089 This Dissertation is brought to you for free and open access by the Iowa State University Capstones, Theses and Dissertations at Iowa State University Digital Repository. It has been accepted for inclusion in Retrospective Theses and Dissertations by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. This dissertation has been 62—6497 microfilmed exactly as received HUGHES, Edwin Joseph, 1928- MAXIMUM LIKELIHOOD ESTIMATION OF DISTRIBUTION PARAMETERS FROM INCOMPLETE DATA. Iôwà State University of Science and Technology Ph.D., 1962 Mathematics University Microfilms, Inc., Ann Arbor, Michigan MAXIMUM LIKELIHOOD ESTIMATION OF DISTRIBUTION PARAMETERS FROM INCOMPLETE DATA by Edwin Joseph Hughes A Dissertation Submitted to the Graduate Faculty in Partial Fulfillment of The Requirements for the Degree of DOCTOR OF PHILOSOPHY Major Subject: Statistics Approved: Signature was redacted for privacy. In Ch _ „ rk Signature was redacted for privacy.
    [Show full text]
  • Application of Generalized Linear Models and Generalized Estimation Equations
    Fisheries Research 145 (2013) 66–75 Contents lists available at SciVerse ScienceDirect Fisheries Research j ournal homepage: www.elsevier.com/locate/fishres Application of Generalized Linear Models and Generalized Estimation Equations to model at-haulback mortality of blue sharks captured in a pelagic longline fishery in the Atlantic Ocean a,∗ b a Rui Coelho , Paulo Infante , Miguel N. Santos a Instituto Português do Mar e da Atmosfera (IPMA I.P.), Avenida 5 de Outubro s/n, 8700-305 Olhão, Portugal b Centro de Investigac¸ ão em Matemática e Aplicac¸ ões (CIMA-UE) e Departamento de Matemática, ECT da Universidade de Évora, Rua Romão Ramalho 59, 7000-671 Évora, Portugal a r t a b i s c l e i n f o t r a c t Article history: At-haulback mortality of blue shark (Prionace glauca) captured by the Portuguese pelagic longline fish- Received 31 July 2012 ery targeting swordfish in the Atlantic was modeled. Data was collected by onboard fishery observers Received in revised form 29 January 2013 that monitored 762 fishing sets (1 005 486 hooks) and recorded information on 26 383 blue sharks. The Accepted 16 February 2013 sample size distribution ranged from 40 to 305 cm fork length, with 13.3% of the specimens captured dead at-haulback. Data modeling was carried out with Generalized Linear Models (GLM) and Gener- Keywords: alized Estimation Equations (GEE), given the fishery-dependent source of the data. The explanatory Generalized Estimation Equations variables influencing blue shark mortality rates were year, specimen size, fishing location, sex, season Generalized Linear Models and branch line material.
    [Show full text]
  • M-Estimation (Estimating Equations)
    Chapter 7 M-Estimation (Estimating Equations) 7.1 Introduction In Chapter 1 we made the distinction between the parts of a fully specified statistical model. The primary part is the part that is most important for answering the underlying scientific questions. The secondary part consists of all the remaining details of the model. Usually the primary part is the mean or systematic part of the model, and the secondary part is mainly concerned with the distributional assumptions about the random part of the model. The full specification of the model is important for constructing the likelihood and for using the associated classical methods of inference as spelled out in Chapters 2 and 3 and supported by the asymptotic results of Chapter 6. However, we are now ready to consider robustifying the inference so that misspecification of some secondary assumptions does not invalidate the resulting inferential methods. Basically this robustified inference relies on replacing the 1 b information matrix inverse I.Â/ in the asymptotic normality result for ÂMLE by a generalization I.Â/1B.Â/I.Â/1 called the sandwich matrix. In correctly specified models, I.Â/ D B.Â/, and the sandwich matrix just reduces to the usual I.Â/1. When the model is not correctly specified, I.Â/ ¤ B.Â/, and the sandwich matrix is important for obtaining approximately valid inference. Thus, use of this more general result accommodates misspecification but is still appropriate in correctly specified models although its use there in small samples can entail some loss of efficiency relative to standard likelihood inference.
    [Show full text]
  • Estimated Estimating Equations: Semiparamet- Ric Inference for Clustered/Longitudinal Data
    Estimated estimating equations: Semiparamet- ric inference for clustered/longitudinal data Jeng-Min Chiou Academia Sinica, Taipei, Taiwan and Hans-Georg Muller¨ † University of California, Davis, USA Summary. We introduce a flexible marginal modelling approach for statistical in- ference for clustered/longitudinal data under minimal assumptions. This estimated estimating equations (EEE) approach is semiparametric and the proposed mod- els are fitted by quasi-likelihood regression, where the unknown marginal means are a function of the fixed-effects linear predictor with unknown smooth link, and variance-covariance is an unknown smooth function of the marginal means. We propose to estimate the nonparametric link and variance-covariance functions via smoothing methods, while the regression parameters are obtained via the esti- mated estimating equations. These are score equations that contain nonparamet- ric function estimates. The proposed EEE approach is motivated by its flexibil- ity and easy implementation. Moreover, if data follow a generalized linear mixed model (GLMM), with either specified or unspecified distribution of random effects and link function, the proposed model emerges as the corresponding marginal (population-average) version and can be used to obtain inference for the fixed ef- fects in the underlying GLMM, without the need to specify any other components of this GLMM. Among marginal models, the EEE approach provides a flexible al- ternative to modelling with generalized estimating equations (GEE). Applications of EEE include diagnostics and link selection. The asymptotic distribution of the proposed estimators for the model parameters is derived, enabling statistical infer- ence. Practical illustrations include Poisson modelling of repeated epileptic seizure counts and simulations for clustered binomial responses.
    [Show full text]
  • Association Between Early Cerebral Oxygenation and Neurodevelopmental Impairment Or Death in Premature Infants
    Journal of Perinatology https://doi.org/10.1038/s41372-021-00942-w ARTICLE Association between early cerebral oxygenation and neurodevelopmental impairment or death in premature infants 1 1 1 1 1 1 Anup C. Katheria ● Jacob Stout ● Ana L. Morales ● Debra Poeltler ● Wade D. Rich ● Jane Steen ● 1 1 Shauna Nuzzo ● Neil Finer Received: 6 July 2020 / Revised: 15 October 2020 / Accepted: 19 January 2021 © The Author(s), under exclusive licence to Springer Nature America, Inc. 2021 Abstract Objective To assess the relationship between cerebral oxygenation in the first 72 h of life and neurodevelopmental impairment (NDI) at 2 years corrected age in former premature infants. Study design Prospective observational cohort study of 127 infants <32 weeks GA at birth with cerebral oxygenation monitoring using NIRS in the first 72 h of life. Results Using a threshold cutoff for cerebral hypoxia, infants with NDI or death had increased duration of hypoxia (4 vs 2.3%, p = 0.001), which was more pronounced in the 23–27 week subgroup (7.6 vs 3.2%, p < 0.001). Individual generalized 1234567890();,: 1234567890();,: estimating equations to adjust for repeated measures were modeled in this subgroup for the physiologic parameters including StO2. StO2 < 67% was a predictor for death or NDI (OR 2.75, 95% CI 1.006, 7.5132, p = 0.049). Conclusion An increased duration of cerebral hypoxia is associated with NDI or death in infants born <32 weeks GA. Introduction context of neonatal morbidities like hypoxic ischemic encephalopathy (HIE), late-onset sepsis, and congenital Underdeveloped cerebral circulatory systems of preterm heart disease.
    [Show full text]