VGAM Package. R News, 8, 28–39

Total Page:16

File Type:pdf, Size:1020Kb

VGAM Package. R News, 8, 28–39 Package ‘VGAM’ January 14, 2021 Version 1.1-5 Date 2021-01-13 Title Vector Generalized Linear and Additive Models Author Thomas Yee [aut, cre], Cleve Moler [ctb] (author of several LINPACK routines) Maintainer Thomas Yee <[email protected]> Depends R (>= 3.5.0), methods, stats, stats4, splines Suggests VGAMextra, MASS, mgcv Enhances VGAMdata Description An implementation of about 6 major classes of statistical regression models. The central algorithm is Fisher scoring and iterative reweighted least squares. At the heart of this package are the vector generalized linear and additive model (VGLM/VGAM) classes. VGLMs can be loosely thought of as multivariate GLMs. VGAMs are data-driven VGLMs that use smoothing. The book ``Vector Generalized Linear and Additive Models: With an Implementation in R'' (Yee, 2015) <DOI:10.1007/978-1-4939-2818-7> gives details of the statistical framework and the package. Currently only fixed-effects models are implemented. Many (150+) models and distributions are estimated by maximum likelihood estimation (MLE) or penalized MLE. The other classes are RR-VGLMs (reduced-rank VGLMs), quadratic RR-VGLMs, reduced-rank VGAMs, RCIMs (row-column interaction models)---these classes perform constrained and unconstrained quadratic ordination (CQO/UQO) models in ecology, as well as constrained additive ordination (CAO). Hauck-Donner effect detection is implemented. Note that these functions are subject to change; see the NEWS and ChangeLog files for latest changes. License GPL-3 URL https://www.stat.auckland.ac.nz/~yee/VGAM/ NeedsCompilation yes 1 2 R topics documented: BuildVignettes yes LazyLoad yes LazyData yes Repository CRAN Date/Publication 2021-01-14 06:20:05 UTC R topics documented: VGAM-package . 13 A1A2A3 . 16 AA.Aa.aa . 18 AB.Ab.aB.ab . 19 ABO............................................. 20 acat ............................................. 21 add1.vglm . 23 AICvlm . 24 alaplace . 26 alaplaceUC . 31 altered . 32 amlbinomial . 34 amlexponential . 36 amlnormal . 38 amlpoisson . 40 anova.vglm . 43 AR1............................................. 45 AR1EIM . 48 auuc............................................. 52 aux.posbernoulli.t . 53 backPain . 54 beggs ............................................ 55 bell ............................................. 56 Benford . 57 Benini . 58 benini1 . 60 Betabinom . 61 betabinomial . 65 betabinomialff . 67 betaff . 71 Betageom . 73 betageometric . 74 betaII . 76 Betanorm . 77 betaprime . 79 betaR . 80 Biamhcop . 82 biamhcop . 83 R topics documented: 3 Biclaytoncop . 85 biclaytoncop . 86 BICvlm . 88 Bifgmcop . 89 bifgmcop . 90 bifgmexp . 91 bifrankcop . 93 bigamma.mckay . 94 bigumbelIexp . 96 bilogis . 97 bilogistic . 99 Binom2.or . 100 binom2.or . 102 Binom2.rho . 105 binom2.rho . 107 binomialff . 110 Binorm . 112 binormal . 114 binormalcop . 116 Binormcop . 117 Biplackett . 119 biplackettcop . 120 biplot-methods . 122 Bisa............................................. 122 bisa ............................................. 123 Bistudentt . 125 bistudentt . 126 bmi.nz . 128 borel.tanner . 129 Bort............................................. 130 Brat ............................................. 131 brat ............................................. 133 bratt . 135 calibrate . 137 calibrate-methods . 138 calibrate.qrrvglm . 139 calibrate.qrrvglm.control . 142 calibrate.rrvglm . 144 calibrate.rrvglm.control . 146 cao.............................................. 147 cao.control . 151 Card............................................. 154 cardioid . 156 cauchitlink . 157 cauchy . 159 cdf.lmscreg . 161 cens.gumbel . 163 cens.normal . 165 4 R topics documented: cens.poisson . 166 cfibrosis . 169 cgo.............................................. 170 chest.nz . 171 chinese.nz . ..
Recommended publications
  • A Note on the Screaming Toes Game
    A note on the Screaming Toes game Simon Tavar´e1 1Department of Statistics, Columbia University, 1255 Amsterdam Avenue, New York, NY 10027, USA June 11, 2020 Abstract We investigate properties of random mappings whose core is composed of derangements as opposed to permutations. Such mappings arise as the natural framework to study the Screaming Toes game described, for example, by Peter Cameron. This mapping differs from the classical case primarily in the behaviour of the small components, and a number of explicit results are provided to illustrate these differences. Keywords: Random mappings, derangements, Poisson approximation, Poisson-Dirichlet distribu- tion, probabilistic combinatorics, simulation, component sizes, cycle sizes MSC: 60C05,60J10,65C05, 65C40 1 Introduction The following problem comes from Peter Cameron’s book, [4, p. 154]. n people stand in a circle. Each player looks down at someone else’s feet (i.e., not at their own feet). At a given signal, everyone looks up from the feet to the eyes of the person they were looking at. If two people make eye contact, they scream. What is the probability qn, say, of at least one pair screaming? The purpose of this note is to put this problem in its natural probabilistic setting, namely that of a random mapping whose core is a derangement, for which many properties can be calculated simply. We focus primarily on the small components, but comment on other limiting regimes in the discussion. We begin by describing the usual model for a random mapping. Let B1,B2,...,Bn be indepen- arXiv:2006.04805v1 [math.PR] 8 Jun 2020 dent and identically distributed random variables satisfying P(B = j)=1/n, j [n], (1) i ∈ where [n] = 1, 2,...,n .
    [Show full text]
  • Please Scroll Down for Article
    This article was downloaded by: [informa internal users] On: 4 September 2009 Access details: Access Details: [subscription number 755239602] Publisher Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 37-41 Mortimer Street, London W1T 3JH, UK Communications in Statistics - Theory and Methods Publication details, including instructions for authors and subscription information: http://www.informaworld.com/smpp/title~content=t713597238 Sequential Testing to Guarantee the Necessary Sample Size in Clinical Trials Andrew L. Rukhin a a Statistical Engineering Division, Information Technologies Laboratory, National Institute of Standards and Technology, Gaithersburg, Maryland, USA Online Publication Date: 01 January 2009 To cite this Article Rukhin, Andrew L.(2009)'Sequential Testing to Guarantee the Necessary Sample Size in Clinical Trials',Communications in Statistics - Theory and Methods,38:16,3114 — 3122 To link to this Article: DOI: 10.1080/03610920902947584 URL: http://dx.doi.org/10.1080/03610920902947584 PLEASE SCROLL DOWN FOR ARTICLE Full terms and conditions of use: http://www.informaworld.com/terms-and-conditions-of-access.pdf This article may be used for research, teaching and private study purposes. Any substantial or systematic reproduction, re-distribution, re-selling, loan or sub-licensing, systematic supply or distribution in any form to anyone is expressly forbidden. The publisher does not give any warranty express or implied or make any representation that the contents will be complete or accurate or up to date. The accuracy of any instructions, formulae and drug doses should be independently verified with primary sources. The publisher shall not be liable for any loss, actions, claims, proceedings, demand or costs or damages whatsoever or howsoever caused arising directly or indirectly in connection with or arising out of the use of this material.
    [Show full text]
  • Chapter 8 - Frequently Used Symbols
    UNSIGNALIZED INTERSECTION THEORY BY ROD J. TROUTBECK13 WERNER BRILON14 13 Professor, Head of the School, Civil Engineering, Queensland University of Technology, 2 George Street, Brisbane 4000 Australia. 14 Professor, Institute for Transportation, Faculty of Civil Engineering, Ruhr University, D 44780 Bochum, Germany. Chapter 8 - Frequently used Symbols bi = proportion of volume of movement i of the total volume on the shared lane Cw = coefficient of variation of service times D = total delay of minor street vehicles Dq = average delay of vehicles in the queue at higher positions than the first E(h) = mean headway E(tcc ) = the mean of the critical gap, t f(t) = density function for the distribution of gaps in the major stream g(t) = number of minor stream vehicles which can enter into a major stream gap of size, t L = logarithm m = number of movements on the shared lane n = number of vehicles c = increment, which tends to 0, when Var(tc ) approaches 0 f = increment, which tends to 0, when Var(tf ) approaches 0 q = flow in veh/sec qs = capacity of the shared lane in veh/h qm,i = capacity of movement i, if it operates on a separate lane in veh/h qm = the entry capacity qm = maximum traffic volume departing from the stop line in the minor stream in veh/sec qp = major stream volume in veh/sec t = time tc = critical gap time tf = follow-up times tm = the shift in the curve Var(tc ) = variance of critical gaps Var(tf ) = variance of follow-up-times Var (W) = variance of service times W = average service time.
    [Show full text]
  • Concentration Inequalities from Likelihood Ratio Method
    Concentration Inequalities from Likelihood Ratio Method ∗ Xinjia Chen September 2014 Abstract We explore the applications of our previously established likelihood-ratio method for deriving con- centration inequalities for a wide variety of univariate and multivariate distributions. New concentration inequalities for various distributions are developed without the idea of minimizing moment generating functions. Contents 1 Introduction 3 2 Likelihood Ratio Method 4 2.1 GeneralPrinciple.................................... ...... 4 2.2 Construction of Parameterized Distributions . ............. 5 2.2.1 WeightFunction .................................... .. 5 2.2.2 ParameterRestriction .............................. ..... 6 3 Concentration Inequalities for Univariate Distributions 7 3.1 BetaDistribution.................................... ...... 7 3.2 Beta Negative Binomial Distribution . ........ 7 3.3 Beta-Prime Distribution . ....... 8 arXiv:1409.6276v1 [math.ST] 1 Sep 2014 3.4 BorelDistribution ................................... ...... 8 3.5 ConsulDistribution .................................. ...... 8 3.6 GeetaDistribution ................................... ...... 9 3.7 GumbelDistribution.................................. ...... 9 3.8 InverseGammaDistribution. ........ 9 3.9 Inverse Gaussian Distribution . ......... 10 3.10 Lagrangian Logarithmic Distribution . .......... 10 3.11 Lagrangian Negative Binomial Distribution . .......... 10 3.12 Laplace Distribution . ....... 11 ∗The author is afflicted with the Department of Electrical
    [Show full text]
  • On a Two-Parameter Yule-Simon Distribution
    On a two-parameter Yule-Simon distribution Erich Baur ♠ Jean Bertoin ♥ Abstract We extend the classical one-parameter Yule-Simon law to a version depending on two pa- rameters, which in part appeared in [1] in the context of a preferential attachment algorithm with fading memory. By making the link to a general branching process with age-dependent reproduction rate, we study the tail-asymptotic behavior of the two-parameter Yule-Simon law, as it was already initiated in [1]. Finally, by superposing mutations to the branching process, we propose a model which leads to the full two-parameter range of the Yule-Simon law, generalizing thereby the work of Simon [20] on limiting word frequencies. Keywords: Yule-Simon model, Crump-Mode-Jagers branching process, population model with neutral mutations, heavy tail distribution, preferential attachment with fading memory. AMS subject classifications: 60J80; 60J85; 60G55; 05C85. 1 Introduction The standard Yule process Y = (Y (t))t≥0 is a basic population model in continuous time and with values in N := 1, 2,... It describes the evolution of the size of a population started { } from a single ancestor, where individuals are immortal and give birth to children at unit rate, independently one from the other. It is well-known that for every t 0, Y (t) has the geometric −t ≥ distribution with parameter e . As a consequence, if Tρ denotes an exponentially distributed random time with parameter ρ > 0 which is independent of the Yule process, then for every k N, there is the identity ∈ ∞ P(Y (T )= k)= ρ e−ρt(1 e−t)k−1e−tdt = ρB(k, ρ + 1), (1) ρ − Z0 arXiv:2001.01486v1 [math.PR] 6 Jan 2020 where B is the beta function.
    [Show full text]
  • PDF Document
    Available at Applications and Applied http://pvamu.edu/aam Mathematics: Appl. Appl. Math. An International Journal ISSN: 1932-9466 (AAM) Vol. 15, Issue 2 (December 2020), pp. 764 – 800 Statistics of Branched Populations Split into Different Types Thierry E. Huillet Laboratoire de Physique Théorique et Modélisation CY Cergy Paris Université / CNRS UMR-8089 Site de Saint Martin, 2 Avenue Adolphe-Chauvin 95302 Cergy-Pontoise, France [email protected] Received: January 3, 2019; Accepted: May 30, 2020 Abstract Some population is made of n individuals that can be of P possible species (or types) at equilib- rium. How are individuals scattered among types? We study two random scenarios of such species abundance distributions. In the first one, each species grows from independent founders according to a Galton-Watson branching process. When the number of founders P is either fixed or random (either Poisson or geometrically-distributed), a question raised is: given a population of n individ- uals as a whole, how does it split into the species types? This model is one pertaining to forests of Galton-Watson trees. A second scenario that we will address in a similar way deals with forests of increasing trees. Underlying this setup, the creation/annihilation of clusters (trees) is shown to result from a recursive nucleation/aggregation process as one additional individual is added to the total population. Keywords: Branching processes; Galton-Watson trees; Increasing trees; Poisson and geometric forests of trees; Combinatorial probability; Canonical and grand-canonical species abundance distributions; Singularity analysis of large forests MSC 2010 No.: 60J80, 92D25 764 AAM: Intern.
    [Show full text]
  • A New Discrete Distribution for Vehicle Bunch Size Data: Model, Properties, Comparison and Application
    A NEW DISCRETE DISTRIBUTION FOR VEHICLE BUNCH SIZE DATA: MODEL, PROPERTIES, COMPARISON AND APPLICATION Adil Rashid1, Zahoor Ahmad2 and T. R. Jan3 1,2,3 P.G Department of Statistics, University of Kashmir, Srinagar(India) ABSTRACT In the present paper we shall construct a new probability distribution by compounding Borel distribution with beta distribution and it will be named as Borel-Beta Distribution (BBD). The newly proposed distribution finds its application in modeling traffic data sets where the concentration of observed frequency corresponding to the very first observation is maximum. BBD reduces to Borel-uniform distribution on specific parameter setting. Furthermore, we have also obtained compound version of Borel-Tanner distributions. Some mathematical properties such as mean, variance of some compound distributions have also been discussed. The estimation of parameters of the proposed distribution has been obtained via maximum likelihood estimation method. Finally the potentiality of proposed distribution is justified by using it to model the real life data set.. Keywords –Borel distribution, Borel-Tanner distribution, beta distribution, Consul distribution, Compound distribution I. INTRODUCTION Probability distributions are used to model the real world phenomenon so that scientists may predict the random outcome with somewhat certainty but to many complex systems in nature one cannot expect the behavior of random variables to be identical everytime. Most of the times researchers encounter the scientific data that lacks the sequential pattern and hence and to deal with such a randomness one must be well equipped knowledge of probability. The aim of researchers lies in predicting the random behavior of data under consideration efficiently and this can be done by using a suitable probability distributions but the natural question that arises here, is that limited account of probability distributions can’t be used to model vast and diverse forms of data sets.
    [Show full text]
  • Error Analysis of Structured Markov Chains Eleni Vatamidou
    Error analysis of structured Markov chains Error analysis of structured Markov chains Eleni Vatamidou Eleni Vatamidou Eindhoven University of Technology Department of Mathematics and Computer Science Error analysis of structured Markov chains Eleni Vatamidou This work is part of the research programme Error Bounds for Structured Markov Chains (project nr. 613.001.006), which is (partly) financed by the Netherlands Organ- isation for Scientific Research (NWO). c Eleni Vatamidou, 2015. Error analysis of structured Markov chains / by Eleni Vatamidou. { Mathematics Subject Classification (2010): Primary 60K25, 91B30; Secondary 68M20, 60F10, 41A99. A catalogue record is available from the Eindhoven University of Technology Library. ISBN: 978-90-386-3878-2 This thesis is number D 192 of the thesis series of the Beta Research School for Operations Management and Logistics. Kindly supported by a full scholarship from the legacy of K. Katseas. Error analysis of structured Markov chains proefschrift ter verkrijging van de graad van doctor aan de Technische Universiteit Eindhoven, op gezag van de rector magnificus, prof.dr.ir. F.P.T. Baaijens, voor een commissie aangewezen door het College voor Promoties in het openbaar te verdedigen op maandag 22 juni 2015 om 16.00 uur door Eleni Vatamidou geboren te Thessaloniki, Griekenland Dit proefschrift is goedgekeurd door de promotoren en de samenstelling van de promotiecommissie is als volgt: voorzitter: prof.dr. M.G.J. van den Brand 1e promotor: prof.dr. A.P. Zwart 2e promotor: prof.dr.ir. I.J.B.F. Adan copromotor: dr. M. Vlasiou leden: prof.dr. H. Albrecher (Universite de Lausanne) dr. B. Van Houdt (Universiteit Antwerpen) prof.dr.ir.
    [Show full text]
  • Galton--Watson Branching Processes and the Growth of Gravitational
    Mon. Not. R. Astron. Soc. 000, 000–000 (0000) Printed 23 September 2018 (MN LATEX style file v1.3) Galton–Watson branching processes and the growth of gravitational clustering Ravi K. Sheth Berkeley Astronomy Deptartment, Univeristy of California, Berkeley, CA 94720 23 September 2018 ABSTRACT The Press–Schechter description of gravitational clustering from an initially Poisson distribution is shown to be equivalent to the well studied Galton–Watson branching process. This correspondence is used to provide a detailed description of the evolution of hierarchical clustering, including a complete description of the merger history tree. The relation to branching process epidemic models means that the Press–Schechter description can also be understood using the formalism developed in the study of queues. The queueing theory formalism, also, is used to provide a complete description of the merger history of any given Press–Schechter clump. In particular, an analytic expression for the merger history of any given Poisson Press–Schechter clump is obtained. This expression allows one to calculate the partition function of merger history trees. It obeys an interesting scaling relation; the partition function for a given pair of initial and final epochs is the same as that for certain other pairs of initial and final epochs. The distribution function of counts in randomly placed cells, as a function of time, is also obtained using the branching process and queueing theory descriptions. Thus, the Press–Schechter description of the gravitational evolution of clustering from an initially Poisson distribution is now complete. All these interrelations show why the Press–Schechter approach works well in a statistical sense, but cannot provide a detailed description of the dynamics of the clustering particles themselves.
    [Show full text]
  • Count Data Is the Poisson Dis- Tribution with the Pmf Given By: E−Μµy P (Y = Y) =
    A Service of Leibniz-Informationszentrum econstor Wirtschaft Leibniz Information Centre Make Your Publications Visible. zbw for Economics Drescher, Daniel Working Paper Alternative distributions for observation driven count series models Economics Working Paper, No. 2005-11 Provided in Cooperation with: Christian-Albrechts-University of Kiel, Department of Economics Suggested Citation: Drescher, Daniel (2005) : Alternative distributions for observation driven count series models, Economics Working Paper, No. 2005-11, Kiel University, Department of Economics, Kiel This Version is available at: http://hdl.handle.net/10419/21999 Standard-Nutzungsbedingungen: Terms of use: Die Dokumente auf EconStor dürfen zu eigenen wissenschaftlichen Documents in EconStor may be saved and copied for your Zwecken und zum Privatgebrauch gespeichert und kopiert werden. personal and scholarly purposes. Sie dürfen die Dokumente nicht für öffentliche oder kommerzielle You are not to copy documents for public or commercial Zwecke vervielfältigen, öffentlich ausstellen, öffentlich zugänglich purposes, to exhibit the documents publicly, to make them machen, vertreiben oder anderweitig nutzen. publicly available on the internet, or to distribute or otherwise use the documents in public. Sofern die Verfasser die Dokumente unter Open-Content-Lizenzen (insbesondere CC-Lizenzen) zur Verfügung gestellt haben sollten, If the documents have been made available under an Open gelten abweichend von diesen Nutzungsbedingungen die in der dort Content Licence (especially Creative
    [Show full text]
  • On Total Claim Amount for Marked Poisson Cluster Models Bojan Basrak, Olivier Wintenberger, Petra Zugec
    On total claim amount for marked Poisson cluster models Bojan Basrak, Olivier Wintenberger, Petra Zugec To cite this version: Bojan Basrak, Olivier Wintenberger, Petra Zugec. On total claim amount for marked Poisson cluster models. Journal of Applied Probability, Cambridge University press, In press. hal-01788339v2 HAL Id: hal-01788339 https://hal.archives-ouvertes.fr/hal-01788339v2 Submitted on 21 Mar 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. On total claim amount for marked Poisson cluster models Bojan Basraka, Olivier Wintenbergerb, Petra Zugecˇ c,∗ aDepartment of Mathematics, University of Zagreb, Bijeniˇcka 30, Zagreb, Croatia bLPSM, Sorbonne university, Jussieu, F-75005, Paris, France cFaculty of Organization and Informatics, Pavlinska 2, Varaˇzdin, University of Zagreb, Croatia Abstract We study the asymptotic distribution of the total claim amount for marked Poisson cluster models. The marks determine the size and other characteris- tics of the individual claims and potentially influence arrival rate of the future claims. We find sufficient conditions under which the total claim amount sat- isfies the central limit theorem or alternatively tends in distribution to an infinite variance stable random variable. We discuss several Poisson cluster models in detail, paying special attention to the marked Hawkes processes as our key example.
    [Show full text]
  • Estimating the Propagation and Extent of Cascading Line Outages from Utility Data with a Branching Process
    PREPRINT OF DOI 10.1109/TPWRS.2012.2190112, IEEE TRANSACTIONS ON POWER SYSTEMS, VOL 27, NO 4, NOV 2012, 2146-2155 1 Estimating the propagation and extent of cascading line outages from utility data with a branching process Ian Dobson, Fellow IEEE Abstract—Large blackouts typically involve the cascading out- have approximately reproduced the distribution of blackout age of transmission lines. We estimate from observed utility sizes obtained from simulations of mechanisms of cascading data how much transmission line outages propagate, and obtain failure in blackouts [16], [17]. Branching processes are used parameters of a probabilistic branching process model of the cascading. The branching process model is then used to predict to analyze observed blackout data in [6], [18]. Moreover, the distribution of total number of outages for a given number of branching processes can approximate other high-level models initial outages. We study how the total number of lines outaged of cascading failure [5], [19], [20], [21]. depends on the propagation as the cascade proceeds. The analysis gives a new way to quantify the effect of cascading failure from Ren and Dobson [18] analyze propagation in a transmission standard utility line outage data. line outage data set that is smaller and has a different source Index Terms—Power transmission reliability, reliability mod- than the data set considered in this paper. They estimate an eling, reliability theory, risk analysis, failure analysis average value of propagation and use a branching process to predict the distribution of the number of line outages from the estimated propagation and the distribution of the initial I.
    [Show full text]