An Empirical Bayesian Approach to Misspecified Covariance Structures

Total Page:16

File Type:pdf, Size:1020Kb

An Empirical Bayesian Approach to Misspecified Covariance Structures An Empirical Bayesian Approach to Misspecified Covariance Structures Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School of The Ohio State University By Hao Wu, M.A., M.S. Graduate Program in Psychology The Ohio State University 2010 Dissertation Committee: Dr. Michael W. Browne, Advisor Dr. Michael C. Edwards Dr. Steven N. MacEachern Dr. Thomas E. Nygren Copyright by Hao Wu 2010 Abstract The analysis of covariance structures has been an important topic in psy- chometrics and latent variable modeling. A covariance structure is a model for the covariance matrix of observed manifest variables. It is derived from hypothesized lin- ear relationships among the manifest variables and hypothesized unobserved latent variables. The traditional approach to covariance structures has been successful when the covariance structure is correctly specified, i.e., when the population covariance matrix satisfies the given covariance structure. However, in reality, covariance struc- tures never hold exactly in the population as the hypotheses behind them are only approximations to the truth. Consequently it is necessary to model misspecification when covariance structures are analyzed. The traditional approach, nevertheless, only acknowledges and accounts for the effect of misspecification by post hoc modifications of the original approach to correctly specified covariance structures, and does not actively model the process that may have lead to misspecification. In this dissertation, we present a new approach to misspecified covariance structures in which the systematic error, identified as the process behind misspecifi- cation, is explicitly modeled along with the sampling error as a stochastic quantity ii with a distribution, and the inverse sample size for this distribution, as an unknown parameter to be estimated, gives a measure of misspecification. Analytical properties of the maximum beta likelihood (MBL) procedure implied by this approach and its limit, the maximum inverted Wishart likelihood (MIWL) procedure, are investigated and several connections with the traditional approach are found. Computer programs that give numerical implementations of these procedures are provided. Asymptotic sampling distributions of estimators given by the above two procedures are derived un- der different replication frameworks with a much weaker assumption than the usually invoked Pitman drift assumption. Sampling experiments are conducted to validate the asymptotic sampling distributions and to demonstrate the importance to account for the variations in the parameter estimates due to systematic error. iii Dedicated to my mother iv Acknowledgments I am greatly indebted to my advisor, Dr. Michael W. Browne, for his insight that initiated the current research, his intellectual support and encouragement with- out which the dissertation would not have been possible, and his careful review of the dissertation. I am also grateful to Dr. Steven N. MacEachern and Dr. Michael C. Edwards, whose insightful comments and helpful suggestions point to the practical significance of the current research, shed light on its potentials of further development, and lead to the improvement of the dissertation in terms of both the presentation of key ideas and the demonstration of key results. I wish to thank Dr. Thomas E. Nygren for serving in my dissertation com- mittee though he has a tight schedule as the vice chair of the department. This work was partially supported by NSF grant SES-0437251 from January 2008 through September 2009. v Vita 2004 ................................. B.S. Statistics, Peking University, China 2006 ................................. M.A. Psychology, The Ohio State University 2007 ................................. M.S. Statistics, The Ohio State University 2004 - present .................. Graduate Research and Teaching Associate, Department of Psychology, The Ohio State University Publications Wu, H., Myung, I. J. and Batchelder, W. H. (2010a) On the minimum description length complexity of multinomial processing tree models. Journal of Mathematical Psychology, 54, 291-303. Wu, H., Myung, I. J. and Batchelder, W. H. (2010b) Minimum description length model selection of multinomial processing tree models. Psychonomic Bulletin & Re- view, 17(3), 275-286. Fields of Study Major Field: Psychology Area of Concentration: Quantitative Psychology vi Table of Contents Page Abstract . ii Acknowledgments . v Vita . vi List of Figures . x List of Tables . xi List of Acronyms . xiii List of Notation . xiv Chapters: 1. Introduction . 1 1.1 Covariance Structures . 1 1.2 Model Misspecification . 4 1.2.1 The Issue of Misspecification . 4 1.2.2 Traditional Approach: Procedures . 4 1.2.3 Traditional Approach: Problems . 7 1.3 Overview of the Dissertation . 9 2. An Empirical Bayesian Approach: Motivation . 10 2.1 The Rationale behind the Model . 10 2.2 Statistical Formulation . 12 2.3 Previous Use of This Model . 13 2.4 A Parallel Example . 14 vii 3. Analytical Properties . 18 3.1 The Marginal Distribution . 18 3.2 Asymptotic Behavior . 19 3.2.1 Relationship to Wishart and Inverted Wishart Distributions 19 3.2.2 Relationship to the Normal Distribution . 20 3.3 The Inverted Wishart Model . 22 3.3.1 Parameter Estimation . 23 3.3.2 Bias Correction . 25 3.3.3 Usage as a Loss Function . 26 3.4 The Saturated Covariance Structure . 27 4. Computation . 29 4.1 Derivatives . 29 4.2 Approximate Hessian Matrix . 31 4.3 Computational Formulae . 33 4.4 Computation of the MIWLE . 35 5. Replication Frameworks and Consistency . 36 5.1 The Major Replication Framework . 36 5.2 An Alternative Replication Framework . 37 6. Unconditional Sampling Distribution . 39 6.1 An Additional Assumption . 39 6.2 MBLE . 40 6.3 MIWLE . 43 7. Conditional Sampling Distribution . 45 7.1 v# > 0.................................. 45 7.2 v# =0.................................. 48 7.3 A Non-Central χ2 Approximation . 49 7.4 MIWLE . 52 7.5 Confidence Intervals . 53 8. Sampling Experiments . 55 8.1 Unconditional Sampling Distribution . 55 8.1.1 n = m = 1000 . 56 viii 8.1.2 n = m = 200 . 58 8.1.3 When n and m Vary . 58 8.1.4 Interim Summary . 60 8.2 Conditional Sampling Distribution . 60 8.2.1 The Construction of a Misspecified Covariance Matrix . 61 8.2.2 n = m = 1000 . 62 8.2.3 n = m = 200 . 64 8.2.4 When n and m Vary . 65 8.2.5 Interim Summary . 67 8.3 Comparing the New Approach with the Traditional Approach . 68 9. Summary and Conclusions . 70 Bibliography . 73 Appendices: A. The Constant k ................................ 75 B. Mathematical Formulae . 77 B.1 Taylor Expansions of ln Γp(m/2) . 77 B.2 Second Derivatives of F1 and F2 .................... 81 C. Tables and Figures . 83 D. MATLAB Codes . 104 D.1 Parameter Estimation via MBL . 104 D.2 Parameter Estimation via MIWL . 112 D.3 CFA Covariance Structure and its Derivatives . 122 D.4 Construction of a Misspecified Covariance Matrix . 124 D.5 Inversion of the Non-Central χ2 Distribution . 126 ix List of Figures Figure Page C.1 Unconditional sampling distribution of ξˆ................. 95 IW C.2 Unconditional sampling distribution ofv ˆ0 andv ˆ0 . 96 C.3 Comparison of the unconditional sampling distributions of ξˆ and ξˆIW when n = m = 1000. 97 C.4 Comparison of the unconditional sampling distributions of ξˆ and ξˆIW when n = m = 200. 98 C.5 Conditional sampling distributions of ξˆ. 99 IW C.6 Conditional sampling distribution ofv ˆ0 andv ˆ0 . 100 C.7 Comparison of the conditional sampling distributions of ξˆ and ξˆIW when n = m = 1000. 101 C.8 Comparison of the conditional sampling distributions of ξˆ and ξˆIW when n = m = 200. 102 C.9 Comparison of ξˆ and ξˆW.......................... 103 x List of Tables Table Page C.1 The missing rates of 95% unconditional CIs of ξ when n = m = 1000. 83 C.2 Halflengths of 95% unconditional CIs of ξ when n = m = 1000. 84 C.3 The unconditional RMSEs of parameter estimators when n = m = 1000. 84 C.4 The missing rates of 95% unconditional CIs of ξ when n = m = 200. 85 C.5 The missing rates of 95% unconditional CBs of v. 86 C.6 The missing rates of 95% unconditional CIs of selected covariance structure parameters. 86 C.7 The average KS distances between the simulated and asymptotic un- conditional sampling distributions for both ξˆ and ξˆIW. ........ 87 C.8 The average KS distances between the simulated unconditional sam- pling distributions of ξˆ and ξˆIW...................... 87 C.9 The KS distances between the simulated and asymptotic unconditional IW sampling distributions for bothv ˆ0 andv ˆ0 . 87 C.10 The KS distances between the simulated unconditional sampling dis- IW tributions ofv ˆ0 andv ˆ0 .......................... 88 C.11 The missing rates of 95% conditional CIs of ξ when n = m = 1000. 89 C.12 Halflengths of 95% conditional CIs of ξ when n = m = 1000. 89 C.13 Comparing three estimators of Σ..................... 90 xi C.14 The conditional RMSEs of parameter estimators when n = m = 1000. 90 C.15 The missing rates of 95% conditional CIs of ξ when n = m = 200. 91 C.16 The missing rates of 95% conditional CIs of selected covariance struc- ture parameters. 91 ˆ C.17 The bias ofρ ˆ and ψ1 ........................... 92 C.18 The missing rates of 95% conditional CBs of v ............. 92 C.19 Lengths of 90% conditional CIs of v. .................. 92 C.20 The KS distances between the simulated and asymptotic conditional sampling distributions for estimators of selected parameters. 93 C.21 The KS distances between the simulated conditional sampling distri- butions
Recommended publications
  • User Guide April 2004 EPA/600/R04/079 April 2004
    ProUCL Version 3.0 User Guide April 2004 EPA/600/R04/079 April 2004 ProUCL Version 3.0 User Guide by Anita Singh Lockheed Martin Environmental Services 1050 E. Flamingo Road, Suite E120, Las Vegas, NV 89119 Ashok K. Singh Department of Mathematical Sciences University of Nevada, Las Vegas, NV 89154 Robert W. Maichle Lockheed Martin Environmental Services 1050 E. Flamingo Road, Suite E120, Las Vegas, NV 89119 i Table of Contents Authors ..................................................................... i Table of Contents ............................................................. ii Disclaimer ................................................................. vii Executive Summary ........................................................ viii Introduction ................................................................ ix Installation Instructions ........................................................ 1 Minimum Hardware Requirements ............................................... 1 A. ProUCL Menu Structure .................................................... 2 1. File ............................................................... 3 2. View .............................................................. 4 3. Help .............................................................. 5 B. ProUCL Components ....................................................... 6 1. File ................................................................7 a. Input File Format ...............................................9 b. Result of Opening an Input Data File
    [Show full text]
  • Basic Statistical Concepts Statistical Population
    Basic Statistical Concepts Statistical Population • The entire underlying set of observations from which samples are drawn. – Philosophical meaning: all observations that could ever be taken for range of inference • e.g. all barnacle populations that have ever existed, that exist or that will exist – Practical meaning: all observations within a reasonable range of inference • e.g. barnacle populations on that stretch of coast 1 Statistical Sample •A representative subset of a population. – What counts as being representative • Unbiased and hopefully precise Strategies • Define survey objectives: what is the goal of survey or experiment? What are your hypotheses? • Define population parameters to estimate (e.g. number of individuals, growth, color etc). • Implement sampling strategy – measure every individual (think of implications in terms of cost, time, practicality especially if destructive) – measure a representative portion of the population (a sample) 2 Sampling • Goal: – Every unit and combination of units in the population (of interest) has an equal chance of selection. • This is a fundamental assumption in all estimation procedures •How: – Many ways if underlying distribution is not uniform » In the absence of information about underlying distribution the only safe strategy is random sampling » Costs: sometimes difficult, and may lead to its own source of bias (if sample size is low). Much more about this later Sampling Objectives • To obtain an unbiased estimate of a population mean • To assess the precision of the estimate (i.e.
    [Show full text]
  • Nearest Neighbor Methods Philip M
    Statistics Preprints Statistics 12-2001 Nearest Neighbor Methods Philip M. Dixon Iowa State University, [email protected] Follow this and additional works at: http://lib.dr.iastate.edu/stat_las_preprints Part of the Statistics and Probability Commons Recommended Citation Dixon, Philip M., "Nearest Neighbor Methods" (2001). Statistics Preprints. 51. http://lib.dr.iastate.edu/stat_las_preprints/51 This Article is brought to you for free and open access by the Statistics at Iowa State University Digital Repository. It has been accepted for inclusion in Statistics Preprints by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. Nearest Neighbor Methods Abstract Nearest neighbor methods are a diverse group of statistical methods united by the idea that the similarity between a point and its nearest neighbor can be used for statistical inference. This review article summarizes two common environmetric applications: nearest neighbor methods for spatial point processes and nearest neighbor designs and analyses for field experiments. In spatial point processes, the appropriate similarity is the distance between a point and its nearest neighbor. Given a realization of a spatial point process, the mean nearest neighbor distance or the distribution of distances can be used for inference about the spatial process. One common application is to test whether the process is a homogeneous Poisson process. These methods can be extended to describe relationships between two or more spatial point processes. These methods are illustrated using data on the locations of trees in a swamp hardwood forest. In field experiments, neighboring plots often have similar characteristics before treatments are imposed.
    [Show full text]
  • Experimental Investigation of the Tension and Compression Creep Behavior of Alumina-Spinel Refractories at High Temperatures
    ceramics Article Experimental Investigation of the Tension and Compression Creep Behavior of Alumina-Spinel Refractories at High Temperatures Lucas Teixeira 1,* , Soheil Samadi 2 , Jean Gillibert 1 , Shengli Jin 2 , Thomas Sayet 1 , Dietmar Gruber 2 and Eric Blond 1 1 Université d’Orléans, Université de Tours, INSA-CVL, LaMé, 45072 Orléans, France; [email protected] (J.G.); [email protected] (T.S.); [email protected] (E.B.) 2 Chair of Ceramics, Montanuniversität Leoben, 8700 Leoben, Austria; [email protected] (S.S.); [email protected] (S.J.); [email protected] (D.G.) * Correspondence: [email protected] Received: 3 September 2020; Accepted: 18 September 2020; Published: 22 September 2020 Abstract: Refractory materials are subjected to thermomechanical loads during their working life, and consequent creep strain and stress relaxation are often observed. In this work, the asymmetric high temperature primary and secondary creep behavior of a material used in the working lining of steel ladles is characterized, using uniaxial tension and compression creep tests and an inverse identification procedure to calculate the parameters of a Norton-Bailey based law. The experimental creep curves are presented, as well as the curves resulting from the identified parameters, and a statistical analysis is made to evaluate the confidence of the results. Keywords: refractories; creep; parameters identification 1. Introduction Refractory materials, known for their physical and chemical stability, are used in high temperature processes in different industries, such as iron and steel making, cement and aerospace. These materials are exposed to thermomechanical loads, corrosion/erosion from solids, liquids and gases, gas diffusion, and mechanical abrasion [1].
    [Show full text]
  • Introduction to Basic Concepts
    Introduction to Basic Concepts ! There are 5 basic steps to resource management: " 1st - Assess current situation (gather and analyze info) " 2nd - Develop a plan " 3rd- Implement the plan " 4th- Monitor results (effects) of plan " 5th- Replan ! This class will help you do the 1st and 4th steps in the process ! For any study of vegetation there are basically 8 steps: " 1st Set objectives - What do you wan to know? You can’t study everything, you must have a goal " 2nd Choose variable you are going to measure Make sure you are meeting your objectives " 3rd Read published research & ask around Find out what is known about your topic and research area " 4th Choose a sampling method, number of samples, analysis technique, and statistical hypothesis " 5th Go collect data - Details, details, details.... Pay attention to detail - Practice and train with selected methods before you start - Be able to adjust once you get to the field " 6th Analyze the data - Summarize data - Use a statistical comparison " 7th Interpret the results - What did you find? Why is it important? - What can your results tell you about the ecosystem? - Remember the limitations of your technique " 8th Write the report - Your results are worthless if they stay in your files - Clearly document your methods - Use tables and figures to explain your data - Be correct and concise - No matter how smart you are, you will sound stupid if you can’t clearly communicate your results A good reference for several aspects of rangeland inventory and monitoring is: http://rangelandswest.org/az/monitoringtechnical.html
    [Show full text]
  • Shapiro, A., “Statistical Inference of Moment Structures”
    Statistical Inference of Moment Structures Alexander Shapiro School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0205, USA e-mail: [email protected] Abstract This chapter is focused on statistical inference of moment structures models. Although the theory is presented in terms of general moment structures, the main emphasis is on the analysis of covariance structures. We discuss identifiability and the minimum discrepancy function (MDF) approach to statistical analysis (estimation) of such models. We address such topics of the large samples theory as consistency, asymptotic normality of MDF estima- tors and asymptotic chi-squaredness of MDF test statistics. Finally, we discuss asymptotic robustness of the normal theory based MDF statistical inference in the analysis of covariance structures. 1 Contents 1 Introduction 1 2 Moment Structures Models 1 3 Discrepancy Functions Estimation Approach 6 4 Consistency of MDF Estimators 9 5 Asymptotic Analysis of the MDF Estimation Procedure 12 5.1 Asymptotics of MDF Test Statistics . 15 5.2 Nested Models . 19 5.3 Asymptotics of MDF Estimators . 20 5.4 The Case of Boundary Population Value . 22 6 Asymptotic Robustness of the MDF Statistical Inference 24 6.1 Elliptical distributions . 26 6.2 Linear latent variate models . 27 1 Introduction In this paper we discuss statistical inference of moment structures where first and/or second population moments are hypothesized to have a parametric structure. Classical examples of such models are multinomial and covariance structures models. The presented theory is sufficiently general to handle various situations, however the main focus is on covariance structures. Theory and applications of covariance structures were motivated first by the factor analysis model and its various generalizations and later by a development of the LISREL models (J¨oreskog [15, 16]) (see also Browne [8] for a thorough discussion of covariance structures modeling).
    [Show full text]
  • Standard Deviation - Wikipedia Visited on 9/25/2018
    Standard deviation - Wikipedia Visited on 9/25/2018 Not logged in Talk Contributions Create account Log in Article Talk Read Edit View history Wiki Loves Monuments: The world's largest photography competition is now open! Photograph a historic site, learn more about our history, and win prizes. Main page Contents Featured content Standard deviation Current events Random article From Wikipedia, the free encyclopedia Donate to Wikipedia For other uses, see Standard deviation (disambiguation). Wikipedia store In statistics , the standard deviation (SD, also represented by Interaction the Greek letter sigma σ or the Latin letter s) is a measure that is Help used to quantify the amount of variation or dispersion of a set About Wikipedia of data values.[1] A low standard deviation indicates that the data Community portal Recent changes points tend to be close to the mean (also called the expected Contact page value) of the set, while a high standard deviation indicates that A plot of normal distribution (or bell- the data points are spread out over a wider range of values. shaped curve) where each band has a Tools The standard deviation of a random variable , statistical width of 1 standard deviation – See What links here also: 68–95–99.7 rule population, data set , or probability distribution is the square root Related changes Upload file of its variance . It is algebraically simpler, though in practice [2][3] Special pages less robust , than the average absolute deviation . A useful Permanent link property of the standard deviation is that, unlike the variance, it Page information is expressed in the same units as the data.
    [Show full text]
  • Sampling Designs for National Forest Assessments Ronald E
    Knowledge reference for national forest assessments Sampling designs for national forest assessments Ronald E. McRoberts,1 Erkki O. Tomppo,2 and Raymond L. Czaplewski 3 THIS CHAPTER DISCUSSES THE FOLLOWING POINTS: • Definition of a population for sampling purposes. • Selection of the size and shape for a plot configuration. • Distinguishing among simple random, systematic, stratified and cluster sampling designs. • Methods for constructing sampling design. • Estimating population means and variances. • Estimating sampling errors. • Special considerations for tropical forest inventories. Abstract estimation methods is a key component of the overall process for information management National forest assessments are best conducted and data registration for NFAs (see chapter with sufficiently accurate and scientifically on Information management and data defensible estimates of forest attributes. This registration, p. 93). chapter discusses the statistical design of the sampling plan for a forest inventory, including 1.1 Objectives the process used to define the population to be The goal is to estimate the condition of forests sampled and the selection of a sample intended for an entire nation using data collected from to satisfy NFA precision requirements. The a sample of field plots. The basic objectives team designing a national forest inventory of an NFA are assumed to be fourfold: (i) to should include an experienced statistician. If obtain national estimates of the total area such an expert is not available, this section of forest, subdivided by major categories provides guidance and recommendations for of forest types and conditions, as well as relatively simple sampling designs that reduce the numbers and distributions of trees by risk and improve chances for success.
    [Show full text]
  • Calculating 95% Upper Confidence Limit (UCL) Guidance Document
    STATE OF CONNECTICUT DEPARTMENT OF ENERGY AND ENVIRONMENTAL PROTECTION Guidance for Calculating the 95% Upper Confidence Level for Demonstrating Compliance with the Remediation Standard Regulations May 30, 2014 Robert J. Klee, Commissioner 79 Elm Street, Hartford, CT 06106 www.ct.gov/deep/remediation 860-424-3705 TABLE OF CONTENTS LIST OF ACRONYMS iii DEFINITION OF TERMS iv 1. Introduction 1 1.1 Definition of 95 % Upper Confidence Level 1 1.2 Data Quality Considerations 2 1.3 Applicability 2 1.3.1 Soil 3 1.3.2 Groundwater 3 1.4 Document Organization 4 2. Developing a Data Set for a Release Area 4 2.1 Data Selection for 95% UCL Calculation for a Release Area 5 2.2 Non-Detect Soil Results in a Release Area Data Set 7 2.3 Quality Control Soil Sample Results in a Release Area Data Set 7 3. Developing a Data Set for a Groundwater Plume 7 3.1 Data Selection for 95% UCL Calculation for a Groundwater Plume 8 3.2 Non-Detect Results in a Groundwater Plume Data Set 8 3.3 Quality Control Results in a Groundwater Plume Data Set 8 4. Evaluating the Data Set 9 4.1 Distribution of COC Concentrations in the Environment 9 4.2 Appropriate Data Set Size 10 4.3 Statistical DQOs 10 4.3.1 Randomness of Data Set 10 4.3.2 Strength of Data Set 10 4.3.3 Skewness of Data Set 11 5. Statistical Calculation Methods 11 i 5.1 Data Distributions 12 5.2 Handling of Non-Detect Results in Statistical Calculations 12 6.
    [Show full text]
  • Introduction to Estimation
    5: Introduction to Estimation Parameters and statistics Statistical inference is the act of generalizing from a sample to a population with calculated degree of certainty. The two forms of statistical inference are estimation and hypothesis testing. This chapter introduces estimation. The next chapter introduces hypothesis testing. A statistical population represents the set of all possible values for a variable. In practice, we do not study the entire population. Instead, we use data in a sample to shed light on the wider population. The process of generalizing from the sample to the population is statistical inference. The term parameter is used to refer to a numerical characteristic of a population. Examples of parameters include the population mean (μ) and the population standard deviation (σ). A numerical characteristic of the sample is a statistic. In this chapter we introduce a particular type of statistic called an estimate. The sample mean x is the natural estimator of population mean μ. Sample standard deviation s is the natural estimator of population standard deviation σ. Different symbols are used to denote parameters and estimates. (e.g., μ versus x ). The parameter is a fixed constant. In contrast, the estimator varies from sample to sample. Other differences are summarized: Parameters Estimators Source Population Sample Value known? No Yes (calculate) Notation Greek (μ) Roman ( x ) Vary from sample to sample No Yes Error-prone No Yes Page 5.1 (C:\data\StatPrimer\estimation.doc, 8/1/2006) Sampling distribution of a mean (SDM) If we had the opportunity to take repeated samples from the same population, samples means ( x s) would vary from sample to sample and form a sampling distribution means (SDM).
    [Show full text]
  • Exercise 1C Scientific Investigation: Statistical Analysis
    Exercise 1C Scientific Investigation: Statistical Analysis Parts of this lab adapted from General Ecology Labs, Dr. Chris Brown, Tennessee Technological University and Ecology on Campus, Dr. Robert Kingsolver, Bellarmine University. In part C of our Scientific Investigation labs, we will use the measurement data from part B to ask new questions and apply some basic statistics. Ecology is the ambitious attempt to understand life on a grand scale. We know that the mechanics of the living world are too vast to see from a single vantage point, too gradual to observe in a single lifetime, and too complex to capture in a single narrative. This is why ecology has always been a quantitative discipline. Measurement empowers ecologists because our measuring instruments extend our senses, and numerical records extend our capacity for observation. With measurement data, we can compare the growth rates of trees across a continent, through a series of environmental conditions, or over a period of years. Imagine trying to compare from memory the water clarity of two streams located on different continents visited in separate years, and you can easily appreciate the value of measurement. Numerical data extend our capacity for judgment too. Since a stream runs muddier after a rain and clearer in periods of drought, how could you possibly wade into two streams in different seasons and hope to develop a valid comparison? In a world characterized by change, data sets provide reliability unrealized by single observations. Quantitative concepts such as averages, ratios, variances, and probabilities reveal ecological patterns that would otherwise remain unseen and unknowable.
    [Show full text]
  • Quality Guidelines for Frames Insocialstatistics
    Quality Guidelines for Frames in Social Statistics Version 1.51 ESSnet KOMUSO Quality in Multisource Statistics http://ec.europa.eu/eurostat/cros/content/essnet-quality-multisource-statistics_en Framework Partnership Agreement No 07112.2015.003-2015.226 Specific Grant Agreement No 3 (SGA-3) No 07112.2018.007-2018.0444 Work Package 2 QUALITY GUIDELINES FOR FRAMES IN SOCIAL STATISTICS Version 1.51, 2019-09-30 Prepared by: Thomas Burg (Statistics Austria), Alexander Kowarik (Statistics Austria), Magdalena Six (Statistics Austria), Giovanna Brancato (Istat, Italy) and Danuté Krapavickaité (LS, Lithuania) Reviewers: Fabian Bach (ESTAT), Lionel Vigino (ESTAT), Li-Chun Zhang (SSB, Norway), Ton de Waal (CBS, The Netherlands), Angela McCourt (CSO, Ireland), Eva-Maria Asamer (Statistics Austria) and Christoph Waldner (Statistics Austria) ESSnet co-ordinator: Niels Ploug (DST, Denmark), email [email protected], telephone +45 3917 3951 1 Quality Guidelines for Frames in Social Statistics Version 1.51 Table of contents 1 Purpose and Objectives of the Document ................................................................................................................ 3 2 Frames in Official Statistics ....................................................................................................................................... 5 2.1 Definition of a Frame ........................................................................................................................................ 5 2.2 Frames in Social Statistics ................................................................................................................................
    [Show full text]