Using Statistical Sampling Is One of a Series of Papers Issued by the Program Evaluation and Methodology Division (PEMD)

Total Page:16

File Type:pdf, Size:1020Kb

Using Statistical Sampling Is One of a Series of Papers Issued by the Program Evaluation and Methodology Division (PEMD) Revised Using Statistical May 1992 Sampling GAWPEMD-10.1.6 Preface GAO assists congressional decisionmakers in their deliberative process by furnishing analytical information on issues and opinions under consideration. Many diverse methodologies are needed to develop sound and timely answers to the questions that are posed by the Congress. To provide GAO evaluators with basic information about the commonly used methodologies, GAO’s policy guidance includes documents such as methodology transfer papers and technical guidelines. The purpose of this methodology transfer paper on statistical sampling is to provide its readers with a background on sampling concepts and methods that will enable them to identify jobs that can benefit from statistical sampling, to know when to seek assistance from a statistical sampling specialist, and to work with the specialist to design and execute a sampling plan. This paper describes sample design, selection and estimation procedures, and the concepts of confidence and sampling precision. Two additional topics, treated more briefly, include special applications of sampling to auditing and evaluation and some relationships between sampling and data collection problems. Last, but not least, the strengths and limitations of statistical sampling are summarized. The original paper was authored by Harry Conley and Lou Fink in April 1986. This reissued version supersedes the earlier edition. Using Statistical Sampling is one of a series of papers issued by the Program Evaluation and Methodology Division (PEMD). The purpose of the series is to provide GAO evaluators with guides to various aspects of audit and evaluation methodology, to illustrate applications, and to indicate where more detailed information is available. Page 1 GAO/PEMD-10.1,6 Statistical Sampling Preface We look forward to receiving comments from the readers of this paper. They should be addressed to Eleanor Chelimsky at 202-275-1854. Werner Grosshans I Assistant Comptroller General Office of Policy -w I Eleanor Chelimsky Assistant Comptroller General for Program Evaluation and Methodology Page 2 GAO/PEMD-10.1.6 Statistical SumpIining Page 3 GAO/PEMD-10.1.6 Statistical Sampling Contents Preface Chapter 1 10 Introduction Evaluation Design 11 Sample Design As an Element of an Assignment 13 Design Representativeness: The Goal of Statistid 14 sampling Random Selection 15 Distinguishing Between Samples and Examples and 16 Between Sampling Operations The Organization of This Paper 17 Chapter 2 19 A Review of Basic Statistks: A Shorthand Description of a Population 20 Measurement Levels of Data 20 Concepts Central Tendency 22 Dispersion 26 Some Comments on Looking Summary statistics in 29 the Eye Chapter 3 30 An Overview of Sample Design 30 Selection Procedures 41 Sample Design Estimation Procedures 42 and Selection and Three Statistical Sampling Methods 43 Estimation Procedures Chapter 4 47 Basic Estimation The Concepts of Precision and Confidence 43 Procedures and Variable Sampling 51 Calculating Sample Size 57 F’urther Sample Attribute Sanmlinn 62 Design A Distinction Be&ken Precision and Accuracy 67 Considerations Page4 GAO/PEMD- 10.1.6 Statistical Sampling Chapter 5 69 Advanced Ratio Estimation 69 Regression Estimation 70 Estimation Difference Estimation 71 Procedures Example of Advanced Estimation Procedures 71 Which Method to Use 80 Other Advantages and Disadvantages of Ratio, 80 Regression, and Difference Estimation Chapter 6 82 Sampling in the Discovery Sampling 82 Au& Environment Acceptance saPh3 88 Audit Judgment and Statistical Sampling 92 The Characteristics of a Good Sample 93 Chapter 7 96 Random Selection The Selection of Sampling Units 97 Procedures The Application of Selection Procedures 104 Selection With Probability Proportional to Size 105 A Final Check 108 Chapter 8 109 Data Collection Checking the Quality of Data Collection 110 and Analysis The Problem of Nonresponse 112 Considerations Related to sampling Chapter 9 116 Statistical Sampling As a Friend of Evaluators Page 6 GAO/PEMD-10.1.6 Statistical Sampling Content5 Appendixes Appendix I: Theoretical Background 122 Appendix II: A Comprehensive Description of 144 Sampling Procedures Appendix III: Details on Stratified and Cluster 180 Sampling Appendix IV: Computer Software Packages 214 Bibliography 221 Glossary 225 Papers in This 238 Series Tables Table 1.1: Four Strategies and Their Evaluation Questions Table 2.1: A Frequency Distribution 19 Table 2.2: Days of Downtime for 10 Computers 22 Table 2.3: Data for Measures of Dispersion 28 Table 2.4: Interpreting the Value of the Coefficient 29 of Variation Table 4.1: ExampIe of Sample Values 53 Table 4.2: Data for Figure 4.1 62 Table 5.1: Data for Advanced Estimation 72 Procedures Example Table 6.1: The Probability of Finding at Least One 84 Error in a Population of 500 for Various Numbers of Errors and Sample Sizes Table 6.2: The Probability of Finding at Least One 86 Error in a Population of 600 for Various Numbers of Errors and Sample Sizes Table 6.3: An Acceptance Sampling Table 90 Table 7.1: Selection With Probability ProportionaI 106 to Size Table 8.1: Tabulation of Responsesto Question on 112 Abolition of the Death Penalty Table 8.2: Type of Sentence Respondents Received 113 Page 6 GAOIPEMD-10.1.6 Statistical Sampling Contents Table I. 1: Dice-Throwing Probabilities 123 Table 1.2: Number and Percent of Red Beads 123 Found in 400 Random Samples of 400 From a Population of 20,000 Containing 4,000 Red Beads Table 1.3: The Relative Frequency of Means of 129 Samples of Two From a Population of Digits Between 0 and 9 Table 1.4: The Relative Frequency of Means of 131 Samples of Three From a Population of Digits Between 0 and 9 Table I.5: The Amounts of 100 Small Purchases I37 Table I.6: Table oft Factors 143 Table II. 1: Random Decimal Digits 146 Table 11.2:Selecting Random Numbers and 157 Random Letters in Compound Numbering Systems With the Same Quantity of Items for Each Letter Table 11.3:Matching Random Numbers and 159 Random Letters in Compound Numbering System With the Same Quantity of Items for Each Letter Table 11.4:Sorting Random Number-Letter 160 Combinations in Compound Numbering Systems With the Same Quantity of Items for Each Letter Table 11.5:Selecting Random Numbers and 162 Random Letters in Compound Numbering Systems With a Different Quantity of Items for Each Letter Table 11.6:Matching Random Numbers and 164 Random Letters in Compound Numbering Systems With a Different Quantity of Items for Each Letter Table II.7: Sorting Random Letter-Number 166 Combinations in Compound Numbering Systems With a Different Quantity of Items for Each Letter Table 11.8:A Population of Records in Four 170 Groups Page 7 GAO/PEMD-10.1.6 Statistical Sampling Contents Table 11.9:Selecting a Single Systematic Sample 17 1 From Four Groups of Records Table III. 1: Air Freight Forwarder Shipping Costs 182 Table 111.2:Savings From Using Direct Air 182 Shipment Instead of Air Freight Forwarder Classified by Air Freight Forwarder Shipping Costs of Less Than $100 Table III. 3: Savings From Using Direct Air 188 Shipment Instead of Air Freight Forwarder Classified by Air Freight Forwarder Shipping Costs of More Than $100 Table III.4: Preliminary Sample Selection 192 Table III.5: Calculated Savings 193 Table 111.6:Final Sample Items Under Neyman 196 Allocation Table 111.7:Sample Results Under Judgmental 198 Allocation Table 111.8:Independent Random Samples of 201 Payroll Records Table 111.9:Assumed Air Freight Forwarder Costs 204 Table III. 10: Allocation of a Sample of 50 205 Table III. 11: Frequency Distribution on 207 Transaction Dollar Amounts Table III. 12: Dollar Amount Multiplied by Strata 208 Numbers Table III. 13: Strata Boundaries 208 Table III. 14: Random Number Sample 210 Figures Figure 4.1: Final Sample Size As Population Size 61 Increases Figure 4.2: Sample Data for Supply Depot Example 63 Figure 4.3: Sample Data for Supply Depot Example 66 for a Given Precision Figure 5.1: Computer Output for Advanced 74 Estimation Procedures Example Figure 5.2: Statistics for Advanced Estimation 78 Procedures Example Figure I. 1: Number of Samples Classified by 126 Percentage of Red Beads Page 8 GAO/PEMD-10.1.6 Statistical Sampling Content8 Figure I.2: The Relative Frequency of a Large 128 Population of Single Digits Between 0 and 9 Figure I.3: The Relative Frequency of Means of 130 Samples of Two From a Population of Digits Between 0 and 9 Figure 1.4: The Relative Frequency of Means of 132 Samples of Three From a Population of Digits Between 0 and 9 Figure 1.5: The Smoothed Curve of the Relative 133 Frequency of Means of Samples of Three From a Population of Digits Between 0 and 9 Figure 1.6: Frequency Distributions of the Means 135 of Samples Drawn From 228 U.S. Cities With Populations Greater Than 50,000 in 1950 Figure III. 1: Results From SRO-STATS 195 Calculation Abbreviations FTC Finite population correction GAO U.S. General Accounting Office ICC Interstate Commerce Commission LTPD Lot tolerance percent defective PEMD Program Evaluation and Methodology Division PPS Probability proportional to size Page 9 GAO/PEMD-10.1.6 Statistical Sampling Chapter 1 Introduction Sampling is a very important element in the design and planning of a job here at GAO as well as any evaluation. The purpose of this paper is to help GAO managers and evaluators learn more about statistical (or probability) sampling and the role it plays in the design and execution of a job. We have attempted TV take the mystery out of what is often thought of as an esoteric subject by “walking” the reader through the various sampling procedures. In this document, we have chosen to describe the computations and sample selection procedures as they are normally done by computer in GAO, but formulas necessary to do the necessary calculations by hand can be found in any sampling textbook.
Recommended publications
  • Hypothesis Testing with Two Categorical Variables 203
    Chapter 10 distribute or Hypothesis Testing With Two Categoricalpost, Variables Chi-Square copy, Learning Objectives • Identify the correct types of variables for use with a chi-square test of independence. • Explainnot the difference between parametric and nonparametric statistics. • Conduct a five-step hypothesis test for a contingency table of any size. • Explain what statistical significance means and how it differs from practical significance. • Identify the correct measure of association for use with a particular chi-square test, Doand interpret those measures. • Use SPSS to produce crosstabs tables, chi-square tests, and measures of association. 202 Part 3 Hypothesis Testing Copyright ©2016 by SAGE Publications, Inc. This work may not be reproduced or distributed in any form or by any means without express written permission of the publisher. he chi-square test of independence is used when the independent variable (IV) and dependent variable (DV) are both categorical (nominal or ordinal). The chi-square test is member of the family of nonparametric statistics, which are statistical Tanalyses used when sampling distributions cannot be assumed to be normally distributed, which is often the result of the DV being categorical rather than continuous (we will talk in detail about this). Chi-square thus sits in contrast to parametric statistics, which are used when DVs are continuous and sampling distributions are safely assumed to be normal. The t test, analysis of variance, and correlation are all parametric. Before going into the theory and math behind the chi-square statistic, read the Research Examples for illustrations of the types of situations in which a criminal justice or criminology researcher would utilize a chi- square test.
    [Show full text]
  • Sampling and Fieldwork Practices in Europe
    www.ssoar.info Sampling and Fieldwork Practices in Europe: Analysis of Methodological Documentation From 1,537 Surveys in Five Cross-National Projects, 1981-2017 Jabkowski, Piotr; Kołczyńska, Marta Veröffentlichungsversion / Published Version Zeitschriftenartikel / journal article Empfohlene Zitierung / Suggested Citation: Jabkowski, P., & Kołczyńska, M. (2020). Sampling and Fieldwork Practices in Europe: Analysis of Methodological Documentation From 1,537 Surveys in Five Cross-National Projects, 1981-2017. Methodology: European Journal of Research Methods for the Behavioral and Social Sciences, 16(3), 186-207. https://doi.org/10.5964/meth.2795 Nutzungsbedingungen: Terms of use: Dieser Text wird unter einer CC BY Lizenz (Namensnennung) zur This document is made available under a CC BY Licence Verfügung gestellt. Nähere Auskünfte zu den CC-Lizenzen finden (Attribution). For more Information see: Sie hier: https://creativecommons.org/licenses/by/4.0 https://creativecommons.org/licenses/by/4.0/deed.de Diese Version ist zitierbar unter / This version is citable under: https://nbn-resolving.org/urn:nbn:de:0168-ssoar-74317-3 METHODOLOGY Original Article Sampling and Fieldwork Practices in Europe: Analysis of Methodological Documentation From 1,537 Surveys in Five Cross-National Projects, 1981-2017 Piotr Jabkowski a , Marta Kołczyńska b [a] Faculty of Sociology, Adam Mickiewicz University, Poznan, Poland. [b] Department of Socio-Political Systems, Institute of Political Studies of the Polish Academy of Science, Warsaw, Poland. Methodology, 2020, Vol. 16(3), 186–207, https://doi.org/10.5964/meth.2795 Received: 2019-03-23 • Accepted: 2019-11-08 • Published (VoR): 2020-09-30 Corresponding Author: Piotr Jabkowski, Szamarzewskiego 89C, 60-568 Poznań, Poland. +48 504063762, E-mail: [email protected] Abstract This article addresses the comparability of sampling and fieldwork with an analysis of methodological data describing 1,537 national surveys from five major comparative cross-national survey projects in Europe carried out in the period from 1981 to 2017.
    [Show full text]
  • ESOMAR 28 Questions
    Responses to ESOMAR 28 Questions INTRODUCTION Intro to TapResearch TapResearch connects mobile, tablet and pc users interested in completing surveys with market researchers who need their opinions. Through our partnerships with dozens of leading mobile apps, ad networks and websites, we’re able to reach an audience exceeding 100 million people in the United States. We are focused on being a top-quality partner for ad hoc survey sampling, panel recruitment, and router integrations. Our technology platform enables reliable feasibility estimates, highly competitive costs, sophisticated quality enforcement, and quick-turnaround project management. Intro to ESOMAR ESOMAR (European Society for Opinion and Market Research) is the essential organization for encouraging, advancing and elevating market research worldwide. Since 1948, ESOMAR’s aim has been to promote the value of market and opinion research in effective decision-making. The ICC/ESOMAR Code on Market and Social Research, which was developed jointly with the International Chamber of Commerce, sets out global guidelines for self-regulation for researchers and has been undersigned by all ESOMAR members and adopted or endorsed by more than 60 national market research associations worldwide. Responses to ESOMAR 28 Questions COMPANY PROFILE 1) What experience does your company have in providing online samples for market research? TapResearch connects mobile, tablet and pc users interested in completing surveys with market researchers who need their opinions. Through our partnerships with dozens of leading mobile apps, ad networks and websites, we’re able to reach an audience exceeding 100 million people in the United States - we’re currently adding about 30,000 panelists/day and this rate is increasing.
    [Show full text]
  • Esomar/Grbn Guideline for Online Sample Quality
    ESOMAR/GRBN GUIDELINE FOR ONLINE SAMPLE QUALITY ESOMAR GRBN ONLINE SAMPLE QUALITY GUIDELINE ESOMAR, the World Association for Social, Opinion and Market Research, is the essential organisation for encouraging, advancing and elevating market research: www.esomar.org. GRBN, the Global Research Business Network, connects 38 research associations and over 3500 research businesses on five continents: www.grbn.org. © 2015 ESOMAR and GRBN. Issued February 2015. This Guideline is drafted in English and the English text is the definitive version. The text may be copied, distributed and transmitted under the condition that appropriate attribution is made and the following notice is included “© 2015 ESOMAR and GRBN”. 2 ESOMAR GRBN ONLINE SAMPLE QUALITY GUIDELINE CONTENTS 1 INTRODUCTION AND SCOPE ................................................................................................... 4 2 DEFINITIONS .............................................................................................................................. 4 3 KEY REQUIREMENTS ................................................................................................................ 6 3.1 The claimed identity of each research participant should be validated. .................................................. 6 3.2 Providers must ensure that no research participant completes the same survey more than once ......... 8 3.3 Research participant engagement should be measured and reported on ............................................... 9 3.4 The identity and personal
    [Show full text]
  • Lecture 1: Why Do We Use Statistics, Populations, Samples, Variables, Why Do We Use Statistics?
    1pops_samples.pdf Michael Hallstone, Ph.D. [email protected] Lecture 1: Why do we use statistics, populations, samples, variables, why do we use statistics? • interested in understanding the social world • we want to study a portion of it and say something about it • ex: drug users, homeless, voters, UH students Populations and Samples Populations, Sampling Elements, Frames, and Units A researcher defines a group, “list,” or pool of cases that she wishes to study. This is a population. Another definition: population = complete collection of measurements, objects or individuals under study. 1 of 11 sample = a portion or subset taken from population funny circle diagram so we take a sample and infer to population Why? feasibility – all MD’s in world , cost, time, and stay tuned for the central limits theorem...the most important lecture of this course. Visualizing Samples (taken from) Populations Population Group you wish to study (Mostly made up of “people” in the Then we infer from sample back social sciences) to population (ALWAYS SOME ERROR! “sampling error” Sample (a portion or subset of the population) 4 This population is made up of the things she wishes to actually study called sampling elements. Sampling elements can be people, organizations, schools, whales, molecules, and articles in the popular press, etc. The sampling element is your exact unit of analysis. For crime researchers studying car thieves, the sampling element would probably be individual car thieves – or theft incidents reported to the police. For drug researchers the sampling elements would be most likely be individual drug users. Inferential statistics is truly the basis of much of our scientific evidence.
    [Show full text]
  • Assessment of Socio-Demographic Sample Composition in ESS Round 61
    Assessment of socio-demographic sample composition in ESS Round 61 Achim Koch GESIS – Leibniz Institute for the Social Sciences, Mannheim/Germany, June 2016 Contents 1. Introduction 2 2. Assessing socio-demographic sample composition with external benchmark data 3 3. The European Union Labour Force Survey 3 4. Data and variables 6 5. Description of ESS-LFS differences 8 6. A summary measure of ESS-LFS differences 17 7. Comparison of results for ESS 6 with results for ESS 5 19 8. Correlates of ESS-LFS differences 23 9. Summary and conclusions 27 References 1 The CST of the ESS requests that the following citation for this document should be used: Koch, A. (2016). Assessment of socio-demographic sample composition in ESS Round 6. Mannheim: European Social Survey, GESIS. 1. Introduction The European Social Survey (ESS) is an academically driven cross-national survey that has been conducted every two years across Europe since 2002. The ESS aims to produce high- quality data on social structure, attitudes, values and behaviour patterns in Europe. Much emphasis is placed on the standardisation of survey methods and procedures across countries and over time. Each country implementing the ESS has to follow detailed requirements that are laid down in the “Specifications for participating countries”. These standards cover the whole survey life cycle. They refer to sampling, questionnaire translation, data collection and data preparation and delivery. As regards sampling, for instance, the ESS requires that only strict probability samples should be used; quota sampling and substitution are not allowed. Each country is required to achieve an effective sample size of 1,500 completed interviews, taking into account potential design effects due to the clustering of the sample and/or the variation in inclusion probabilities.
    [Show full text]
  • Statistical Theory and Methodology for the Analysis of Microbial Compositions, with Applications
    Statistical Theory and Methodology for the Analysis of Microbial Compositions, with Applications by Huang Lin BS, Xiamen University, China, 2015 Submitted to the Graduate Faculty of the Graduate School of Public Health in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Pittsburgh 2020 UNIVERSITY OF PITTSBURGH GRADUATE SCHOOL OF PUBLIC HEALTH This dissertation was presented by Huang Lin It was defended on April 2nd 2020 and approved by Shyamal Das Peddada, PhD, Professor and Chair, Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh Jeanine Buchanich, PhD, Research Associate Professor, Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh Ying Ding, PhD, Associate Professor, Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh Matthew Rogers, PhD, Research Assistant Professor, Department of Surgery, UPMC Children's Hospital of Pittsburgh Hong Wang, PhD, Research Assistant Professor, Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh Dissertation Director: Shyamal Das Peddada, PhD, Professor and Chair, Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh ii Copyright c by Huang Lin 2020 iii Statistical Theory and Methodology for the Analysis of Microbial Compositions, with Applications Huang Lin, PhD University of Pittsburgh, 2020 Abstract Increasingly researchers are finding associations between the microbiome and human diseases such as obesity, inflammatory bowel diseases, HIV, and so on. Determining what microbes are significantly different between conditions, known as differential abundance (DA) analysis, and depicting the dependence structure among them, are two of the most challeng- ing and critical problems that have received considerable interest.
    [Show full text]
  • Sampling Methods It’S Impractical to Poll an Entire Population—Say, All 145 Million Registered Voters in the United States
    Sampling Methods It’s impractical to poll an entire population—say, all 145 million registered voters in the United States. That is why pollsters select a sample of individuals that represents the whole population. Understanding how respondents come to be selected to be in a poll is a big step toward determining how well their views and opinions mirror those of the voting population. To sample individuals, polling organizations can choose from a wide variety of options. Pollsters generally divide them into two types: those that are based on probability sampling methods and those based on non-probability sampling techniques. For more than five decades probability sampling was the standard method for polls. But in recent years, as fewer people respond to polls and the costs of polls have gone up, researchers have turned to non-probability based sampling methods. For example, they may collect data on-line from volunteers who have joined an Internet panel. In a number of instances, these non-probability samples have produced results that were comparable or, in some cases, more accurate in predicting election outcomes than probability-based surveys. Now, more than ever, journalists and the public need to understand the strengths and weaknesses of both sampling techniques to effectively evaluate the quality of a survey, particularly election polls. Probability and Non-probability Samples In a probability sample, all persons in the target population have a change of being selected for the survey sample and we know what that chance is. For example, in a telephone survey based on random digit dialing (RDD) sampling, researchers know the chance or probability that a particular telephone number will be selected.
    [Show full text]
  • Summary of Human Subjects Protection Issues Related to Large Sample Surveys
    Summary of Human Subjects Protection Issues Related to Large Sample Surveys U.S. Department of Justice Bureau of Justice Statistics Joan E. Sieber June 2001, NCJ 187692 U.S. Department of Justice Office of Justice Programs John Ashcroft Attorney General Bureau of Justice Statistics Lawrence A. Greenfeld Acting Director Report of work performed under a BJS purchase order to Joan E. Sieber, Department of Psychology, California State University at Hayward, Hayward, California 94542, (510) 538-5424, e-mail [email protected]. The author acknowledges the assistance of Caroline Wolf Harlow, BJS Statistician and project monitor. Ellen Goldberg edited the document. Contents of this report do not necessarily reflect the views or policies of the Bureau of Justice Statistics or the Department of Justice. This report and others from the Bureau of Justice Statistics are available through the Internet — http://www.ojp.usdoj.gov/bjs Table of Contents 1. Introduction 2 Limitations of the Common Rule with respect to survey research 2 2. Risks and benefits of participation in sample surveys 5 Standard risk issues, researcher responses, and IRB requirements 5 Long-term consequences 6 Background issues 6 3. Procedures to protect privacy and maintain confidentiality 9 Standard issues and problems 9 Confidentiality assurances and their consequences 21 Emerging issues of privacy and confidentiality 22 4. Other procedures for minimizing risks and promoting benefits 23 Identifying and minimizing risks 23 Identifying and maximizing possible benefits 26 5. Procedures for responding to requests for help or assistance 28 Standard procedures 28 Background considerations 28 A specific recommendation: An experiment within the survey 32 6.
    [Show full text]
  • MRS Guidance on How to Read Opinion Polls
    What are opinion polls? MRS guidance on how to read opinion polls June 2016 1 June 2016 www.mrs.org.uk MRS Guidance Note: How to read opinion polls MRS has produced this Guidance Note to help individuals evaluate, understand and interpret Opinion Polls. This guidance is primarily for non-researchers who commission and/or use opinion polls. Researchers can use this guidance to support their understanding of the reporting rules contained within the MRS Code of Conduct. Opinion Polls – The Essential Points What is an Opinion Poll? An opinion poll is a survey of public opinion obtained by questioning a representative sample of individuals selected from a clearly defined target audience or population. For example, it may be a survey of c. 1,000 UK adults aged 16 years and over. When conducted appropriately, opinion polls can add value to the national debate on topics of interest, including voting intentions. Typically, individuals or organisations commission a research organisation to undertake an opinion poll. The results to an opinion poll are either carried out for private use or for publication. What is sampling? Opinion polls are carried out among a sub-set of a given target audience or population and this sub-set is called a sample. Whilst the number included in a sample may differ, opinion poll samples are typically between c. 1,000 and 2,000 participants. When a sample is selected from a given target audience or population, the possibility of a sampling error is introduced. This is because the demographic profile of the sub-sample selected may not be identical to the profile of the target audience / population.
    [Show full text]
  • Chapter 3: Simple Random Sampling and Systematic Sampling
    Chapter 3: Simple Random Sampling and Systematic Sampling Simple random sampling and systematic sampling provide the foundation for almost all of the more complex sampling designs that are based on probability sampling. They are also usually the easiest designs to implement. These two designs highlight a trade-off inherent in all sampling designs: do we select sample units at random to minimize the risk of introducing biases into the sample or do we select sample units systematically to ensure that sample units are well- distributed throughout the population? Both designs involve selecting n sample units from the N units in the population and can be implemented with or without replacement. Simple Random Sampling When the population of interest is relatively homogeneous then simple random sampling works well, which means it provides estimates that are unbiased and have high precision. When little is known about a population in advance, such as in a pilot study, simple random sampling is a common design choice. Advantages: • Easy to implement • Requires little advance knowledge about the target population Disadvantages: • Imprecise relative to other designs if the population is heterogeneous • More expensive to implement than other designs if entities are clumped and the cost to travel among units is appreciable How it is implemented: • Select n sample units at random from N available in the population All units within the population must have the same probability of being selected, therefore each and every sample of size n drawn from the population has an equal chance of being selected. There are many strategies available for selecting a random sample.
    [Show full text]
  • Intro Surveys for Honors Theses, 2009
    PROGRAM ON SURVEY RESEARCH HARVARD UNIVERSITY ` Introduction to Survey Research ` The Survey Process ` Survey Resources at Harvard ` Sampling, Coverage, and Nonresponse ` Thinking About Modes April 14, 2009 ` Question Wording `Representation ◦The research design make inference to a larger population ◦Various population characteristics are represented in research data the same way there are present in population ◦Example: General population survey to estimate population characteristics Reporting `Realism Research Survey ◦A full picture of subjects emerges And ◦Relationships between multiple variables and multiple ways of looking at Methods the same variables can be studied Theories Analysis ◦Example: A qualitative case study to evaluate the nature of democracy in a small town with community meetings `Randomization ◦Variables which are not important in model are completely randomized ◦Effects of non-randomized variables can be tested ◦Example: Randomized clinical trial to test effectiveness of new cancer drug Integrated Example: Surveys and the Research Process Theories About Things `Good Measures: ◦Questions or measures impact your ability to study concepts ◦Think carefully about the underlying concepts a survey is trying to measure. Do the survey questions do a good job of capturing this? ◦The PSR Tip Sheet on Questionnaire Design contains good ideas on Concepts Population Validity External how to write survey questions. Specification Error ` Measures Sample Good Samples: ◦Samples give you the ability to generalize your findings Survey Errors ◦Think carefully about the population you are trying to generalize Internal Validity Internal your findings to. Does your sample design do a good job of Data Respondents representing these people? ◦The PSR Tip Sheet on Survey Sampling, Coverage, and Nonresponse contains thinks to think about in designing or evaluating a sample.
    [Show full text]