Chapter 3 Randomization, Probability and Sampling Distributions

Total Page:16

File Type:pdf, Size:1020Kb

Chapter 3 Randomization, Probability and Sampling Distributions Chapter 3 Randomization, Probability and Sampling Distributions 3.1 Assignments and Randomization Recall that Dawn’s study of her cat Bob was presented in Chapter 1. Table 3.1 presents her data. Reading from this table, we see that in Dawn’s study, the chicken-flavored treats were presented to Bob on days (trials): 1, 5, 7, 8, 9, 11, 13, 15, 16 and 18. Why did she choose these days? How did she choose these days? It will be easier to begin with the ‘How’ question. We have been looking at the data collected by Dawn. We have listed the observations; separated them by treatment; sorted them within treatment; and, within treatments, drawn dot plots and computed means, medians, variances and standard deviations. But now we need to get into our time machine and travel back in time to before Dawn collected her data. We go back to when Dawn had her study largely planned: treatments selected; trials defined; response specified; and the decision to have a balanced study with 20 trials. We are at the point where Dawn pondered, “Which 10 trials should have chicken-flavored treats assigned to them? How should I decide?” Table 3.1: Dawn’s data on Bob’s consumption of cat treats. ‘C’ [‘T’] is for chicken [tuna] flavored. Day: 12345678910 Flavor: CTTTCTCCCT NumberConsumed: 4 3 5 0 5 4 5 6 1 7 Day: 11 12 13 14 15 16 17 18 19 20 Flavor: CTCTCCTCTT NumberConsumed: 6 3 7 1 3 6 3 8 1 2 55 The answer is that Dawn did this by using a process called randomization. I will explain what randomization is by showing you three equivalent ways to randomize. First, some terminology. We call the list of 10 trials above an assignment of treatments to trials. It tells us which trials were assigned to the first treatment (chicken). It also implies which trials were assigned to the second treatment (tuna); namely, all of the trials not listed above. If we are going to study assignments—and we are—it is easier if we make our assignments as simple to display as possible. Thus, an assignment will be presented by listing the trials that it assigns to treatment 1. A natural question is: How many different assignments were possible for Dawn’s study? The answer is 184,756. I will give a brief digression into how I obtained this number. You might recall from math the expression m!, which is read em-factorial. If m is a positive integer, then this expression is defined as: m!= m(m − 1)(m − 2) ··· 1 (3.1) Thus, for example, 1! = 1;2! = 2(1) = 2;3! = 3(2)(1)= 6; and so on. By special definition (which will allow us to write more easily certain formulas that will arise later in these notes), 0!= 1. Finally, for any other value of m (negatives, non-integers), the expression m! is not defined. We have the following result. You don’t need to worry about proving it; it is a given in these notes. Result 3.1 (The number of possible assignments.) For a total of n = n1 + n2 units, the number of possible assignments of two treatments to the units, with n1 units assigned to treatment 1 and the remaining n2 units assigned to treatment 2, is n! (3.2) n1!n2! I will evaluate Equation 3.2 for three of the studies presented in Chapters 1 and 2. • For Cathy’s study, n =6 and n1 = n2 =3. Thus, the number of possible assignments is 6! 6(5)(4) = = 20. 3!3! 3(2)(1) Notice that it is always possible to reduce the amount of arithmetic we do by canceling some terms in the numerator and denominator. In particular, the 6! in the numerator can be written as 6(5)(4)3! and its 3! cancels a 3! in the denominator. 56 • For Kymn’s study, n = 10 and n1 = n2 =5. Thus, the number of possible assignments is 10! 10(9)(8)(7)(6) = = 252. 5!5! 5(4)(3)(2)(1) • For Dawn’s study, n = 20 and n1 = n2 = 10. Thus, the number of possible assignments is 20! = 184,756. 10!10! Notice that for Cathy’s and Kymn’s study, I determined the answer by hand because the numbers are small enough to handle easily. Dawn’s study is trickier. Many of you, perhaps most, perhaps all, will consider it easy to determine the answer: 184,756. But I will not require you to do so. As a guide, I will never have you evaluate m! for any m> 10. Sara’s study is a real challenge. The number of possible assignments is 80! . 40!40! This answer, to four significant digits, is 1.075 × 1023. Don’t worry about how I obtained this answer. If this issue, however, keeps you awake at night, then send me an email and I will tell you. If enough people email me, then I will put a brief explanation in the next version of these Course Notes. I now will describe three ways—two physical and one electronic—that Dawn could have per- formed her randomization. 1. A box with 20 cards. Take 20 cards of the same size, shape, texture, etc. and number them 1, 2, ..., 20, with one number to each card. Place the cards in a box; mix the cards thoroughly and select 10 cards at random without replacement. The numbers on the cards selected denoted the trials that will be assigned treatment 1. 2. A deck of 20 ordinary playing cards. (This method is especially suited for units that are trials.) We need to have 10 black cards (spades or clubs) and 10 red cards (diamonds or hearts). We don’t care about the rank (ace, king, 3, etc.) of the cards. The cards are thoroughly shuffled and placed in a pile, face down. Before each trial select the top card from the pile; if it is a black card, then treatment 1 is assigned to the trial; if it is a red card, then treatment 2 is assigned to the trial. The selected card is set aside and the above process is repeated for the remaining trials. 3. Using a website. This method will be explained in Section 3.5 later in this chapter. Are you familiar with the term black box? I like the definition in Wikipedia http://en.wikipedia.org/wiki/Black_box 57 which is: In science and engineering, a black box is a device, system or object which can be viewed solely in terms of its input, output and transfer characteristics without any knowledge of its internal workings, that is, its implementation is ”opaque” (black). Almost anything might be referred to as a black box: a transistor, an algorithm, or the human mind. Our website for randomization is a black box. It executes a computer program that supposedly is mathematically equivalent to my two methods of randomization that involve using cards. I say it’s a black box because we aren’t really interested in the computer code of the program. Thus, if you want to think about how randomization works, I recommend you think of the cards. For my purposes of instruction the most convenient method for me is to use the website to obtain examples for you. If I were to replicate Dawn’s study on my cat Buddy I would use the second card method above. But that’s me. If you perform a project for this class, you may randomize however you please, as long as you use one of the three methods above. Before I get back to Dawn’s study, I want to deal with an extremely common misconception about randomization. Randomization is a process or a method that is fair in the sense that every possible assignment has the same chance of being selected. Randomization does not guaran- tee that the assignment it yields will look random. (Among the many issues involved is how one decides what it means for an assignment to look random.) Later in these Course Notes, we will discuss designs that involve restricted randomization; in particular, we will learn about the Randomized Pairs Design. I used the website and obtained the following assignment for Dawn’s study: 1, 2, 4, 7, 9, 10, 11, 14, 15, 18. Note that this assignment is different from the one that Dawn used. Given that there are 184,756 possible assignments I would have been very surprised if I had obtained the same assignment as Dawn! Here is the commonality of our three methods of randomization: Before we select an assign- ment, all 184,756 possible assignments for Dawn’s study are equally likely to be selected. For the first method of randomizing, this fact is conveyed by saying that the cards are indistinguishable; they are thoroughly mixed; and 10 cards are selected at random. For the second method of ran- domizing, this fact is conveyed by saying that the cards are shuffled thoroughly. Finally, whereas the electronic methods’ operations are a total mystery to us, the programmer claims that it makes all assignments equally likely. In this class we will use the website randomizer for a variety of purposes (not just randomization) and we will accept without question its claim of making every assignment equally likely. The most important thing to remember about probability is that it is always about a future uncertainty. Now, back to Dawn’s study of Bob. Remember, we are at a place in time before Dawn selected an assignment.
Recommended publications
  • Sampling Methods It’S Impractical to Poll an Entire Population—Say, All 145 Million Registered Voters in the United States
    Sampling Methods It’s impractical to poll an entire population—say, all 145 million registered voters in the United States. That is why pollsters select a sample of individuals that represents the whole population. Understanding how respondents come to be selected to be in a poll is a big step toward determining how well their views and opinions mirror those of the voting population. To sample individuals, polling organizations can choose from a wide variety of options. Pollsters generally divide them into two types: those that are based on probability sampling methods and those based on non-probability sampling techniques. For more than five decades probability sampling was the standard method for polls. But in recent years, as fewer people respond to polls and the costs of polls have gone up, researchers have turned to non-probability based sampling methods. For example, they may collect data on-line from volunteers who have joined an Internet panel. In a number of instances, these non-probability samples have produced results that were comparable or, in some cases, more accurate in predicting election outcomes than probability-based surveys. Now, more than ever, journalists and the public need to understand the strengths and weaknesses of both sampling techniques to effectively evaluate the quality of a survey, particularly election polls. Probability and Non-probability Samples In a probability sample, all persons in the target population have a change of being selected for the survey sample and we know what that chance is. For example, in a telephone survey based on random digit dialing (RDD) sampling, researchers know the chance or probability that a particular telephone number will be selected.
    [Show full text]
  • MRS Guidance on How to Read Opinion Polls
    What are opinion polls? MRS guidance on how to read opinion polls June 2016 1 June 2016 www.mrs.org.uk MRS Guidance Note: How to read opinion polls MRS has produced this Guidance Note to help individuals evaluate, understand and interpret Opinion Polls. This guidance is primarily for non-researchers who commission and/or use opinion polls. Researchers can use this guidance to support their understanding of the reporting rules contained within the MRS Code of Conduct. Opinion Polls – The Essential Points What is an Opinion Poll? An opinion poll is a survey of public opinion obtained by questioning a representative sample of individuals selected from a clearly defined target audience or population. For example, it may be a survey of c. 1,000 UK adults aged 16 years and over. When conducted appropriately, opinion polls can add value to the national debate on topics of interest, including voting intentions. Typically, individuals or organisations commission a research organisation to undertake an opinion poll. The results to an opinion poll are either carried out for private use or for publication. What is sampling? Opinion polls are carried out among a sub-set of a given target audience or population and this sub-set is called a sample. Whilst the number included in a sample may differ, opinion poll samples are typically between c. 1,000 and 2,000 participants. When a sample is selected from a given target audience or population, the possibility of a sampling error is introduced. This is because the demographic profile of the sub-sample selected may not be identical to the profile of the target audience / population.
    [Show full text]
  • Categorical Data Analysis
    Categorical Data Analysis Related topics/headings: Categorical data analysis; or, Nonparametric statistics; or, chi-square tests for the analysis of categorical data. OVERVIEW For our hypothesis testing so far, we have been using parametric statistical methods. Parametric methods (1) assume some knowledge about the characteristics of the parent population (e.g. normality) (2) require measurement equivalent to at least an interval scale (calculating a mean or a variance makes no sense otherwise). Frequently, however, there are research problems in which one wants to make direct inferences about two or more distributions, either by asking if a population distribution has some particular specifiable form, or by asking if two or more population distributions are identical. These questions occur most often when variables are qualitative in nature, making it impossible to carry out the usual inferences in terms of means or variances. For such problems, we use nonparametric methods. Nonparametric methods (1) do not depend on any assumptions about the parameters of the parent population (2) generally assume data are only measured at the nominal or ordinal level. There are two common types of hypothesis-testing problems that are addressed with nonparametric methods: (1) How well does a sample distribution correspond with a hypothetical population distribution? As you might guess, the best evidence one has about a population distribution is the sample distribution. The greater the discrepancy between the sample and theoretical distributions, the more we question the “goodness” of the theory. EX: Suppose we wanted to see whether the distribution of educational achievement had changed over the last 25 years. We might take as our null hypothesis that the distribution of educational achievement had not changed, and see how well our modern-day sample supported that theory.
    [Show full text]
  • Lecture 8: Sampling Methods
    Lecture 8: Sampling Methods Donglei Du ([email protected]) Faculty of Business Administration, University of New Brunswick, NB Canada Fredericton E3B 9Y2 Donglei Du (UNB) ADM 2623: Business Statistics 1 / 30 Table of contents 1 Sampling Methods Why Sampling Probability vs non-probability sampling methods Sampling with replacement vs without replacement Random Sampling Methods 2 Simple random sampling with and without replacement Simple random sampling without replacement Simple random sampling with replacement 3 Sampling error vs non-sampling error 4 Sampling distribution of sample statistic Histogram of the sample mean under SRR 5 Distribution of the sample mean under SRR: The central limit theorem Donglei Du (UNB) ADM 2623: Business Statistics 2 / 30 Layout 1 Sampling Methods Why Sampling Probability vs non-probability sampling methods Sampling with replacement vs without replacement Random Sampling Methods 2 Simple random sampling with and without replacement Simple random sampling without replacement Simple random sampling with replacement 3 Sampling error vs non-sampling error 4 Sampling distribution of sample statistic Histogram of the sample mean under SRR 5 Distribution of the sample mean under SRR: The central limit theorem Donglei Du (UNB) ADM 2623: Business Statistics 3 / 30 Why sampling? The physical impossibility of checking all items in the population, and, also, it would be too time-consuming The studying of all the items in a population would not be cost effective The sample results are usually adequate The destructive nature of certain tests Donglei Du (UNB) ADM 2623: Business Statistics 4 / 30 Sampling Methods Probability Sampling: Each data unit in the population has a known likelihood of being included in the sample.
    [Show full text]
  • 7.2 Sampling Plans and Experimental Design
    7.2 Sampling Plans and Experimental Design Statistics 1, Fall 2008 Inferential Statistics • Goal: Make inferences about the population based on data from a sample • Examples – Estimate population mean, μ or σ, based on data from the sample Methods of Sampling • I’ll cover five common sampling methods, more exist – Simple random sampling – Stratified random sampling – Cluster sampling – Systematic 1‐in‐k sampling – Convenience sampling Simple Random Sampling • Each sample of size n from a population of size N has an equal chance of being selected • Implies that each subject has an equal chance of being included in the sample • Example: Select a random sample of size 5 from this class. Computer Generated SRS • Obtain a list of the population (sampling frame) • Use Excel – Generate random number for each subject using rand() function – copy|paste special|values to fix random numbers – sort on random number, sample is the first n in this list • Use R (R is a free statistical software package) – sample(1:N,n), N=population size, n=sample size – Returns n randomly selected digits between 1 and N – default is sampling WOR Stratified Random Sampling • The population is divided into subgroups, or strata • A SRS is selected from each strata • Ensures representation from each strata, no guarantee with SRS Stratified Random Sampling • Example: Stratify the class by gender, randomly selected 3 female and 3 male students • Example: Voter poll –stratify the nation by state, randomly selected 100 voters in each state Cluster Sampling • The subjects in
    [Show full text]
  • STANDARDS and GUIDELINES for STATISTICAL SURVEYS September 2006
    OFFICE OF MANAGEMENT AND BUDGET STANDARDS AND GUIDELINES FOR STATISTICAL SURVEYS September 2006 Table of Contents LIST OF STANDARDS FOR STATISTICAL SURVEYS ....................................................... i INTRODUCTION......................................................................................................................... 1 SECTION 1 DEVELOPMENT OF CONCEPTS, METHODS, AND DESIGN .................. 5 Section 1.1 Survey Planning..................................................................................................... 5 Section 1.2 Survey Design........................................................................................................ 7 Section 1.3 Survey Response Rates.......................................................................................... 8 Section 1.4 Pretesting Survey Systems..................................................................................... 9 SECTION 2 COLLECTION OF DATA................................................................................... 9 Section 2.1 Developing Sampling Frames................................................................................ 9 Section 2.2 Required Notifications to Potential Survey Respondents.................................... 10 Section 2.3 Data Collection Methodology.............................................................................. 11 SECTION 3 PROCESSING AND EDITING OF DATA...................................................... 13 Section 3.1 Data Editing ........................................................................................................
    [Show full text]
  • Ch7 Sampling Techniques
    7 - 1 Chapter 7. Sampling Techniques Introduction to Sampling Distinguishing Between a Sample and a Population Simple Random Sampling Step 1. Defining the Population Step 2. Constructing a List Step 3. Drawing the Sample Step 4. Contacting Members of the Sample Stratified Random Sampling Convenience Sampling Quota Sampling Thinking Critically About Everyday Information Sample Size Sampling Error Evaluating Information From Samples Case Analysis General Summary Detailed Summary Key Terms Review Questions/Exercises 7 - 2 Introduction to Sampling The way in which we select a sample of individuals to be research participants is critical. How we select participants (random sampling) will determine the population to which we may generalize our research findings. The procedure that we use for assigning participants to different treatment conditions (random assignment) will determine whether bias exists in our treatment groups (Are the groups equal on all known and unknown factors?). We address random sampling in this chapter; we will address random assignment later in the book. If we do a poor job at the sampling stage of the research process, the integrity of the entire project is at risk. If we are interested in the effect of TV violence on children, which children are we going to observe? Where do they come from? How many? How will they be selected? These are important questions. Each of the sampling techniques described in this chapter has advantages and disadvantages. Distinguishing Between a Sample and a Population Before describing sampling procedures, we need to define a few key terms. The term population means all members that meet a set of specifications or a specified criterion.
    [Show full text]
  • Statistical Sampling: an Overview for Criminal Justice Researchers April 28, 2016 Stan Orchowsky
    Statistical Sampling: An Overview for Criminal Justice Researchers April 28, 2016 Stan Orchowsky: Good afternoon everyone. My name is Stan Orchowsky, I'm the research director for the Justice Research and Statistics Association. It's my pleasure to welcome you this afternoon to the next in our Training and Technical Assistance Webinar Series on Statistical Analysis for Criminal Justice Research. We want to thank the Bureau of Justice Statistics, which is helping to support this webinar series. Stan Orchowsky: Today's topic will be on statistical sampling, we will be providing an overview for criminal justice research. And I'm going to turn this over now to Dr. Farley, who's going to give you the objectives for the webinar. Erin Farley: Greetings everyone. Erin Farley: Okay, so the webinar objectives. Sampling is a very important and useful tool in social science criminal justice research, and that is because in many, if not most, situations accessing an entire population to conduct research is not feasible. Oftentimes it would be too time-consuming and expensive. As a result, the use of sampling methodology, if utilized properly, can save time, money, and produce sample estimates that can be utilized to make inferences about the larger population. Erin Farley: The objectives of this webinar include: describing the different types of probability and non-probability sampling designs; discussing the strengths and weaknesses of these techniques, as well as discussing the importance of sampling to the external validity of experimental designs and statistical analysis. Erin Farley: This slide presents a simple diagram illustrating the process and goal of sampling.
    [Show full text]
  • SAMPLING When Collecting Data from an Experiment Or a General
    SAMPLING When collecting data from an experiment or a general population, it is usually impossible to measure every individual in the population so a subsample must be taken of a portion of the individuals in the population. The manner in which the sample is taken and the number of individuals included in the sample affect the adequacy of the sample. An appropriate sample is one that measures as closely as possible the value that would be obtained if all individuals in the population were measured. The difference between the sample value and the population value constitutes the sampling error. Thus, a good sampling procedure must give a small sampling error. The sampling unit is the unit on which the actual measurement of a character is made. In replicated experiments, where the plot is the population, the sampling unit is smaller than the plot. Examples of sampling units are a plant, a leaf, a group of plants (hill, row), a specified area, a farm, a village, a farmer, etc. The characteristics of a good sampling unit are: (1) it is easy to identify (2) it is easy to measure (3) it has high precision and low cost The sampling unit that has these characteristics may vary with the crop, the character to be measured, and the cultural practice used. For example, in counting tiller number in transplanted rice, a plant or hill may be appropriate, but in broadcast rice, a hill is not easily identifiable as a unit and there is large variation from plant to plant due to nonuniform spacing.
    [Show full text]
  • Use of Sampling in the Census Technical Session 5.2
    Regional Workshop on the Operational Guidelines of the WCA 2020 Dar es Salaam, Tanzania 23-27 March 2020 Use of sampling in the census Technical Session 5.2 Eloi OUEDRAOGO Statistician, Agricultural Census Team FAO Statistics Division (ESS) 1 CONTENTS Complete enumeration censuses versus sample-based censuses Uses of sampling at other census stages Sample designs based on list frames Sample designs based on area frames Sample designs based on multiple frames Choice of sample design Country examples 2 Background As mentioned in the WCA 2020, Vol. 1 (Chapter 4, paragraph 4.34), when deciding whether to conduct a census by complete or sample enumeration, in addition to efficiency considerations (precision versus costs), one should take into account: desired level of aggregation for census data; use of the census as a frame for ongoing sampling surveys; data content of the census; and capacity to deal with sampling methods and subsequent statistical analysis based on samples. 3 Complete enumeration versus sample enumeration census Complete enumeration Sample enumeration Advantages 1. Reliable census results for the 1. Is generally less costly that a smallest administrative and complete enumeration geographic units and on rare events 2. Contributes to decrease the overall (such as crops/livestock types) response burden 2. Provides a reliable frame for the organization of subsequent regular 3. Requires a smaller number of infra-annual and annual sample enumerators and supervisors than a surveys. In terms of frames, it is census conducted by complete much less demanding in respect of enumeration. Consequently, the the holdings’ characteristics non-sampling errors can be 3.
    [Show full text]
  • “Methods of Measuring Public Opinion”
    PUBLIC OPINION IN THE 21ST CENTURY DRAFT: NOT FOR QUOTATION OR CITATION Russell G. Brooker, Alverno College Todd Schaefer, Central Washington University “Methods of Measuring Public Opinion” “Truth reveals itself in degrees, and we can progress from an incomplete to a more and ever more complete comprehension of truth. Truth is not a thing, not an object that we either have in entirety or have not at all.” - Goethe The formal academic study of public opinion is relatively new but the practical study of public opinion is not new at all. Governments have paid attention to public opinion as long as there have been governments. Even the most oppressive tyrants need to know what the people are thinking, even if just to oppress them more effectively. As the famous investigator of public opinion, V.O. Key, Jr. said, “Governments must concern themselves with the opinions of their citizens, if only to provide a basis for repression of disaffection.” (1961, 3) One of the earliest expressions of public opinion was rebellion. Peasant rebellions have occurred throughout history. When the king saw his subjects in open rebellion, it was a pretty clear sign that the public’s support for his government was eroding. Unpaid taxes was another clue; when rulers saw their tax receipts dwindle and heard reports of tax collectors being killed, they knew that public opinion was turning against them. For centuries rulers have established secret police forces to find out which people oppose the government and to eliminate them. Secret police have, among other things, acted as monitors of public opinion.
    [Show full text]
  • An Introduction to Randomization*
    An Introduction to Randomization* TAF – CEGA Impact Evaluation Workshop Day 1 * http://ocw.mit.edu (J-PAL Impact Evaluation Workshop materials) Outline Background What is randomized evaluation? Advantages and limitations of experiments Conclusions An example: The “Vote 2002” campaign Arceneaux, Gerber and Green (2006) Intervention: get-out-the-vote phone calls to increase voter turnout in Iowa and Michigan, 2002 midterm elections Treatment group = 60,000 individuals (35,000 actually reached by phone) Control group = >2,000,000 individuals Main outcome: turnout (did the individual vote?) Effect sizes using experimental v. non-experimental methods How to measure impact? What would have happened in the absence of the intervention program? { Since the counterfactual is not observable, key goal of all impact evaluation methods is to construct of “mimic” the counterfactual Constructing the counterfactual Counterfactual is often constructed by selecting a group not affected by the program Randomized: { Use random assignment of the program to create a control group which mimics the counterfactual. Non-randomized: { Argue that a certain excluded group mimics the counterfactual. Validity A tool to assess credibility of a study Internal validity { Relates to ability to draw causal inference, i.e. can we attribute our impact estimates to the program and not to something else External validity { Relates to ability to generalize to other settings of interest, i.e. can we generalize our impact estimates from this program to other populations,
    [Show full text]