Randomisation, Replication, Response Surfaces and Rosemary

Total Page:16

File Type:pdf, Size:1020Kb

Randomisation, Replication, Response Surfaces and Rosemary Randomisation, Replication, Response Surfaces and Rosemary 1 A.C. Atkinson [email protected] Department of Statistics London School of Economics London WC2A 2AE, UK RAB AND ME ² One joint publication ² \One hundred years of the design of experiments on and o® the pages of Biometrika". Bka 88, 53{97 ² Perhaps this talk should be ² \Randomization, Replication, Response Surfaces and Rose- mary" ² We found that the subject of the \Design of Experiments" pretty much fell into two disjoint halves, corresponding to our own interests ² Here are some examples: AGRICULTURAL FIELD TRIAL ² Objective Compare 113 varieties of wheat ² Experimental Variables 4 replicates of each variety divided between blocks (¯elds, farms) ² Response Variables Yield " Rust resistance " Straw ? Resistance to flattening " PHOTOGRAPHIC INDUSTRY ² Objective Optimise sensitisation conditions for an emulsion (i.e. speed) ² Experimental Variables Sensitisor I, Sensitisor II Dye Reaction Time ² Response Variables Sensitivity " Contrast " Fog # Graininess # CLINICAL TRIALS ² Objective Compare several treatments (drugs, therapies) ² Experimental Variables Treatment allocation - A, B, C, ... Concomitant variables/ prognostic factors 9 History of => patient Measurements on ;> ² Response Variables Physical or chemical measurements \Quality of life" CONSUMER DURABLE: FRIDGE, CAR, CAMERA,.. ² Objective Make a \better" product ² Experimental Variables Conditions of use have also to be included Need \robustness" to abuse TWO KINDS OF EXPERIMENT `Agricultural' experiment: ² Physically distinct units; ² Complicated error structure associated with units, plots and blocks; ² Discrete treatments with no or little structure. Industrial experiments are often also of this type. `Industrial' experiment: ² Unit, for example, just another portion of chemical reagents, with no speci¯c properties; ² Simple error structure; ² Complicated treatment structure - maybe the setting of several con- tinuous variables. FISHERIAN PRINCIPLES `Agricultural' experiment (Fisher was at Rothamsted) ² Replication ² Randomization ² Local control ² Ideas of { Treatment Structure { Block Structure `Industrial' experiment ² Gossett (1917) and later. Days as blocks in laboratory testing ² Box (1974 in BHH and earlier). \Block what you can and randomize what you cannot" DESIGNING AN EXPERIMENT At least three phases: ² (a) choice of treatments; ² (b) choice of experimental units; and ² (c) deciding which treatment to apply to which experimental unit. The relative importance of the three phases depends on the application. Agriculture and medicine: insu±cient experimental units which are all alike, so some sort of blocking must be considered: divide experimental units into groups likely to be homogeneous. In an industrial experiment a block might be a batch of raw material. As soon as there is structure, phase (c) becomes important. In this context the `design' is the function allocating treatments to units. DESIGNING AN EXPERIMENT 2 (c). Allocation of treatments to units. Agricultural Experiment. 113 treatments £ 4 replicates. ² With large blocks replicate 113 treatments at each of four sites. These may be very di®erent - clay, sand, wet, marshy,.... ² Analyse with additive block e®ects ² Blocks are usually too small for complete replication ² Design so that, if possible, each treatment occurs in the same number of blocks - balance ² Allocate treatments at random, subject to design. If design has treat- ments A, B, C, ... and you have treatments 1, 2, 3, ... Randomize mapping from numbers to alphabet. ² Again analyse with additive block e®ects. DESIGNING AN EXPERIMENT 3 (c). Allocation of treatments to units. Industrial Experiment. Treatments are sensitisor, dye, reaction time ² Units are just another batch of chemical ² But there may be technical analytical variables: development condi- tions, time between manufacture and photographic testing, batches of material, shift of operators, ... ² Brien and Bailey (2006) \Multiple Randomizations" ² Should be allowed for in design as extra factors ² Randomize order of experiments if possible to avoid confounding with `lurking' variables COMMON STRUCTURE Concomitant Random variables error ² # z & Model Response y = f(x; z; ¯) + ² y x % 9 > Explanatory/ => Control/ variables > Design ;> ² Experimental Design Choose x ANALYSIS ² Second-order assumptions: yi = f(xi; ¯) + ²i with 2 E(²i) = 0 var(²i) = σ lead to least squares ² Additional normality of ²i gives distribution of estimates and test sta- tistics ² Randomization Analysis { BHH have yields of 11 tomato plants (?) six on treatment A, ¯ve on B. Calculatey ¹A ¡ y¹B for the data. Also for all 11!/5!6! permutations and compare observed value with this reference dis- tribution { Not much used for linear models, but may be for clinical trials. SMITH 1918 A remarkable paper, 35 years ahead of its time. K. Smith. \On the standard deviations of adjusted and interpolated values of an observed polynomial function and its constants and the guid- ance they give towards a proper choice of the distribution of observations". Bka 12, 1{85. ² Polynomials up to order six in one variable ² Design region X : ¡1 · x · 1 ² Additive errors of constant variance, so least squares giving predictions y^(x) ² \G-optimum" designs minimizing maximum, for x 2 X , of varfy^(x)g ² Also considered non-constant variance ² Showed the designs were optimum SMITH, PEARSON AND FISHER ² Kirstine Smith (b. 1878) was a Danish pupil of Hald who worked with K. Pearson ² Her 1916 paper \On the \best" values of the constants in frequency distributions" upset Fisher - he thought parameter estimates shouldn't depend on grouping: ML rather than minimum chi-squared ² Pearson twice refused Fisher's paper \I must keep the little space I have in Biometrika free from controversy" ² Fisher never again (?) published in Biometrika ² In 1924 Smith became a school teacher in Denmark - \because she felt a need to work more closely with people" OPTIMUM EXPERIMENTAL DESIGN 1 ² Uni¯ed Theory providing ² Numerical Methods for design construction and comparison ² Modern theory developed by Kiefer JACK KIEFER ² Kiefer (1959). \Optimum experimental designs (with discussion)" JRSSB ² Proposer of the vote of thanks (Tocher) \I think Dr Kiefer has shown conclusively just how useful mathematics is in the theory of design..." ² No comment from Box! ² Kiefer in reply \I must admit being somewhat disappointed to see that such a large proportion of the comments have been engendered by a careless reading of my paper..." ² David Cox arranged a conference in 1974 at Imperial College at which Kiefer was the main speaker ² Papers published in Biometrika 1975 OPTIMUM EXPERIMENTAL DESIGN 2 Requires: ² A model Many models are linear y = ¯T f(x) + ² f(x) is a p £ 1 vector of powers and products of x The errors ² are independent, with variance σ2 In matrix form E(y) = F ¯ Some models are nonlinear in the parameters y = 1 ¡ e¡θx + ² OPTIMUM EXPERIMENTAL DESIGN 3 Also requires: ² A design region X With two factors x1 and x2 the square ¡1 · x1 · 1; ¡1 · x2 · 1 ² The scaling is for convenience: for temperature could be: ¡1; 20oC ; 1; 40oC Figure 1: A square design region x_2 -1.0 -0.5 0.0 0.5 1.0 -1.0 -0.5 0.0 0.5 1.0 x_1 OPTIMUM EXPERIMENTAL DESIGN 4 An experimental design is a set of n points in X ² Design Measure 8 9 < = x1 x2 : : : xm » = : ; w1 w2 : : : wm ² The design has { Support points x1 : : : xm { design weights w1 : : : wm ² Exact design wi = ri=n; ri # replicates at xi ² Continuous/ Approximate Design » is a measure over the design points. Removes dependence of design on n. Figure 2: A random design. Any good? x_2 -1.0 -0.5 0.0 0.5 1.0 -1.0 -0.5 0.0 0.5 1.0 x_1 INFERENCE ABOUT BETA ² The Least Squares Estimator of ¯ from the design » is ¯^ = (F T WF )¡1F T W Y; where W = diag wi ² The 100(1 ¡ ®)% Con¯dence Region for ¯ is the set of values for which ^ T T ^ 2 (¯ ¡ ¯) F WF (¯ ¡ ¯) · ps Fp;º;1¡® ² Con¯dence regions are ellipsoids ² Volume of con¯dence ellipsoids / jF T WF j¡1=2 THE INFORMATION MATRIX ² The Information Matrix F T WF is often written F T WF = M(») The information matrix depends on the design ² D-optimum designs max jF T WF j = det M(»¤) They minimise the volumes of the con¯dence region ² Other alphabetical criteria maximize other functions of M(») ² Often maximise » numerically to ¯nd »¤ THE PREDICTED RESPONSE ² Prediction at x is y^(x) = ¯^T f(x) ² The variance of prediction is varfy^(x)g = nσ2f T (x)M ¡1(»)f(x) ² The standardised variance is d(x; ») = f T (x)M ¡1(»)f(x) GENERAL EQUIVALENCE THEOREM 1 { Links var(¯^) varfy^(x)g parameters predictions { Provides Algorithm Kiefer and Wolfowitz (1959) (with a check on convergence) GENERAL EQUIVALENCE THEOREM 2 ² De¯nition »¤ maximises det M(») ² Minimax Equivalence »¤ minimises max x 2 X d(x; ») ² max x 2 X d(x; »¤) = p So we can check any purported optimum design ² Maximum is at points of support of »¤ SECOND-ORDER POLYNOMIAL ² Model 2 E(y) = ¯0 + ¯1x + ¯2x ² Design Region X x 2 [¡1; 1] ² Compare Two Designs Remember - designs are standardized for n Figure 3: Two designs. Which is better? Two designs -1.0 -0.5 0.0 0.5 1.0 -1.0 -0.5 0.0 0.5 1.0 x Figure 4: Variances for the two designs Variance for two designs 0 1 2 3 4 5 6 -1.0 -0.5 0.0 0.5 1.0 x COMPARISON OF DESIGNS ² Three-Point Design Clearly D-optimum max x 2 X d(x; ») = 3 and occurs at design points ² As found by Smith ² Six-Point Design Not D-optimum { d(x; ») > 3 at x = ¡1; 1 { < 3 at other 4 points OPTIMUM EXPERIMENTAL DESIGN 5 Dependence on n ² Can always (?)
Recommended publications
  • Understanding Replication of Experiments in Software Engineering: a Classification Omar S
    Understanding replication of experiments in software engineering: A classification Omar S. Gómez a, , Natalia Juristo b,c, Sira Vegas b a Facultad de Matemáticas, Universidad Autónoma de Yucatán, 97119 Mérida, Yucatán, Mexico Facultad de Informática, Universidad Politécnica de Madrid, 28660 Boadilla del Monte, Madrid, Spain c Department of Information Processing Science, University of Oulu, Oulu, Finland abstract Context: Replication plays an important role in experimental disciplines. There are still many uncertain-ties about how to proceed with replications of SE experiments. Should replicators reuse the baseline experiment materials? How much liaison should there be among the original and replicating experiment-ers, if any? What elements of the experimental configuration can be changed for the experiment to be considered a replication rather than a new experiment? Objective: To improve our understanding of SE experiment replication, in this work we propose a classi-fication which is intend to provide experimenters with guidance about what types of replication they can perform. Method: The research approach followed is structured according to the following activities: (1) a litera-ture review of experiment replication in SE and in other disciplines, (2) identification of typical elements that compose an experimental configuration, (3) identification of different replications purposes and (4) development of a classification of experiment replications for SE. Results: We propose a classification of replications which provides experimenters in SE with guidance about what changes can they make in a replication and, based on these, what verification purposes such a replication can serve. The proposed classification helped to accommodate opposing views within a broader framework, it is capable of accounting for less similar replications to more similar ones regarding the baseline experiment.
    [Show full text]
  • Comparison of Response Surface Designs in a Spherical Region
    World Academy of Science, Engineering and Technology International Journal of Mathematical and Computational Sciences Vol:6, No:5, 2012 Comparison of Response Surface Designs in a Spherical Region Boonorm Chomtee, John J. Borkowski Abstract—The objective of the research is to study and compare Where X is the design matrix, p is the number of model response surface designs: Central composite designs (CCD), Box- parameters, N is the design size, σ 2 is the maximum of Behnken designs (BBD), Small composite designs (SCD), Hybrid max designs, and Uniform shell designs (USD) over sets of reduced models f′′(x)( X X )−1 f (x) approximated over the set of candidate when the design is in a spherical region for 3 and 4 design variables. points. The values of the two criteria were calculated using The two optimality criteria ( D and G ) are considered which larger Matlab software [1]. values imply a better design. The comparison of design optimality criteria of the response surface designs across the full second order B. Reduced Models model and sets of reduced models for 3 and 4 factors based on the The set of reduced models is consistent with the definition two criteria are presented. of weak heredity given in Chipman [2]. That is, (i) a quadratic 2 Keywords—design optimality criteria, reduced models, response xi term is in the model only if the xi term is also in the model surface design, spherical design region and (ii) an interaction xi x j term is in the model only if the xi or x or both terms are also in the model.
    [Show full text]
  • Relationships Between Sleep Duration and Adolescent Depression: a Conceptual Replication
    HHS Public Access Author manuscript Author ManuscriptAuthor Manuscript Author Sleep Health Manuscript Author . Author manuscript; Manuscript Author available in PMC 2020 February 25. Published in final edited form as: Sleep Health. 2019 April ; 5(2): 175–179. doi:10.1016/j.sleh.2018.12.003. Relationships between sleep duration and adolescent depression: a conceptual replication AT Berger1, KL Wahlstrom2, R Widome1 1Division of Epidemiology and Community Health, University of Minnesota School of Public Health, 1300 South 2nd St, Suite #300, Minneapolis, MN 55454 2Department of Organizational Leadership, Policy and Development, 210D Burton Hall, College of Education and Human Development, University of Minnesota, Minneapolis, MN 55455 Abstract Objective—Given the growing concern about research reproducibility, we conceptually replicated a previous analysis of the relationships between adolescent sleep and mental well-being, using a new dataset. Methods—We conceptually reproduced an earlier analysis (Sleep Health, June 2017) using baseline data from the START Study. START is a longitudinal research study designed to evaluate a natural experiment in delaying high school start times, examining the impact of sleep duration on weight change in adolescents. In both START and the previous study, school day bedtime, wake up time, and answers to a six-item depression subscale were self-reported using a survey administered during the school day. Logistic regression models were used to compute the association and 95% confidence intervals (CI) between the sleep variables (sleep duration, wake- up time and bed time) and a range of outcomes. Results—In both analyses, greater sleep duration was associated with lower odds (P < 0.0001) of all six indicators of depressive mood.
    [Show full text]
  • Topics in Experimental Design
    Ronald Christensen Professor of Statistics Department of Mathematics and Statistics University of New Mexico Copyright c 2016 Topics in Experimental Design Springer Preface This (non)book assumes that the reader is familiar with the basic ideas of experi- mental design and with linear models. I think most of the applied material should be accessible to people with MS courses in regression and ANOVA but I had no hesi- tation in using linear model theory, if I needed it, to discuss more technical aspects. Over the last several years I’ve been working on revisions to my books Analysis of Variance, Design, and Regression (ANREG); Plane Answers to Complex Ques- tions: The Theory of Linear Models (PA); and Advanced Linear Modeling (ALM). In each of these books there was material that no longer seemed sufficiently relevant to the themes of the book for me to retain. Most of that material related to Exper- imental Design. Additionally, due to Kempthorne’s (1952) book, many years ago I figured out p f designs and wrote a chapter explaining them for the very first edition of ALM, but it was never included in any of my books. I like all of this material and think it is worthwhile, so I have accumulated it here. (I’m not actually all that wild about the recovery of interblock information in BIBs.) A few years ago my friend Chris Nachtsheim came to Albuquerque and gave a wonderful talk on Definitive Screening Designs. Chapter 5 was inspired by that talk along with my longstanding desire to figure out what was going on with Placett- Burman designs.
    [Show full text]
  • Lecture 19: Causal Association (Kanchanaraksa)
    This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this site. Copyright 2008, The Johns Hopkins University and Sukon Kanchanaraksa. All rights reserved. Use of these materials permitted only in accordance with license rights granted. Materials provided “AS IS”; no representations or warranties provided. User assumes all responsibility for use, and all liability related thereto, and must independently review all materials for accuracy and efficacy. May contain materials owned by others. User is responsible for obtaining permissions for use from third parties as needed. Causal Association Sukon Kanchanaraksa, PhD Johns Hopkins University From Association to Causation The following conditions have been met: − The study has an adequate sample size − The study is free of bias Adjustment for possible confounders has been done − There is an association between exposure of interest and the disease outcome Is the association causal? 3 Henle-Koch's Postulates (1884 and 1890) To establish a causal relationship between a parasite and a disease, all four must be fulfilled: 1. The organism must be found in all animals suffering from the disease—but not in healthy animals 2. The organism must be isolated from a diseased animal and grown in pure culture 3. The cultured organism should cause disease when introduced into a healthy animals 4. The organism must be re-isolated from the experimentally infected animal — Wikipedia Picture is from http://en.wikipedia.org/wiki/Robert_Koch 4 1964 Surgeon General’s Report 5 What Is a Cause? “The characterization of the assessment called for a specific term.
    [Show full text]
  • Blinding Analyses in Experimental Psychology
    Flexible Yet Fair: Blinding Analyses in Experimental Psychology Gilles Dutilh1, Alexandra Sarafoglou2, and Eric–Jan Wagenmakers2 1 University of Basel Hospital 2 University of Amsterdam Correspondence concerning this article should be addressed to: Eric-Jan Wagenmakers University of Amsterdam Department of Psychology, Psychological Methods Group Nieuwe Achtergracht 129B 1018XE Amsterdam, The Netherlands E–mail may be sent to [email protected]. Abstract The replicability of findings in experimental psychology can be im- proved by distinguishing sharply between hypothesis-generating research and hypothesis-testing research. This distinction can be achieved by preregistra- tion, a method that has recently attracted widespread attention. Although preregistration is fair in the sense that it inoculates researchers against hind- sight bias and confirmation bias, preregistration does not allow researchers to analyze the data flexibly without the analysis being demoted to exploratory. To alleviate this concern we discuss how researchers may conduct blinded analyses (MacCoun & Perlmutter, 2015). As with preregistration, blinded analyses break the feedback loop between the analysis plan and analysis out- come, thereby preventing cherry-picking and significance seeking. However, blinded analyses retain the flexibility to account for unexpected peculiarities in the data. We discuss different methods of blinding, offer recommenda- tions for blinding of popular experimental designs, and introduce the design for an online blinding protocol. This research was supported by a talent grant from the Netherlands Organisation for Scientific Research (NWO) to AS (406-17-568). BLINDING IN EXPERIMENTAL PSYCHOLOGY 2 When extensive series of observations have to be made, as in astronomical, meteorological, or magnetical observatories, trigonometrical surveys, and extensive chemical or physical researches, it is an advantage that the numerical work should be executed by assistants who are not interested in, and are perhaps unaware of, the expected results.
    [Show full text]
  • Section 4.3 Good and Poor Ways to Experiment
    Chapter 4 Gathering Data Section 4.3 Good and Poor Ways to Experiment Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Elements of an Experiment Experimental units: The subjects of an experiment; the entities that we measure in an experiment. Treatment: A specific experimental condition imposed on the subjects of the study; the treatments correspond to assigned values of the explanatory variable. Explanatory variable: Defines the groups to be compared with respect to values on the response variable. Response variable: The outcome measured on the subjects to reveal the effect of the treatment(s). 3 Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Experiments ▪ An experiment deliberately imposes treatments on the experimental units in order to observe their responses. ▪ The goal of an experiment is to compare the effect of the treatment on the response. ▪ Experiments that are randomized occur when the subjects are randomly assigned to the treatments; randomization helps to eliminate the effects of lurking variables. 4 Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 3 Components of a Good Experiment 1. Control/Comparison Group: allows the researcher to analyze the effectiveness of the primary treatment. 2. Randomization: eliminates possible researcher bias, balances the comparison groups on known as well as on lurking variables. 3. Replication: allows us to attribute observed effects to the treatments rather than ordinary variability. 5 Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Component 1: Control or Comparison Group ▪ A placebo is a dummy treatment, i.e. sugar pill. Many subjects respond favorably to any treatment, even a placebo. ▪ A control group typically receives a placebo.
    [Show full text]
  • Partial Confounding Balanced and Unbalanced Partially Confounded Design Example 1
    Chapter 10 Partial Confounding The objective of confounding is to mix the less important treatment combinations with the block effect differences so that higher accuracy can be provided to the other important treatment comparisons. When such mixing of treatment contrasts and block differences is done in all the replicates, then it is termed as total confounding. On the other hand, when the treatment contrast is not confounded in all the replicates but only in some of the replicates, then it is said to be partially confounded with the blocks. It is also possible that one treatment combination is confounded in some of the replicates and another treatment combination is confounded in other replicates which are different from the earlier replicates. So the treatment combinations confounded in some of the replicates and unconfounded in other replicates. So the treatment combinations are said to be partially confounded. The partially confounded contrasts are estimated only from those replicates where it is not confounded. Since the variance of the contrast estimator is inversely proportional to the number of replicates in which they are estimable, so some factors on which information is available from all the replicates are more accurately determined. Balanced and unbalanced partially confounded design If all the effects of a certain order are confounded with incomplete block differences in equal number of replicates in a design, then the design is said to be balanced partially confounded design. If all the effects of a certain order are confounded an unequal number of times in a design, then the design is said to be unbalanced partially confounded design.
    [Show full text]
  • Types of Confounding
    Confounding What is confounding? Why we need it? Advantage and Disadvantage. How to reduce block size? Types of Confounding. Examples. ANOVA TABLE Concept of Confounding. In a factorial experiment we have n factors each are at s levels. If the factors and level of the factors are increased, then total no. of treatment combinations will also increase which we call the effect of heterogeneity that is the variability among the treatment combination will increase and consequently the errors variance will increase. Because of this result the estimate will be least precise estimate. So one is interested to remove the variability among the treatment combination. which is only possible if the size of the block will reduce to make homogeneity in the block. This concept of reducing the block size is call confounding. Who has developed confounding theory? • R.A. Fisher • Yates Definition Confounding of factorial experiment is defined as reduction of block size in such a way that one block is divided into two or more blocks such that treatment comparison of that main or interaction effect is mixed up with block effect. Main or Interaction effect which is mixed up with block effect are called Confounded effect and this experiment is called Confounding experiment. Block Block-1 Block-2 n0p0 n1p0 n0p0 n1p0 n1p1 n0p1 n0p1 n1p1+n1p0 n0p1+n0p0 n1p1 • Block Effect :- (n1p1+n1p0 ) – (n0p1+n0p0 ) • Treatment Effect.:- =Main effect N = (n1-n0) (p1+p0) =n1p1 +n1p0-n0p1-n0p0 =(n1p1+n1p0 ) – (n0p1+n0p0 ) Thus, Treatment comparison – Block effect =0 Here, We can say that main effect N is mixed up with block effect .
    [Show full text]
  • Experimental Design
    A few remarks on Experimental Design Simon Anders • How many replicates do I need? • What is a suitable replicate? • What effects can I hope to see? A preliminary: the word “sample” Statistician: “Our sample comprises 20 male adult subjects chosen at random from the patient population.” Biologist: “For our experiment, we used blood samples from 20 patients chosen at random.” An observed effect is called statistically significant. What does this mean? p < 0.05 ?? If the experiment were repeated with new, independently obtained samples, the effect would likely be observed again. If the experiment were repeated with new, independently obtained samples, the effect would likely be observed again. When can we claim such a thing? Only if we tried more than once! Replication How many replicates? What is the average height of a man in Germany? each dot: height of one subject each dot: height of one subject What is the average height of a man in Germany? each dot: average of n subjects standard deviation of observations standard error of mean = sample size Purposes of replication: 1. More replicates allow for more precise estimates of effect sizes. each dot: height of one subject What is the average height of a man in Germany? each dot: average of n subjects each dot: standard deviation, estimated from n subjects standard deviation of observations standard error of mean = sample size Purposes of replication: 1. More replicates allow for more precise estimates of effect sizes. 2. Replicates allow to estimate how precise our effect-size estimates are. Two Martian scientists, Dr Nullor and Dr Alton, ask: Is body height correlated with hair colour in human males within a city’s population? Dr Nullor’s UFO observes Helsinki: µ(dark) = µ(blond) = 180 cm.
    [Show full text]
  • Agricultural Statistics, Why Replication and Randomization Are Important Authors: Robert Mullen
    4/26/2016 C.O.R.N. Newsletter 2005­40 | Agronomic Crops Network Agricultural Statistics, Why Replication and Randomization are Important Authors: Robert Mullen In CORN Newsletter 2005­39 (http://corn.osu.edu/story.php?issueID=117&layout=1&storyID=657), we had a lengthy discussion as to why replication and randomization are necessary for statistical analysis. From a theoretical perspective the article was important, but from an applied perspective why do we care? Today we will discuss the potential influence of factors other than those we are trying to evaluate on experimental outcomes, and how proper statistical design can ensure the conclusions reached are correct. Replication Assume you want to evaluate a fungicide treatment on your farm, so you split a field in two and apply the treatment to one half and leave the other half untreated. At the end of the year you harvest each of the two halves and observe a 3 bushel per acre increase in yield on the treated side. This 3 bushel per acre difference seems like a good deal, so you decide that next year all of your acres will be treated with this new fungicide. Are you sure that the additional 3 bushels per acre was due to the application of the fungicide? Closer inspection of the field reveals that the half of the field that showed the yield response was dominated by a lighter texture soil that drained better than the other half of the field. Due to excessive moisture (this is not necessarily this past year) the half of the field with better drainage might be expected to perform better.
    [Show full text]
  • Aspects of Design for Repeated Measurements R
    17 ASPECTS OF DESIGN FOR REPEATED MEASUREMENTS R. MEAD The design of trials and investigations is extremely important and includes the choices of experimental or observational units, the form and timing of measurements, the choice of applied or environmental treat­ ments and the interrelationship of these three components. It became obvious later during the workshop that many difficulties of analysis could be attributed to design deficiencies (in a broad sense). The general principles of design2 applicable to all investigations are: (a) efficient use of resources (b) asking many questions (c) reducing a 2 • The consequent statistical principles are: (i) Using appropriate amounts and forms of replication. (ii) Using random allocation of treatments to units within stated de­ sign restriction to provide a valid basis for inferences and for esti­ mating a 2 • (iii) Using small "blocks" to control random variation. (iv) Using factorial structure with the implied priority ordering of ef­ fe,cts. (v) Using the minimum necessary number of levels of quantitative factors. The aspects discussed in this paper are (i), (iii) and (v). 1. REPLICATION AND RESOURCES Replication for comparative mean values must provide adequate power; for random variance a 2 and a difference that should be detected, 18 d, this implies a replication, n, where Note that n may include implicit, as well as explicit, replication, derived from treatment structure or regression. For a given replication level there will be a total amount of re­ sources, represented by the total degrees of freedom (N -1) in an anal­ ysis of variance. These resources are used in three ways: (i) Asking treatment questions (ii) Estimating 0'2 (iii) Controlling variation.
    [Show full text]