Harmonizing Fully Optimal Designs with Classic Randomization in Fixed Trial Experiments Arxiv:1810.08389V1 [Stat.ME] 19 Oct 20
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
How Many Stratification Factors Is Too Many" to Use in a Randomization
How many Strati cation Factors is \Too Many" to Use in a Randomization Plan? Terry M. Therneau, PhD Abstract The issue of strati cation and its role in patient assignment has gen- erated much discussion, mostly fo cused on its imp ortance to a study [1,2] or lack thereof [3,4]. This rep ort fo cuses on a much narrower prob- lem: assuming that strati ed assignment is desired, how many factors can b e accomo dated? This is investigated for two metho ds of balanced patient assignment, the rst is based on the minimization metho d of Taves [5] and the second on the commonly used metho d of strati ed assignment [6,7]. Simulation results show that the former metho d can accomo date a large number of factors 10-20 without diculty, but that the latter b egins to fail if the total numb er of distinct combinations of factor levels is greater than approximately n=2. The two metho ds are related to a linear discriminant mo del, which helps to explain the results. 1 1 Intro duction This work arose out of my involvement with lung cancer studies within the North Central Cancer Treatment Group. These studies are small, typically 100 to 200 patients, and there are several imp ortant prognostic factors on whichwe wish to balance. Some of the factors, such as p erformance or Karnofsky score, are likely to contribute a larger survival impact than the treatment under study; hazards ratios of 3{4 are not uncommon. In this setting it b ecomes very dicult to interpret study results if any of the factors are signi cantly out of balance. -
Some Restricted Randomization Rules in Sequential Designs Jose F
This article was downloaded by: [Georgia Tech Library] On: 10 April 2013, At: 12:48 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 37-41 Mortimer Street, London W1T 3JH, UK Communications in Statistics - Theory and Methods Publication details, including instructions for authors and subscription information: http://www.tandfonline.com/loi/lsta20 Some Restricted randomization rules in sequential designs Jose F. Soares a & C.F. Jeff Wu b a Federal University of Minas Gerais, Brazil b University of Wisconsin Madison and Mathematical Sciences Research Institute, Berkeley Version of record first published: 27 Jun 2007. To cite this article: Jose F. Soares & C.F. Jeff Wu (1983): Some Restricted randomization rules in sequential designs, Communications in Statistics - Theory and Methods, 12:17, 2017-2034 To link to this article: http://dx.doi.org/10.1080/03610928308828586 PLEASE SCROLL DOWN FOR ARTICLE Full terms and conditions of use: http://www.tandfonline.com/page/terms-and-conditions This article may be used for research, teaching, and private study purposes. Any substantial or systematic reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any form to anyone is expressly forbidden. The publisher does not give any warranty express or implied or make any representation that the contents will be complete or accurate or up to date. The accuracy of any instructions, formulae, and drug doses should be independently verified with primary sources. The publisher shall not be liable for any loss, actions, claims, proceedings, demand, or costs or damages whatsoever or howsoever caused arising directly or indirectly in connection with or arising out of the use of this material. -
Optimal Design of Experiments Session 4: Some Theory
Optimal design of experiments Session 4: Some theory Peter Goos 1 / 40 Optimal design theory Ï continuous or approximate optimal designs Ï implicitly assume an infinitely large number of observations are available Ï is mathematically convenient Ï exact or discrete designs Ï finite number of observations Ï fewer theoretical results 2 / 40 Continuous versus exact designs continuous Ï ½ ¾ x1 x2 ... xh Ï » Æ w1 w2 ... wh Ï x1,x2,...,xh: design points or support points w ,w ,...,w : weights (w 0, P w 1) Ï 1 2 h i ¸ i i Æ Ï h: number of different points exact Ï ½ ¾ x1 x2 ... xh Ï » Æ n1 n2 ... nh Ï n1,n2,...,nh: (integer) numbers of observations at x1,...,xn P n n Ï i i Æ Ï h: number of different points 3 / 40 Information matrix Ï all criteria to select a design are based on information matrix Ï model matrix 2 3 2 T 3 1 1 1 1 1 1 f (x1) ¡ ¡ Å Å Å T 61 1 1 1 1 17 6f (x2)7 6 Å ¡ ¡ Å Å 7 6 T 7 X 61 1 1 1 1 17 6f (x3)7 6 ¡ Å ¡ Å Å 7 6 7 Æ 6 . 7 Æ 6 . 7 4 . 5 4 . 5 T 100000 f (xn) " " " " "2 "2 I x1 x2 x1x2 x1 x2 4 / 40 Information matrix Ï (total) information matrix 1 1 Xn M XT X f(x )fT (x ) 2 2 i i Æ σ Æ σ i 1 Æ Ï per observation information matrix 1 T f(xi)f (xi) σ2 5 / 40 Information matrix industrial example 2 3 11 0 0 0 6 6 6 0 6 0 0 0 07 6 7 1 6 0 0 6 0 0 07 ¡XT X¢ 6 7 2 6 7 σ Æ 6 0 0 0 4 0 07 6 7 4 6 0 0 0 6 45 6 0 0 0 4 6 6 / 40 Information matrix Ï exact designs h X T M ni f(xi)f (xi) Æ i 1 Æ where h = number of different points ni = number of replications of point i Ï continuous designs h X T M wi f(xi)f (xi) Æ i 1 Æ 7 / 40 D-optimality criterion Ï seeks designs that minimize variance-covariance matrix of ¯ˆ ¯ 2 T 1¯ Ï . -
Advanced Power Analysis Workshop
Advanced Power Analysis Workshop www.nicebread.de PD Dr. Felix Schönbrodt & Dr. Stella Bollmann www.researchtransparency.org Ludwig-Maximilians-Universität München Twitter: @nicebread303 •Part I: General concepts of power analysis •Part II: Hands-on: Repeated measures ANOVA and multiple regression •Part III: Power analysis in multilevel models •Part IV: Tailored design analyses by simulations in 2 Part I: General concepts of power analysis •What is “statistical power”? •Why power is important •From power analysis to design analysis: Planning for precision (and other stuff) •How to determine the expected/minimally interesting effect size 3 What is statistical power? A 2x2 classification matrix Reality: Reality: Effect present No effect present Test indicates: True Positive False Positive Effect present Test indicates: False Negative True Negative No effect present 5 https://effectsizefaq.files.wordpress.com/2010/05/type-i-and-type-ii-errors.jpg 6 https://effectsizefaq.files.wordpress.com/2010/05/type-i-and-type-ii-errors.jpg 7 A priori power analysis: We assume that the effect exists in reality Reality: Reality: Effect present No effect present Power = α=5% Test indicates: 1- β p < .05 True Positive False Positive Test indicates: β=20% p > .05 False Negative True Negative 8 Calibrate your power feeling total n Two-sample t test (between design), d = 0.5 128 (64 each group) One-sample t test (within design), d = 0.5 34 Correlation: r = .21 173 Difference between two correlations, 428 r₁ = .15, r₂ = .40 ➙ q = 0.273 ANOVA, 2x2 Design: Interaction effect, f = 0.21 180 (45 each group) All a priori power analyses with α = 5%, β = 20% and two-tailed 9 total n Two-sample t test (between design), d = 0.5 128 (64 each group) One-sample t test (within design), d = 0.5 34 Correlation: r = .21 173 Difference between two correlations, 428 r₁ = .15, r₂ = .40 ➙ q = 0.273 ANOVA, 2x2 Design: Interaction effect, f = 0.21 180 (45 each group) All a priori power analyses with α = 5%, β = 20% and two-tailed 10 The power ofwithin-SS designs Thepower 11 May, K., & Hittner, J. -
Approximation Algorithms for D-Optimal Design
Approximation Algorithms for D-optimal Design Mohit Singh School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA 30332, [email protected]. Weijun Xie Department of Industrial and Systems Engineering, Virginia Tech, Blacksburg, VA 24061, [email protected]. Experimental design is a classical statistics problem and its aim is to estimate an unknown m-dimensional vector β from linear measurements where a Gaussian noise is introduced in each measurement. For the com- binatorial experimental design problem, the goal is to pick k out of the given n experiments so as to make the most accurate estimate of the unknown parameters, denoted as βb. In this paper, we will study one of the most robust measures of error estimation - D-optimality criterion, which corresponds to minimizing the volume of the confidence ellipsoid for the estimation error β − βb. The problem gives rise to two natural vari- ants depending on whether repetitions of experiments are allowed or not. We first propose an approximation 1 algorithm with a e -approximation for the D-optimal design problem with and without repetitions, giving the first constant factor approximation for the problem. We then analyze another sampling approximation algo- 4m 12 1 rithm and prove that it is (1 − )-approximation if k ≥ + 2 log( ) for any 2 (0; 1). Finally, for D-optimal design with repetitions, we study a different algorithm proposed by literature and show that it can improve this asymptotic approximation ratio. Key words: D-optimal Design; approximation algorithm; determinant; derandomization. History: 1. Introduction Experimental design is a classical problem in statistics [5, 17, 18, 23, 31] and recently has also been applied to machine learning [2, 39]. -
A Review on Optimal Experimental Design
A Review On Optimal Experimental Design Noha A. Youssef London School Of Economics 1 Introduction Finding an optimal experimental design is considered one of the most impor- tant topics in the context of the experimental design. Optimal design is the design that achieves some targets of our interest. The Bayesian and the Non- Bayesian approaches have introduced some criteria that coincide with the target of the experiment based on some speci¯c utility or loss functions. The choice between the Bayesian and Non-Bayesian approaches depends on the availability of the prior information and also the computational di±culties that might be encountered when using any of them. This report aims mainly to provide a short summary on optimal experi- mental design focusing more on Bayesian optimal one. Therefore a background about the Bayesian analysis was given in Section 2. Optimality criteria for non-Bayesian design of experiments are reviewed in Section 3 . Section 4 il- lustrates how the Bayesian analysis is employed in the design of experiments. The remaining sections of this report give a brief view of the paper written by (Chalenor & Verdinelli 1995). An illustration for the Bayesian optimality criteria for normal linear model associated with di®erent objectives is given in Section 5. Also, some problems related to Bayesian optimal designs for nor- mal linear model are discussed in Section 5. Section 6 presents some ideas for Bayesian optimal design for one way and two way analysis of variance. Section 7 discusses the problems associated with nonlinear models and presents some ideas for solving these problems. -
Randomized Complete Block Designs
1 2 RANDOMIZED COMPLETE BLOCK DESIGNS Randomization can in principal be used to take into account factors that can be treated by blocking, but Introduction to Blocking blocking usually results in smaller error variance, hence better estimates of effect. Thus blocking is Nuisance factor: A factor that probably has an effect sometimes referred to as a method of variance on the response, but is not a factor that we are reduction design. interested in. The intuitive idea: Run in parallel a bunch of Types of nuisance factors and how to deal with them experiments on groups (called blocks) of units that in designing an experiment: are fairly similar. Characteristics Examples How to treat The simplest block design: The randomized complete Unknown, Experimenter or Randomization block design (RCBD) uncontrollable subject bias, order Blinding of treatments v treatments Known, IQ, weight, Analysis of (They could be treatment combinations.) uncontrollable, previous learning, Covariance measurable temperature b blocks, each with v units Known, Temperature, Blocking Blocks chosen so that units within a block are moderately location, time, alike (or at least similar) and units in controllable (by batch, particular different blocks are substantially different. choosing rather machine or (Thus the total number of experimental units than adjusting) operator, age, is n = bv.) gender, order, IQ, weight The v experimental units within each block are randomly assigned to the v treatments. (So each treatment is assigned one unit per block.) 3 4 Note that experimental units are assigned randomly RCBD Model: only within each block, not overall. Thus this is sometimes called a restricted randomization. -
Minimax Efficient Random Designs with Application to Model-Robust Design
Minimax efficient random designs with application to model-robust design for prediction Tim Waite [email protected] School of Mathematics University of Manchester, UK Joint work with Dave Woods S3RI, University of Southampton, UK Supported by the UK Engineering and Physical Sciences Research Council 8 Aug 2017, Banff, AB, Canada Tim Waite (U. Manchester) Minimax efficient random designs Banff - 8 Aug 2017 1 / 36 Outline Randomized decisions and experimental design Random designs for prediction - correct model Extension of G-optimality Model-robust random designs for prediction Theoretical results - tractable classes Algorithms for optimization Examples: illustration of bias-variance tradeoff Tim Waite (U. Manchester) Minimax efficient random designs Banff - 8 Aug 2017 2 / 36 Randomized decisions A well known fact in statistical decision theory and game theory: Under minimax expected loss, random decisions beat deterministic ones. Experimental design can be viewed as a game played by the Statistician against nature (Wu, 1981; Berger, 1985). Therefore a random design strategy should often be beneficial. Despite this, consideration of minimax efficient random design strategies is relatively unusual. Tim Waite (U. Manchester) Minimax efficient random designs Banff - 8 Aug 2017 3 / 36 Game theory Consider a two-person zero-sum game. Player I takes action θ 2 Θ and Player II takes action ξ 2 Ξ. Player II experiences a loss L(θ; ξ), to be minimized. A random strategy for Player II is a probability measure π on Ξ. Deterministic actions are a special case (point mass distribution). Strategy π1 is preferred to π2 (π1 π2) iff Eπ1 L(θ; ξ) < Eπ2 L(θ; ξ) : Tim Waite (U. -
Practical Aspects for Designing Statistically Optimal Experiments
Journal of Statistical Science and Application 2 (2014) 85-92 D D AV I D PUBLISHING Practical Aspects for Designing Statistically Optimal Experiments Mark J. Anderson, Patrick J. Whitcomb Stat-Ease, Inc., Minneapolis, MN USA Due to operational or physical considerations, standard factorial and response surface method (RSM) design of experiments (DOE) often prove to be unsuitable. In such cases a computer-generated statistically-optimal design fills the breech. This article explores vital mathematical properties for evaluating alternative designs with a focus on what is really important for industrial experimenters. To assess “goodness of design” such evaluations must consider the model choice, specific optimality criteria (in particular D and I), precision of estimation based on the fraction of design space (FDS), the number of runs to achieve required precision, lack-of-fit testing, and so forth. With a focus on RSM, all these issues are considered at a practical level, keeping engineers and scientists in mind. This brings to the forefront such considerations as subject-matter knowledge from first principles and experience, factor choice and the feasibility of the experiment design. Key words: design of experiments, optimal design, response surface methods, fraction of design space. Introduction Statistically optimal designs emerged over a half century ago (Kiefer, 1959) to provide these advantages over classical templates for factorials and RSM: Efficiently filling out an irregularly shaped experimental region such as that shown in Figure 1 (Anderson and Whitcomb, 2005), 1300 e Flash r u s s e r 1200 P - B 1100 Short Shot 1000 450 460 A - Temperature Figure 1. Example of irregularly-shaped experimental region (a molding process). -
A General Overview of Adaptive Randomization Design for Clinical Trials
tri ome cs & Bi B f i o o l s t a a n t r i s u t i o c J s Journal of Biometrics & Biostatistics Lin et al., J Biom Biostat 2016, 7:2 ISSN: 2155-6180 DOI: 10.4172/2155-6180.1000294 Review Article Open Access A General Overview of Adaptive Randomization Design for Clinical Trials Jianchang Lin1*, Li-An Lin2 and Serap Sankoh1 1Takeda Pharmaceutical Company Limited, Cambridge, MA, USA 2Merck Research Laboratories, Whitehouse Station, New Jersey , USA *Corresponding author: Jianchang Lin, Takeda Pharmaceutical Company Limited, Cambridge, MA, USA, Tel: +1-617-679-7000; E-mail: [email protected] Rec date: March 25, 2016; Acc date: April 14, 2016; Pub date: April 21, 2016 Copyright: © 2016 Lin J, et al.This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Abstract With the recent release of FDA draft guidance (2010), adaptive designs, including adaptive randomization (e.g. response-adaptive (RA) randomization) has become popular in clinical trials because of its advantages of flexibility and efficiency gains, which also have the significant ethical advantage of assigning fewer patients to treatment arms with inferior outcomes. In this paper, we presented a general overview of adaptive randomization designs for clinical trials, including Bayesian and frequentist approaches as well as response-adaptive randomization. Examples were used to demonstrate the procedure for design parameters calibration and operating characteristics. Both advantages and disadvantages of adaptive randomization were discussed in the summary from practical perspective of clinical trials. -
Optimal Design
LECTURE 11 Optimal Design The issue of how to optimally design experiments has been around for a lont time, extending back to at least 1918 (Smith). Before we start discussing the highlights of this topic we need to set up the problem. Note that we are going to set it up for the types of situations that we have enountered, although in fact the problem can be set up in more general ways. We will define X (n×p) as our design matrix, β(p×1) our vector of regression parameters, y(n×1) our vector of observations, and (n×1) our error vector giving y = Xβ + . We assume will be iid with mean zero and Cov() = σ 2 I . As before, we have βˆ = (X X)−1 X y Var(β)ˆ = σ 2(X X)−1 ˆ yˆx = xβ 2 −1 Var(yˆx ) = σ x(X X) x . The design problem consists of selecting row vectors x(1×p), i = 1, 2,...,n from the design space X such that the design defined by these n vectors is, in some defined sense, optimal. We are assuming that n is fixed. By and large, solutions to this problem consist of developing some sensible criterion based on the above model and using it to obtain optimal designs. One of the first to state a criterion and obtain optimal designs for regression problems was Smith(1918). The criterion she proposed was: minimize the maxi- mum variance of any predicted value (obtained by using the regression function) over the experimental space. I.e., min max Var(yˆx ). -
Restricted Randomization Or Row-Column Designs?
Rectangular experiments: restricted randomization or row-column designs? R. A. Bailey [email protected] Thanks to CAPES for support in Brasil The problem An agricultural experiment to compare n treatments. The experimental area has r rows and n columns. n z }| { r Use a randomized complete-block design with rows as blocks. (In each row, choose one of the n! orders with equal probability.) What should we do if the randomization produces a plan with one treatment always at one side of the rectangle? Example Federer (1955 book): guayule trees B D G A F C E A G C D F B E G E D F B C A B A C F G E D G B F C D A E Example Federer (1955 book): guayule trees B D G A F C E A G C D F B E G E D F B C A B A C F G E D G B F C D A E Solution: Simple-minded restricted randomization Keep re-randomizing until you get a plan you like. Analyse as usual. Solution: Use a Latinized design, but analyse as usual Deliberately construct a design in which no treatment occurs more than once in any column. Solution (following Yates): Super-valid restricted randomization, with usual analysis Solution: Efficient row-column design, with analysis allowing for rows and columns Solution: Use a carefully chosen Latinized design; REML/ANOVA estimates of variance components Proposed courses of action Solution (Fisher): Continue to randomize and analyse as usual Solution: Use a Latinized design, but analyse as usual Deliberately construct a design in which no treatment occurs more than once in any column.