Top-Level Schedule Distribution Models

Presented at the 2013 ICEAA Professional Development & Training Workshop - www.iceaaonline.com PR-155 Top-level Schedule Distribution Models Peter Frederic 19 March 2013 2013 Professional Development & Training Workshop New Orleans, LA June 18-21, 2013 Tecolote Research, Inc. Tecolote Research, Inc. 420 Fairview Ave. Suite 201 415 E. Ocean Ave., Suite H Goleta, CA 93117-3626 Lompoc, CA 93436 (805) 571-6366 (805) 588-1330 Presented at the 2013 ICEAA Professional Development & Training Workshop - www.iceaaonline.com Tecolote Research, Inc. PR-155,Top-level Schedule Distribution Models Top-level Schedule Distribution Models Peter Frederic Tecolote Research, Inc. Abstract In the initial stages of a development project, it is sometimes necessary to build a summary-level schedule for planning and budgeting purposes before the day-by-day details of the project are fully defined or understood. However, when uncertainty assessments are performed on schedule networks containing few activities, the distribution forms chosen for individual activity durations can have a significant impact on the overall results. It is therefore important to choose uncertainty distribution forms that accurately represent the behavior of the sub-network of activities represented by each summary activity. In this paper, we investigated probability theory to see if there were statistical distributions that were well suited to modeling the completion of typical schedule sub-networks consisting of multiple parallel activities. In order to test the applicability of the distributions investigated, we developed an Excel/@Risk tool to compare how various distributions behave versus simulated data from a simplified schedule network. We evaluated numerous distribution forms including: general Beta, PERT Beta, Log-normal, Weibull, Erlang, and Poisson distributions. We concluded that only the general BETA distribution could accurately model a sub-network consisting of multiple parallel paths. We propose additional research to develop Beta parameters to represent a variety of network topologies (e.g. mostly serial/mostly parallel, generous reserves/no reserves, many discrete risks/few discrete risks, etc.). Use or disclosure of data contained on this page is subject to Page i the restriction on the title page of this document. Presented at the 2013 ICEAA Professional Development & Training Workshop - www.iceaaonline.com Tecolote Research, Inc. PR-155, Top-level Schedule Distribution Models Table of Contents 1 INTRODUCTION ________________________________________________________ 1 2 THE PROCESS __________________________________________________________ 2 2.1 LOG-NORMAL DISTRIBUTION ____________________________________________ 3 2.2 WEIBULL DISTRIBUTION ________________________________________________ 4 2.3 ERLANG DISTRIBUTION _________________________________________________ 5 2.4 POISSON DISTRIBUTION_________________________________________________ 6 2.1 PERT-BETA DISTRIBUTION _____________________________________________ 7 2.1 BETA DISTRIBUTION ___________________________________________________ 8 3 ADDITIONAL RESEARCH REQUIRED ____________________________________ 10 4 CONCLUSIONS_________________________________________________________ 11 Page ii Presented at the 2013 ICEAA Professional Development & Training Workshop - www.iceaaonline.com Tecolote Research, Inc. PR-155, Top-level Schedule Distribution Models Page iii Presented at the 2013 ICEAA Professional Development & Training Workshop - www.iceaaonline.com Tecolote Research, Inc. PR-155, Top-level Schedule Distribution Models 1 INTRODUCTION In the initial stages of a development project, it is sometimes necessary to build a summary-level schedule for planning and budgeting purposes before the day-by-day details of the project are fully defined or understood. However, when uncertainty assessments are performed on schedule networks containing few activities, the distribution forms chosen for individual activity durations can have a significant impact on the overall results. It is therefore important to choose uncertainty distribution forms that accurately represent the behavior of the sub-network of activities represented by each summary activity. In the sections below,we evaluate numerous distribution forms including: Log-normal, Weibull, Erlang, Poisson, PERT Beta, and general Beta distributions. Page 1 Presented at the 2013 ICEAA Professional Development & Training Workshop - www.iceaaonline.com Tecolote Research, Inc. PR-155, Top-level Schedule Distribution Models 2 THE PROCESS We investigated probability theory to see if there were statistical distributions that were well suited to modeling the completion of multiple parallel activities. In order to test the applicability of the distributions investigated, we developed an Excel™/@Risk™ tool to compare how various distributions behave versus simulated data from a simplified schedule network. The model simulates 50 parallel activities feeding into one. Each of the 50 activities was modeled as a lognormal distribution with a mean duration of ten days and a standard deviation of four days. The test process is: 1) use @Risk to run 10,000 iterations of the sample network, 2) capture the final completion date in all 10,000 iterations, 3) bin the simulation results into a histogram, and 4) use the Excel Solver to fit PDFs for various distributions to the simulated data histogram. For each distribution type tested, the sections below provide a very brief description of the distribution as well as a probability density graph that shows how well the distribution fit the test data. Page 2 Presented at the 2013 ICEAA Professional Development & Training Workshop - www.iceaaonline.com Tecolote Research, Inc. PR-155, Top-level Schedule Distribution Models 2.1 LOG-NORMAL DISTRIBUTION From Wikipedia: "A log-normal distribution is a probability distribution of a random variable whose logarithm is normally distributed… A variable might be modeled as log-normal if it can be thought of as the multiplicative product of many independent random variables each of which is positive." PDF: Where: x = Duration µ = Mean ln(duration) α = Standard deviation ln(duration) The log-normal distribution matches the “50 parallel activities feeding into one” simulation data surprisingly well, but falls significantly short at the high tail of the distribution: Page 3 Presented at the 2013 ICEAA Professional Development & Training Workshop - www.iceaaonline.com Tecolote Research, Inc. PR-155, Top-level Schedule Distribution Models 2.2 WEIBULL DISTRIBUTION From Wikipedia: "The Weibull distribution is used: • In survival analysis • In reliability engineering and failure analysis • In industrial engineering to represent manufacturing and delivery times" PDF: Where: x = Duration k = Shape parameter λ = Scale parameter The Weibull distribution does not match the “50 parallel activities feeding into one” simulation data well: Page 4 Presented at the 2013 ICEAA Professional Development & Training Workshop - www.iceaaonline.com Tecolote Research, Inc. PR-155, Top-level Schedule Distribution Models 2.3 ERLANG DISTRIBUTION From Wikipedia: "The Erlang distribution was developed by A. K. Erlang to examine the number of telephone calls which might be made at the same time to the operators of the switching stations. This work on telephone traffic engineering has been expanded to consider waiting times in queuing systems in general." PDF: Where: x = Duration k = Shape parameter λ = Rate parameter The Erlang distribution matches the “50 parallel activities feeding into one” simulation data fairly well, but falls significantly short at the high tail of the distribution: Page 5 Presented at the 2013 ICEAA Professional Development & Training Workshop - www.iceaaonline.com Tecolote Research, Inc. PR-155, Top-level Schedule Distribution Models 2.4 POISSON DISTRIBUTION From Wikipedia: “…the Poisson distribution (pronounced pwason(or Poisson law of small numbers[1]) is a discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time and/or space if these events occur with a known average rate and independently of the time since the last event.” Because it is a discrete distribution, it is not applicable to the finish time of multiple parallel activities. Page 6 Presented at the 2013 ICEAA Professional Development & Training Workshop - www.iceaaonline.com Tecolote Research, Inc. PR-155, Top-level Schedule Distribution Models 2.1 PERT-BETA DISTRIBUTION From the @Risk™ help files: "The PERT distribution (meaning Program Evaluation and Review Technique) is rather like a Triangular distribution, in that it has the same set of three parameters. Technically it is a special case of a scaled Beta (or BetaGeneral) distribution. In this sense it can be used as a pragmatic and readily understandable distribution." PDF: f(y, a, m, b) = Where: y = Duration µ = Mean = (a + 4 * m + b) / 6 α = Shape parameter = 6 * (µ - a) / (b - a) β = Shape parameter = 6 * (b - µ) / (b - a) a = Absolute minimum y b = Absolute maximum y m = Most likely value of y and: Because the shape parameters of the PERT form of the Beta curve are constrained, it is not able to match the “50 parallel activities feeding into one” simulation data well, especially at the high tail of the distribution: Page 7 Presented at the 2013 ICEAA Professional Development & Training Workshop - www.iceaaonline.com Tecolote Research, Inc. PR-155, Top-level Schedule Distribution Models 2.1 BETA DISTRIBUTION From

Load more