Estimation of Sample Mean and Variance for Monte-Carlo Simulations

David B. Thomas and Wayne Luk
Imperial College London
{dt10,wl}@doc.ic.ac.uk

Abstract

Monte-Carlo simulations are able to provide estimates of solutions for problems that are otherwise intractable, by examining the aggregate behaviour of large numbers of random simulations. Because these simulations are independent and embarrassingly parallel, FPGAs are increasingly being used to implement Monte-Carlo applications. However, as the number of simulation runs increases, the problem of accurately accumulating the aggregate statistics, such as the mean and variance, becomes very difficult. This paper examines three accumulation methods, adapting them for use in FPGA applications. In particular, we develop a mean and variance calculator based on cascading accumulators, which is able to process streams of floating-point data in one pass, while operating in fixed-point internally. This method has the advantage that it calculates the exact sample mean and an accurate, numerically stable sample variance, while using few logic resources and providing performance to match commercial floating-point operators: clock rates of 434MHz are achieved in Virtex-5, 1.46 times faster than a double-precision accumulator, while using one eighth of the resources.

1. Introduction

Monte-Carlo simulations are a class of applications that often map particularly well to FPGAs, due to the embarrassingly parallel nature of the computation. The huge number of independent simulation threads allows FPGA-based simulators to be heavily pipelined [6], and also allows multiple simulation instances to be placed within one FPGA, providing a roughly linear increase in performance as the size of the FPGA is increased. A second advantage is the relatively low IO required per simulator instance, meaning that low-performance buses can support very large amounts of computation without becoming a bottleneck [8].

A side-effect of high-performance FPGA Monte-Carlo simulators is that they generate huge numbers of results per second, and statistics on these results must be accurately accumulated to provide the overall answer. To keep up with the simulations this accumulation must happen at full speed, processing large streams of data in one pass, and using constant resources to process one result per clock-cycle. The accumulators must also be able to maintain accuracy over large numbers of samples, to ensure that numerical stability problems do not bias the overall simulation results.

This paper examines methods for the calculation of mean and variance statistics over large data sets using FPGAs. Our contributions are:

• Identification of accumulation methods used in software that can be adapted to work in hardware.

• An adaptation of the cascading accumulators method for hardware, allowing floating-point data to be efficiently processed in fixed-point format.

• An empirical evaluation of the accuracy of five different hardware accumulation methods, showing that only the cascading accumulator and double-precision methods maintain sufficient accuracy.

• A comparison of area and speed, showing that the cascading accumulators method uses one eighth of the logic resources of the double-precision method, while operating 1.46 times faster.

2. Motivation

At an abstract level Monte-Carlo simulations can be described as a tuple (P, O, A, f, a, p, d_0):

P : The set of possible simulation parameters, including environmental parameters and the starting state.

O : The set of possible simulation run outputs, containing the results produced by a single simulation run.

A : The set of accumulator states.

f : P ↦ O : A stochastic simulation function which maps a simulation input to one possible result. This function must involve some randomness, so two executions using the same simulation input will not provide the same output.

a : A × O ↦ A : A deterministic accumulation function that combines the current accumulator state with the result from a simulation run. The function should be order independent, i.e. a(a(d, e_1), e_2) = a(a(d, e_2), e_1).

p ∈ P : An element from the simulation input set giving the starting point for the application.

d_0 ∈ A : The initial state of the accumulator.

The simulation is then executed by generating a series of simulation results, and accumulating the results:

    d_i = a(d_{i−1}, e_i),    e_i = f(p)        (1)

As i → ∞ the estimate accumulated into d_i will asymptotically converge on the average properties of f(p).
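To make the framework concrete, the following minimal Python sketch (illustrative only; the toy payoff function, parameter names and state layout are assumptions, not taken from the paper) expresses Equation (1) as a streaming fold, with a simple running-sum accumulator standing in for a:

```python
import random

# Hypothetical stand-ins for the tuple (P, O, A, f, a, p, d_0) described above.

def f(p):
    """Stochastic simulation function f: P -> O (here: a toy noisy payoff)."""
    return p["drift"] + random.gauss(0.0, p["noise"])

def a(d, e):
    """Accumulation function a: A x O -> A. State d = (count, running_sum);
    adding results in any order yields the same state (up to floating-point rounding)."""
    count, total = d
    return (count + 1, total + e)

p = {"drift": 1.0, "noise": 0.5}   # simulation parameters, an element of P
d = (0, 0.0)                       # d_0: initial accumulator state

for _ in range(100_000):           # d_i = a(d_{i-1}, e_i),  e_i = f(p)
    d = a(d, f(p))

count, total = d
print("estimated mean:", total / count)  # converges towards the average behaviour of f(p)
```

In the FPGA setting described next, f corresponds to the simulation core and a to the accumulator datapath.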
In a hardware implementation it is convenient to view each simulator instance as a black box, with one input that accepts p, and one output that produces a stream of results derived from independent simulation runs. The rate of result generation will vary between different simulations, and between different architectures implementing a given simulation. In the extreme case the simulator will generate one new result every cycle [4], while in other cases the simulator will on average produce less than one result per cycle.

Even when generating less than one result per cycle, the simulator may exhibit bursty behaviour, where the simulator produces one result per cycle for a short period of time. This may be somewhat predictable: if, for example, the simulator uses a circular pipeline of depth k, and each simulation run requires m circuits of the pipeline, then there will be a burst of k results followed by a gap of k(m−1) cycles [6, 9]. However, more complicated simulations may take a random number of cycles to complete, such that the length of each burst follows a probability distribution [7]. The tails of this distribution may be very long, making the probability of a very long burst small, but still non-zero.

In such a situation it is important not to drop any results in the burst, as this may bias the overall result of the simulation. For this reason it is necessary to design accumulators that can handle the worst case, so that they can accept a sustained rate of one input value per cycle. This has the desirable side-effect of making the interface between the simulation black-box and the accumulator modular, as the simulator can simply throw results out whenever they become ready, without needing to worry about whether the downstream accumulator is ready to accept them.

So far we have not specified exactly what the accumulator does, as there are a number of choices. In some applications it is necessary to estimate the whole empirical distribution function of the simulation results, requiring complex histogram operations, but this is relatively uncommon. A slightly simpler case is when it is only necessary to estimate one quantile of the results; this occurs, for example, in Value-at-Risk calculations, where one wishes to estimate the 1% quantile of loss (how much one might lose every one in a hundred days).

In this paper we consider the simplest and most common type of accumulator: the arithmetic mean. This statistic estimates the expected value of the stochastic process as the sample count increases to infinity, and is used in many applications: for example, in pricing financial instruments one typically requires the average price over all possible future outcomes, while in Monte-Carlo integration it is necessary to estimate the average occupancy over the integration domain.

If we assume that the output of each simulation is a scalar value, then the sample mean (x̄) of the first n simulation runs is defined as:

    x̄ = (1/n) ∑_{i=1}^{n} x_i        (2)

For example, if the samples were generated by a Monte-Carlo simulation for option pricing, where each value x_i is the price of the option at the end of one simulation trial, then x̄ is an estimate of the true price of the option.

By itself the value of x̄ is not very helpful, as it gives no idea how much confidence can be placed in the estimate. The second key statistic is the sample variance (s²), which provides an estimate of how much the individual samples are spread around the mean:

    s² = (1/n) ∑_{i=1}^{n} (x_i − x̄)²        (3)

An important related statistic is the sample standard deviation (s), which is simply the square root of the variance. The advantage of the standard deviation is that it is measured in the same units as the data samples, while the variance is measured in squared units.
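For illustration, Equations (2) and (3) transcribe directly into the following two-pass software sketch (not from the paper). Note that the second pass needs every sample to still be available, which is precisely the buffering problem discussed next.

```python
import math

def two_pass_mean_variance(samples):
    """Literal transcription of Equations (2) and (3): compute the mean first,
    then make a second pass over the stored samples for the variance."""
    n = len(samples)
    mean = sum(samples) / n                               # Equation (2)
    variance = sum((x - mean) ** 2 for x in samples) / n  # Equation (3)
    return mean, variance

mean, variance = two_pass_mean_variance([4.1, 3.9, 4.3, 4.0, 4.2])
print(mean, variance, math.sqrt(variance))  # standard deviation s = sqrt(s^2)
```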
The definitions of mean and variance are straightforward, but actually calculating the statistics is more complex, particularly for the large numbers of samples generated by Monte-Carlo simulations: an FPGA might be configured with 10 or more simulation cores, each generating one result per cycle at 500MHz, so even one second of execution provides over 2³² samples, which imposes a number of requirements on the accumulators.

The first requirement is that the accumulator must be able to operate in a streaming mode, using constant resources, and able to accept one new floating-point value per cycle at the same clock rate as the floating-point cores used to calculate the value. In the formal definition of variance (Equation 3) it is assumed that the mean has already been calculated. This is a problem for streaming data applications, as it requires all the intermediate samples to be stored between the calculation of the mean and the variance. This is a particular problem in an FPGA, as a RAM that is both large enough and fast enough to buffer all the samples is likely to be very expensive, so the accumulator must be able to operate on just one pass through the data.

3. Existing Methods

In this section we identify a number of existing algorithms for calculating the mean and variance of data, and identify those that can be adapted for use in a hardware implementation.
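One well-known family of software approaches avoids storing the samples by updating running estimates of the mean and variance as each value arrives. The sketch below shows Welford's online update purely as a point of reference; it is a standard software technique, not the cascading-accumulator hardware design developed in this paper.

```python
class OnlineMeanVariance:
    """One-pass (streaming) mean/variance via Welford's online update.
    Uses constant state regardless of how many samples are processed."""

    def __init__(self):
        self.n = 0
        self.mean = 0.0
        self.m2 = 0.0   # running sum of squared deviations from the current mean

    def add(self, x):
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (x - self.mean)

    def variance(self):
        return self.m2 / self.n   # population form, matching Equation (3)

acc = OnlineMeanVariance()
for x in [4.1, 3.9, 4.3, 4.0, 4.2]:
    acc.add(x)
print(acc.mean, acc.variance())
```

The per-sample division and the floating-point feedback path in this style of update are difficult to sustain at one value per clock cycle, which is the kind of constraint the hardware adaptations considered in this section must address.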