Variance Difference Between Maximum Likelihood Estimation Method and Expected a Posteriori Estimation Method Viewed from Number of Test Items


Educational Research and Reviews, Vol. 11(16), pp. 1579-1589, 23 August, 2016
DOI: 10.5897/ERR2016.2807
Article Number: B2D124860158
ISSN 1990-3839
Copyright © 2016. Author(s) retain the copyright of this article.
http://www.academicjournals.org/ERR

Full Length Research Paper

Jumailiyah Mahmud(1)*, Muzayanah Sutikno(2) and Dali S. Naga(3)

(1) Institute of Teaching and Educational Sciences of Mataram, Indonesia.
(2) State University of Jakarta, Indonesia.
(3) University of Tarumanegara, Indonesia.

*Corresponding author. E-mail: [email protected]. Authors agree that this article remains permanently open access under the terms of the Creative Commons Attribution License 4.0 International License.

Received 8 April, 2016; Accepted 12 August, 2016

The aim of this study is to determine the variance difference between the maximum likelihood (MLE) and expected a posteriori (EAP) estimation methods viewed from the number of items of an aptitude test. The variance represents the accuracy generated by both the maximum likelihood and Bayes estimation methods. The test consists of three subtests, each with 40 multiple-choice items of 5 alternatives. In total, 102 items were administered to 3,159 respondents drawn using a random matrix sampling technique, and 73 items qualified under both classical test theory and IRT criteria. The study examines 5 hypotheses. The results are: (1) the variance of the MLE method is higher than that of the EAP method on the test of 25 items, with F = 1.602; (2) the variance of the MLE method is higher than that of the EAP method on the test of 50 items, with F = 1.332; (3) the variance of estimation with the test of 50 items is higher than with the test of 25 items; (4) the variance of estimation with the test of 50 items is higher than with the test of 25 items on the EAP method, with F = 1.329. All observed F values are ≥ 1.00. (5) The RMSE at 10, 15, 20, and 25 items differs between MLE and EAP, with t = 3.060 and p = 0.011, so the statistical null hypotheses are rejected. The study concludes that the variance of the MLE method is higher than that of EAP, that the test with 50 items has higher variance than the test with 25 items, and that the accuracy of the EAP estimates is higher than that of MLE at 10, 15, 20, and 25 items.

Key words: Variance, RMSE of estimation, maximum likelihood, expected a posteriori.

INTRODUCTION

There are two types of psychological and educational measurement theories: classical and modern. The modern measurement theory, also known as item response theory (IRT), was developed in response to the weakness of classical measurement theory, mainly its dependence on the particular groups of test takers and items. Dependence means that the result of a measurement depends on the group that takes the test: if a test is given to takers of high proficiency, the estimated difficulty of its items drops, while if it is given to takers of low ability, the estimated difficulty of the items rises (Hambleton, 1991).

It was found that classical test theory (CTT) has some limitations, whereas item response theory (IRT) shows a variety of benefits, such as:

1. Estimating item difficulty;
2. More stable difficulty indices;
3. More stable internal consistency; and
4. Markedly reduced measurement error (Magno, 2009).

IRT, as a model- and item-based approach, is widely considered successful in both research and practical applications. The main role of an IRT model is to estimate a person's position on a latent dimension (Reise and Revicki, 2014). In addition, estimation quality depends on accuracy criteria consisting of mean square error (MSE), bias, and estimation variance. Results have shown that parameter estimation improves markedly when priors are used, particularly for two- and three-parameter models (Baker, 2004).

The aim of educational measurement is to determine the level of test takers' ability so that it can be used in selection decision-making, for example to identify whether candidates can be accepted into a particular program. A decision to accept or reject candidates is often wrong, which may bring negative implications for those individuals' further development. An inappropriate decision is often caused by the use of invalid and unreliable instruments or tests, whose results differ greatly from the actual conditions and therefore carry high uncertainty. In contrast, a measurement is believed to have high accuracy if its result has a small RMSE and variance.
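As a rough illustration of the accuracy criteria named above, the sketch below computes bias, variance, and (root) mean square error for a batch of simulated ability estimates against a known true ability. It is a minimal sketch, not code from the study; the function name, the simulated values, and the slight bias built into the generator are all illustrative assumptions.

```python
import numpy as np

def accuracy_criteria(theta_hat, theta_true):
    """Bias, variance, and (R)MSE of ability estimates against a known
    true ability; note that MSE = bias^2 + variance."""
    theta_hat = np.asarray(theta_hat, dtype=float)
    bias = np.mean(theta_hat - theta_true)        # systematic error
    variance = np.var(theta_hat)                  # spread of the estimates
    mse = np.mean((theta_hat - theta_true) ** 2)  # overall accuracy
    return {"bias": bias, "variance": variance, "mse": mse, "rmse": np.sqrt(mse)}

# Hypothetical example: 1000 estimates of a true ability of 0.5 produced
# by a slightly biased estimator.
rng = np.random.default_rng(0)
estimates = rng.normal(loc=0.55, scale=0.3, size=1000)
print(accuracy_criteria(estimates, 0.5))
```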
Perspective proposed by DeMars reveals three assumptions of item response theory: unidimensionality, local independence, and model specification accuracy (DeMars, 2010). First, unidimensionality means that a test measures only one trait or ability of the test takers. Items in an arithmetic test, for example, should measure only the participants' ability to count, not also their proficiency in a language. Statistically, unidimensionality can be checked through the eigenvalues of a factor analysis, which should show a single dominant factor. Second, local independence means that, with test takers' ability held constant, responses to the items are statistically unconnected: "This assumption can be accepted if test takers' answers to a certain item do not influence answers in other items. Test takers' answers in some test items are expected to be unrelated" (Hambleton et al., 1991). The implication of this assumption is that the test can be analyzed item by item and, for the test takers, the analysis can likewise be done on an individual basis.

The probability of a correct answer, the item parameters, and the test takers' characteristics are related through a logistic model, so the item characteristic curve is described by the logistic model used as the basis of calculation (Naga, 1992). In the one-parameter model, the item characteristic curve is described by the item's level of difficulty (bi): the higher the difficulty of an item, the higher the ability a test taker needs to answer it correctly, and the further to the right the item sits on the ability scale. The two-parameter logistic model adds the item discrimination index (ai) to the item difficulty (bi). In the item characteristic curve (ICC), the discrimination index appears as the slope, or steepness, of the curve: a steep curve indicates a high discrimination index (high ai), while a flat curve indicates a low one (low ai). The two-parameter logistic model is

$$P_i(\theta) = \frac{e^{D a_i (\theta - b_i)}}{1 + e^{D a_i (\theta - b_i)}}, \qquad i = 1, 2, 3, \ldots, n \qquad (1)$$

where:
Pi(θ) = probability that a randomly chosen test taker of ability θ answers item i correctly
θ = the subject's level of ability
bi = difficulty parameter of item i
ai = discrimination index of item i
n = number of items in the test
e = the constant 2.718
D = a scaling factor that brings the logistic function close to the normal ogive function (Hambleton et al., 1991).
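To make equation (1) concrete, here is a minimal sketch that evaluates the two-parameter logistic probability. It is not taken from the paper; the item parameters and the conventional value D = 1.7 are assumptions for illustration. The steep item (high ai) separates low and high abilities much more sharply than the flat one, which is exactly what the discrimination index describes.

```python
import numpy as np

D = 1.7  # conventional scaling factor bringing the logistic close to the normal ogive

def p_correct(theta, a, b):
    """Equation (1): probability that a test taker of ability theta answers
    an item of difficulty b and discrimination a correctly."""
    z = D * a * (theta - b)
    return np.exp(z) / (1.0 + np.exp(z))

# Illustrative comparison: a steep (a = 2.0) vs. a flat (a = 0.5) item,
# both of difficulty b = 0.
for theta in (-2.0, 0.0, 2.0):
    print(theta, p_correct(theta, 2.0, 0.0), p_correct(theta, 0.5, 0.0))
```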
Estimation means approximating or appraising: it involves finding appropriate values for the parameters of an equation using particular ways or methods (Makridakis et al., 1999). Some differences, however, may be found between the logistic model and a regression model:

1. A regression model is usually applied to variables that have a linear relationship, while in the parameter logistic model the relationship between the test items and the test takers' ability is nonlinear.
2. The independent variable in a regression model is observable; on the contrary, the independent variable in item response theory, the test takers' ability, is unobservable (Hambleton et al., 1991).

Since the actual scores of item parameters and of test takers' ability are unknown, an analysis estimating the parameters of test takers' ability and item scores is required. The identification of these parameter scores is known as parameter estimation.

A study by Borgatto and Pinheiro (2015) aims to determine the impact of ability estimation in IRT with respect to the difficulty level of the test, and to evaluate whether the error of estimation can be influenced by the estimation method used. Two estimation methods are used: the weighted likelihood estimator (WLE) and MLE. The EAP method is divided according to its prior θ distribution, uniform or normal, and the uniform-prior version is compared with WLE. The standard measure for determining the accuracy of an estimation method is the mean squared error (MSE). The simulation uses 4 tests based on the number of items: 15, 30, 45, and 60. The study found that uniform-prior EAP has better MSE than WLE with 45 items, and from this result it was suggested to use more items when estimating the ability parameter.

Accuracy is the main criterion used to evaluate how well estimation methods perform. [...] average score as a justifiable deviation. Hambleton et al. (1991) states that the residual error of the maximum likelihood estimation method is higher than that of the expected a posteriori method, which indicates that the accuracy of EAP estimation is better than that of MLE (Hambleton et al., 1991). Swaminathan et al. (2003) argue that the "Bayes procedure generally can result to smaller variance compared to that generated from maximum likelihood." Baker et al. (2004) points out the estimation bias resulting from both estimation methods: "there was little difference between the MLE/EM estimates and those obtained via the Bayesian procedures". This means that [...]
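The MLE/EAP contrast discussed in this passage can be sketched numerically. The sketch below is illustrative and not the authors' implementation; the ability grid, the standard normal prior, and the simulated item parameters are all assumptions. It estimates one examinee's ability both ways under the two-parameter model of equation (1).

```python
import numpy as np

D = 1.7  # scaling factor, as in equation (1)

def p_correct(theta_grid, a, b):
    """2PL probability for every ability in theta_grid and every item."""
    z = D * a * (theta_grid[:, None] - b)   # shape: (grid points, items)
    return 1.0 / (1.0 + np.exp(-z))

def estimate_ability(responses, a, b, grid=np.linspace(-4, 4, 161)):
    """Return (MLE, EAP) ability estimates for one 0/1 response vector.
    The EAP estimate is the posterior mean under a N(0, 1) prior."""
    p = p_correct(grid, a, b)
    # Likelihood of the observed response pattern at each grid point.
    like = np.prod(np.where(responses == 1, p, 1.0 - p), axis=1)
    mle = grid[np.argmax(like)]    # grid-search maximum likelihood; runs to the
                                   # grid edge for all-correct or all-wrong
                                   # patterns, while EAP stays finite
    prior = np.exp(-0.5 * grid**2) # N(0, 1) prior; constants cancel in the ratio
    post = like * prior
    eap = np.sum(grid * post) / np.sum(post)
    return mle, eap

# Hypothetical 25-item test: random item parameters, one simulated examinee
# with true ability 1.0.
rng = np.random.default_rng(1)
a = rng.uniform(0.5, 2.0, size=25)   # discrimination indices
b = rng.normal(0.0, 1.0, size=25)    # difficulties
u = (rng.random(25) < p_correct(np.array([1.0]), a, b)[0]).astype(int)
print(estimate_ability(u, a, b))
```

Because the prior pulls extreme likelihoods toward the population mean, EAP estimates spread less across examinees than MLE estimates, which is consistent with the smaller Bayes variance reported by Swaminathan et al. (2003) above.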