UCLA Electronic Theses and Dissertations
Total Page:16
File Type:pdf, Size:1020Kb
UCLA UCLA Electronic Theses and Dissertations Title Structural Models of Coefficient of Variation Matrices Permalink https://escholarship.org/uc/item/0zj3b6qk Author Trickey, Kirsten Alexandra Publication Date 2015 Peer reviewed|Thesis/dissertation eScholarship.org Powered by the California Digital Library University of California UNIVERSITY OF CALIFORNIA Los Angeles Structural Models of Coefficient of Variation Matrices A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Psychology by Kirsten Alexandra Trickey 2015 ABSTRACT OF THE DISSERTATION Structural Models of Coefficient of Variation Matrices by Kirsten Alexandra Trickey Doctor of Philosophy in Psychology University of California, Los Angles, 2015 Professor Peter M. Bentler, Chair The coefficient of variation (CV) measures variability relative to the mean, and can be useful when increases in the mean correspond with systematic increases in variability. The univariate CV has been studied and applied extensively, but until recently it was not possible to study the structure of relative covariation occurring in multivariate data. However, Boik and Shirvani (2009) demonstrated that relative covariation could be modeled using estimators they developed to describe the sampling distribution of the CV matrix. This matrix, denoted Ψ, is defined as −1 −1 Ψ = 퐷흁 Σ 퐷흁 , where 퐷휇 is diagonal matrix containing variable means and Σ is the covariance matrix. The present research builds on this previous work by considering a more general class of structure models of the CV matrix. ii Specifically, we investigated how structural equation models of the CV matrix could be estimated and applied. First, a statistical theory for the estimation and evaluation of structural equation models of CV matrices was developed for both normally and arbitrarily distributed variables using generalized least squares. Computational algorithms were then written to implement the theory and to allow CV models to be estimated. Using these algorithms, a series of simulation studies were conducted to determine the quality of the estimators proposed by Boik and Shirvani (2009) and the quality of the subsequent model parameters, standard errors, and test statistics, which rely on those estimators. The simulations considered a range of sample sizes, normal and log-normal data, different numbers of variables, and models with either one or two factors. It was found that Boik and Shirvani’s theoretical estimators of the variance of the sampling distribution of the Ψ converged very slowly to their expected values and that they were particularly unreliable for log-normal data. That said, the estimation methods relying on these estimators, were generally able to estimate factor loadings accurately across conditions and when the sample sizes were fairly large and the number of variables was small, they also produced reasonably accurate estimates of the variance parameters, standard errors and test-statistics. However, in small samples with large numbers of variables, the variance estimates and the model fit statistics tended to be too low and the standard errors were typically overly conservative. In addition, when the data were log-normal the model fit statistics were problematic regardless of whether the estimator relied on normal theory. This general pattern of results was observed in both one-factor and two-factor models. The discussions below address some possible explanations for the estimation problems noted here and propose future work that should be done to better understand and potentially correct these problems. iii Some of this work was initiated here and included in a short series of follow-up studies. Specifically, we addressed the numerical stability of the CV matrix and its sampling distribution covariance in terms of condition numbers. It was found that these matrices were both typically less stable than their counterparts in structural covariance modeling. In addition, we observed that Winsorizing the data used for the estimation produced modest improvements in the numerical stability. It remains to be seen how this might affect model estimation. Finally, a one-factor CV model was fit to a longitudinal dataset assessing alcohol use over four years. Although each estimation method seemed to be able to reproduce the sample CV matrix with some accuracy, the model fit statistics indicated the model should be rejected. Given the non-normal distributions of the variables in the model, the appropriate interpretation of these results is ambiguous, but the interpretation and implications are discussed. iv The dissertation of Kirsten Alexandra Trickey is approved. Li Cai Martin M. Monti Frederic Paik Schoenberg Peter M. Bentler, Committee Chair University of California, Los Angeles 2015 v To my wonderful Hika… I am pleased to report that there will be more mystical yellow orbs for you to retrieve in the future. vi TABLE OF CONTENTS ABSTRACT OF THE DISSERTATION ........................................................................... ii TABLE OF CONTENTS .................................................................................................. vii LIST OF FIGURES ........................................................................................................... xi LIST OF TABLES .......................................................................................................... xvii ACKNOWLEDGMENTS ............................................................................................. xxiv VITA ............................................................................................................................... xxv Chapter 1. Introduction and Motivation .............................................................................. 1 Background and Basic Definition ................................................................................... 2 Properties and Appropriate Use of the CV ..................................................................... 3 Potential Applications ..................................................................................................... 5 Chapter 2. Existing Statistical Methods ............................................................................ 10 The Univariate Case ...................................................................................................... 10 The Multivariate Case ................................................................................................... 11 Chapter 3. The Coefficient of Variation Matrix ............................................................... 13 Definition ...................................................................................................................... 13 Sampling Distribution ................................................................................................... 13 Chapter 4. Structural Models ............................................................................................ 18 A General Structural Modeling Method ....................................................................... 18 Modeling Coefficient of Variation Matrices................................................................. 20 Modeling Covariance Matrices ..................................................................................... 21 Applying Covariance Methods to CV Matrix Modeling? ............................................ 22 Defining Models of Coefficient of Variation Matrices................................................. 23 Chapter 5. General Simulation Method ............................................................................ 25 Models........................................................................................................................... 25 vii Data Generation ............................................................................................................ 27 Estimation ..................................................................................................................... 29 Replications................................................................................................................... 30 Figures........................................................................................................................... 31 Chapter 6. Analysis of the Convergence of Σ̂흍 to Σ흍 ....................................................... 33 Method .......................................................................................................................... 33 Results ........................................................................................................................... 35 Discussion ..................................................................................................................... 36 Tables ............................................................................................................................ 40 Chapter 7. Quality of Parameter, Standard Error and Test Statistic Estimates: One Factor Models of Normal Data with Variables with Equal Population Means ......................................... 42 Method .......................................................................................................................... 42 Results ........................................................................................................................... 43 Discussion ....................................................................................................................