Latent Class Analysis

Total Page:16

File Type:pdf, Size:1020Kb

Latent Class Analysis International Journal for Cross-Disciplinary Subjects in Education (IJCDSE), Volume 11, Issue 2, 2020 Latent Class Analysis Diana Mindrila University of West Georgia, USA Abstract A variety of phenomena examined in education can difficulties in the implementation of this method. The be described using categorical latent models, which use of LCA became more widespread after 1974, help identify groups of individuals who have a series when Goodman [6] developed a more general of characteristics in common. Usually, data are too procedure for computing maximum likelihood complex to identify and describe groups through parameter estimates. Later, Dempster, Laird, and inspection alone; therefore, multivariate grouping Rubin [7] developed the “expectation-maximization techniques such as latent class analysis (LCA) should algorithm”, which placed LCA in the log-linear model be employed. LCA refers to a set of classification framework and increased its generalizability [8]. The procedures used to identify subgroups of individuals expectation-maximization algorithm allowed who have several characteristics in common [1]. The researchers to assess model fit [9] and to estimate following article provides a general description of more complex models such as LCA with covariates LCA. It describes the latent class model and explains [10] and longitudinal “individual growth trajectories” the steps involved in latent class modeling. Finally, [11]. From this point on, researchers developed the article includes an empirical application of LCA several procedures for estimating changes in latent with binary indicators using the Mplus statistical class membership over time [12]. software. 3. A General Description of LCA 1. Introduction The LCA method is a multivariate classification A variety of phenomena examined in education procedure that allows researchers to categorize can be described using categorical latent variables, individuals into homogeneous groups [2]. It is which help identify groups of individuals who share a sometimes referred to as “mixture modeling based set of characteristics. Usually the array of observed clustering”, “mixture-likelihood approach to data is too complex to identify and describe groups clustering”, or “finite mixture modeling” [13]. In fact, through inspection alone; therefore, researchers use “finite mixture modeling” is a more general term for person-oriented classification procedures such latent latent variable modeling where latent variables are class analysis (LCA) to accomplish this goal. categorical. The latent categories represent a set of Although LCA has been discussed in the sub-populations of individuals, and individuals’ methodological literature, most authors focused either memberships to these sub-populations are inferred on computational procedures [2], on conceptual based on patterns of variations in the data [13]. explanations [1,3], or on describing empirical LCA postulates the existence of a categorical applications [4]. The present article provides an variable free of measurement error [1]. The latent overview of all aspects of LCA, by discussing both the construct that underlies the data is not measured theoretical foundations and the practical applications directly, it is estimated based on the variance shared of this method. For the applied researchers, the article by a set of observed variables. Researchers take a includes a step-by-step description of LCA with a similar approach when conducting factor analysis, clear example of how to apply this procedure using which also infers underlying latent variables based on the Mplus statistical software and how to interpret the the relationships among observed variables. results. Nevertheless, factor analysis groups variables, and is, therefore, a variable-oriented classification procedure. 2. Milestones in the Development of LCA In contrast, LCA groups individuals and is considered a person-oriented classification procedure [14]. Lazarfeld and Henry [5] brought the first major Further, in latent class models the latent variable is contribution to the development of LCA. Although categorical rather than continuous [2], which means they were not the first suggesting the possibility of that groups do not necessarily differ quantitatively but estimating categorical latent variables, they were the have distinct combinations of characteristics and first authors to provide a detailed, comprehensive, represent a phenomenon that is inherently categorical mathematical and conceptual description of LCA. rather than continuous [1]. Nevertheless, the absence of a reliable and general LCA is similar to cluster analysis (CA), which is method for calculating parameter estimates posed also a person-oriented classification procedure. An important distinction between CA and LCA is that Copyright © 2020, Infonomics Society 4323 International Journal for Cross-Disciplinary Subjects in Education (IJCDSE), Volume 11, Issue 2, 2020 cluster analysis does not estimate the error of 푃(푖 1 , 푖 2 , … . , 푖 푟 ) = ∑_(푘 = 1)^퐾 푃(퐶 = 푘)푃(퐶 measurement and is, therefore, conducted “at the = 푘)푃(퐶 = 푘) … 푃(푖 푟 │퐶 = 푘) observed level” [3]. In contrast, LCA assumes that a latent categorical variable underlies the data; it When the observed indicators are continuous, estimates the error of measurement of the latent parameters are represented by the means and categorical variable, which is taken into account in the variances of the observed variables and the LCA calculation of parameter estimates and the estimation model can be expressed as: of class membership probabilities. Additionally, LCA 퐾 allows the computation of fit indices, which show how 푓(푦 푝 ) = ∑ 푃(퐶 = 푘)푓(푦 푝 |퐶 = 푘) well the latent class model fits the data [3]. 푘=1 LCA relies on the assumption that homogeneous sub-populations exist within the data. These where yp represents the set of responses provided by a subgroups have distinct probability distributions and person p on all the observed indicators [1]. are mutually exclusive [15]; therefore, the The statistical model described above is the basis percentages of individuals assigned to each latent of all types of LCA: exploratory LCA, confirmatory class add up to 100%. LCA also relies on the LCA, latent profile analysis (LPA), latent transition assumption of local independence. Specifically, LCA analysis (LTA), multilevel LCA, etc. [1]. Although assumes that the variance shared by the observed most often used as an exploratory procedure, LCA can indicators is accounted for only by the latent also be used as a confirmatory procedure by categorical variable [16]. Further, LCA assumes that constraining model parameters based on the the number of latent classes specified by the latent researchers’ hypotheses [17]. LPA is a special case of model is correct [3]. LCA where observed indicators are continuous rather than ordered categorical or binary [2]. LPA is based 4. The LCA Model on the assumption of multivariate normality, meaning that the multivariate distribution of the continuous A mixture model includes a measurement model observed variables is normal within each group [3]. and a structural model. LCA is the measurement The LTA approach is a variation of LCA for model, which consists of a set of observed variables, longitudinal data, which allows researchers to also referred to as observed indicators, regressed on a examine changes in latent class memberships across latent categorical variable [2]. Figure 1 illustrates time [1]. Multilevel LCA is employed with data from relationships between a set of r observed indicators i complex sampling designs and allows the estimation and an underlying categorical variable C [2]. of a multilevel latent class model to account for the nested structure of the data. Mplus permits the estimation of exploratory and confirmatory LCA models, LTA models, as well as multilevel LCA models. Further, Mplus allows the estimation of relationships among latent categorical variables and second order factors, covariates, or observed dependent variables also known as distal outcomes [2]. The current article will further describe the exploratory LCA model. 4.1 Model estimation Figure 1. Diagram of a general latent class model The computation procedures used for estimating model parameters are based on the type of variables Observed variables can be continuous, counts, used as observed indicators (Table 1). The default ordered categorical, binary, or unordered categorical estimation method for mixture modeling in Mplus is variables [2]. When estimating a latent variable C with robust maximum likelihood (MLR), which uses “log- k latent classes (C=k; k=1, 2,…K), the “marginal item likelihood functions derived from the probability probability” for item ij=1 can be expressed as: density function underlying the latent class model” 퐾 [2]. Other estimation procedures such as maximum 푃(푖 푗 = 1) = ∑ 푃(퐶 = 푘)푃(푖 푗 = 1|퐶 = 푘) likelihood (ML), or Bayesian estimation (BAYES) 푘=1 can be specified in Mplus using the ESTIMATOR Assuming that the assumption of local option of the ANALYSIS command. Although MLR independence is met, the joint probability for all corrects standard errors and test statistics, other observed variables can be expressed as: estimators such as BAYES may provide more accurate results with smaller sample sizes, ordinal data, and non-normal continuous variables [2]. Copyright © 2020, Infonomics Society 4324 International Journal for Cross-Disciplinary Subjects in Education (IJCDSE), Volume 11, Issue 2, 2020 class is labeled based on its
Recommended publications
  • Advances in Nonnegative Matrix Decomposition with Application to Cluster Analysis Aalto Un Iversity
    Departm en t of In form ation an d Com pu ter Scien ce Aa lto- He Zhang Zhang He DD 127 Advances in Nonnegative / 2014 Advances in Nonnegative Matrix Decomposition with Application to Cluster Analysis Analysis Cluster to Application with Decomposition Matrix Nonnegative in Advances Matrix Decomposition with Application to Cluster Analysis He Zhang 9HSTFMG*aficid+ 9HSTFMG*aficid+ ISBN 978-952-60-5828-3 BUSINESS + ISBN 978-952-60-5829-0 (pdf) ECONOMY ISSN-L 1799-4934 ISSN 1799-4934 ART + ISSN 1799-4942 (pdf) DESIGN + ARCHITECTURE Aalto Un iversity Aalto University School of Science SCIENCE + Department of Information and Computer Science TECHNOLOGY www.aalto.fi CROSSOVER DOCTORAL DOCTORAL DISSERTATIONS DISSERTATIONS Aalto University publication series DOCTORAL DISSERTATIONS 127/2014 Advances in Nonnegative Matrix Decomposition with Application to Cluster Analysis He Zhang A doctoral dissertation completed for the degree of Doctor of Science (Technology) to be defended, with the permission of the Aalto University School of Science, at a public examination held at the lecture hall T2 of the school on 19 September 2014 at 12. Aalto University School of Science Department of Information and Computer Science Supervising professor Aalto Distinguished Professor Erkki Oja Thesis advisor Dr. Zhirong Yang Preliminary examiners Research Scientist Ph.D. Rafal Zdunek, Wroclaw University of Technology, Poland Associate Professor Ph.D. Morten Mørup, DTU Compute, Denmark Opponent Associate Professor Ali Taylan Cemgil, Bogazici University, Turkey Aalto University publication series DOCTORAL DISSERTATIONS 127/2014 © He Zhang ISBN 978-952-60-5828-3 ISBN 978-952-60-5829-0 (pdf) ISSN-L 1799-4934 ISSN 1799-4934 (printed) ISSN 1799-4942 (pdf) http://urn.fi/URN:ISBN:978-952-60-5829-0 Unigrafia Oy Helsinki 2014 Finland Abstract Aalto University, P.O.
    [Show full text]
  • Conducting Confirmatory Latent Class Analysis Using Mplus W
    This article was downloaded by: [University of California, Los Angeles (UCLA)] On: 28 December 2011, At: 15:49 Publisher: Psychology Press Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 37-41 Mortimer Street, London W1T 3JH, UK Structural Equation Modeling: A Multidisciplinary Journal Publication details, including instructions for authors and subscription information: http://www.tandfonline.com/loi/hsem20 Conducting Confirmatory Latent Class Analysis Using Mplus W. Holmes Finch a & Kendall Cotton Bronk a a Ball State University Available online: 07 Jan 2011 To cite this article: W. Holmes Finch & Kendall Cotton Bronk (2011): Conducting Confirmatory Latent Class Analysis Using Mplus , Structural Equation Modeling: A Multidisciplinary Journal, 18:1, 132-151 To link to this article: http://dx.doi.org/10.1080/10705511.2011.532732 PLEASE SCROLL DOWN FOR ARTICLE Full terms and conditions of use: http://www.tandfonline.com/page/terms-and-conditions This article may be used for research, teaching, and private study purposes. Any substantial or systematic reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any form to anyone is expressly forbidden. The publisher does not give any warranty express or implied or make any representation that the contents will be complete or accurate or up to date. The accuracy of any instructions, formulae, and drug doses should be independently verified with primary sources. The publisher shall not be liable for any loss, actions, claims, proceedings, demand, or costs or damages whatsoever or howsoever caused arising directly or indirectly in connection with or arising out of the use of this material.
    [Show full text]
  • 25Th , 2017 University of Connecticut
    Modern Modeling Methods Conference May 22nd – 25th, 2017 University of Connecticut 2017 Modern Modeling Methods Conference Welcome and thank you for joining us for the 7th annual Modern Modeling Methods Conference at the University of Connecticut. Special thanks to all of the keynote speakers and concurrent presenters for making this wonderful program possible! I look forward to this week all year long. I love seeing all the wonderful modeling work that researchers are doing, and I love getting the chance to interact with people whose work I am reading and citing throughout the year. It’s a pleasure and an honor to have Dr. Ken Bollen back at the modeling conference. Dr. Bollen was one of our 5 keynote speakers at the first modeling conference in 2011. I am also extremely excited to welcome Dr. Steven Boker to the Modeling conference for the first time. I have been a great fan of his work ever since I saw him speak at UVA over a decade ago. Thank you to the following people for their contributions to the conference: Lisa Rasicot, Casey Davis, Kasey Neves, Cheryl Lowe, Robbin Haboian-Demircan, and conference services for providing administrative and logistical support for the conference. I couldn’t keep this conference going without them! I hope to see you all again at our eighth annual Modern Modeling Methods Conference May 22nd- 23rd, 2018. Confirmed keynote speakers for 2018 include Dr. Peter Molenaar (Penn State) and Tenko Raykov (MSU). Tenko Raykov will be offering a full day Pre-Conference workshop on IRT from a Latent Variable Modeling Perspective on May 21st.
    [Show full text]
  • Download Paper
    Running head: LATENT CLASS ANALYSIS FREQUENTLY ASKED QUESTIONS 1 Ten Frequently Asked Questions about Latent Class Analysis Karen Nylund-Gibson, Ph.D. University of California, Santa Barbara Department of Education Santa Barbara, CA 93106 [email protected] Andrew Young Choi, M.A. University of California, Santa Barbara Department of Counseling, Clinical, and School Psychology Santa Barbara, CA 93106 [email protected] Author Note Karen Nylund-Gibson, Department of Education, University of California, Santa Barbara; Andrew Young Choi, Department of Counseling, Clinical, and School Psychology, University of California, Santa Barbara. Correspondence concerning this article should be addressed to Karen Nylund-Gibson at [email protected] The research reported here was supported in part by the Institute of Education Sciences, U.S. Department of Education, through Grant #R305A160157 to the University of California, Santa Barbara. The opinions expressed are those of the authors and do not represent views of the Institute of Education Sciences or the U.S. Department of Education. Nylund-Gibson, K., & Choi, A. Y. (In press). Ten frequently asked questions about latent class analysis. Translational Issues in Psychological Science. © 2018, American Psychological Association. This paper is not the copy of record and may not exactly replicate the final, authoritative version of the article. Please do not copy or cite without authors' permission. The final article will be available, upon publication, via its DOI: 10.1037/tps0000176 LATENT CLASS ANALYSIS FREQUENTLY ASKED QUESTIONS 2 ABSTRACT Latent class analysis (LCA) is a statistical method used to identify unobserved subgroups in a population with a chosen set of indicators. Given the increasing popularity of LCA, our aim is to equip psychological researchers with the theoretical and statistical fundamentals that we believe will facilitate the application of LCA models in practice.
    [Show full text]
  • UNIVERSITY of CALIFORNIA Santa Barbara the Intersection of Sample
    UNIVERSITY OF CALIFORNIA Santa Barbara The Intersection of Sample Size, Number of Indicators, and Class Enumeration in LCA: A Monte Carlo Study A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Education by Diane Morovati Dissertation Committee: Professor Karen Nylund-Gibson Professor John Yun Professor Nancy Collins June 2014 The dissertation of Diane Morovati is approved. _____________________________________________ John T. Yun _____________________________________________ Nancy L. Collins _____________________________________________ Karen Nylund-Gibson, Committee Chair June 2014 The Intersection of Sample Size, Number of Indicators, and Class Enumeration in LCA: A Monte Carlo Study Copyright © 2014 by Diane Morovati iii To my Mom, Thank you for teaching me to work hard and never give up. This is just the beginning… iv ACKNOWLEDGEMENTS First and foremost, I acknowledge my parents, Lia and Morad Morovati, for unconditionally loving me and providing me with so much to be grateful for. This dissertation is dedicated to my mom because she deserves special praise for always pushing me to strive for the best, even when I wanted to give up. Mom, thank you for always believing in me. Thank you for raising me to be strong and independent. I wouldn’t be where I am today if it wasn’t for you. I love you and I am so proud to be your daughter. To my younger sister and best friend, Dahlia Morovati, and my little brother, Daniel Morovati, words cannot express my love for you both. I am so proud of you and look forward to your many future accomplishments. I want to thank my grandmother, Faezehjoon, for being such a strong and supportive woman in my life.
    [Show full text]
  • An Introduction to Latent Class Growth Analysis and Growth Mixture Modeling Tony Jung and K
    Social and Personality Psychology Compass 2/1 (2008): 302–317, 10.1111/j.1751-9004.2007.00054.x An Introduction to Latent Class Growth Analysis and Growth Mixture Modeling Tony Jung and K. A. S. Wickrama* Iowa State University Abstract In recent years, there has been a growing interest among researchers in the use of latent class and growth mixture modeling techniques for applications in the social and psychological sciences, in part due to advances in and availability of computer software designed for this purpose (e.g., Mplus and SAS Proc Traj). Latent growth modeling approaches, such as latent class growth analysis (LCGA) and growth mixture modeling (GMM), have been increasingly recognized for their usefulness for identifying homogeneous subpopulations within the larger heterogeneous population and for the identification of meaningful groups or classes of individuals. The purpose of this paper is to provide an overview of LCGA and GMM, compare the different techniques of latent growth modeling, discuss current debates and issues, and provide readers with a practical guide for conducting LCGA and GMM using the Mplus software. Researchers in the fields of social and psychological sciences are often interested in modeling the longitudinal developmental trajectories of individuals, whether for the study of personality development or for better understanding how social behaviors unfold over time (whether it be days, months, or years). This usually requires an extensive dataset con- sisting of longitudinal, repeated measures of variables, sometimes including multiple cohorts, and analyzing this data using various longitudinal latent variable modeling techniques such as latent growth curve models (cf. MacCallum & Austin, 2000).
    [Show full text]
  • Chapter Number
    A Longitudinal Typology of Neighbourhood-level Social Fragmentation: A Finite Mixture Model Approach Peter Lekkas a Natasha J Howard b Ivana Stankov c Mark Daniel d Catherine Paquet a Abstract Neighbourhoods are social enclaves. And, from an epidemiological vantage there is substantive research examining how social traits of neighbourhoods affect health. However, this research has often focused on the effects of social deprivation. Less attention has been given to social fragmentation (SF), a construct aligned with the notions of lesser: social cohesion, social capital, collective functioning, and social isolation. Concurrently, there has been limited research that has described the spatial and temporal patterning of neighbourhood-level social traits. With a focus on SF the main aims of this paper were to model and describe the time-varying and spatial nature of SF. Conceptually, this research was informed by ‘thinking in time’ and by the ‘lifecourse-of-place’ perspective. While, from an analytical perspective, a longitudinal (3-time points over 10-years) neighbourhood database was created for the metropolitan region of Adelaide, Australia. Latent Transition Analysis was then used to model the developmental profile of SF where neighbourhoods were proxied by ‘suburbs’, and the measurement model for SF was formed of 9-conceptually related census-based indicators. A four-class, nominal-level latent status model of SF was identified: class- A=low SF; class-B=mixed-level SF/inner urban; class-C=mixed-level SF/peri-urban; and class-D=high SF. Class-A and -D neighbourhoods were the most prevalent at all time points. And, while certain neighbourhoods were inferred to have changed their SF class across time, most neighbourhoods were characterised by intransience.
    [Show full text]
  • Latent Class Analysis and Finite Mixture Modeling
    OXFORD LIBRARY OF PSYCHOLOGY Editor-in-Chief PETER E. NATHAN The Oxford Handbook of Quantitative Methods Edited by Todd D. Little VoLUME 2: STATISTICAL ANALYSIS OXFORD UNIVERSITY PRESS OXFORD UN lVI!RSlTY PRESS l )xfiml University Press is a department of the University of Oxford. It limhcrs the University's objective of excellence in research, scholarship, nml cduc:uion by publishing worldwide. Oxlilrd New York Auckland Cape Town Dares Salaam Hong Kong Karachi Kuala Lumpur Madrid Melbourne Mexico City Nairobi New Delhi Shanghai Taipei Toronto With offices in Arf\CIIIina Austria Brazil Chile Czech Republic France Greece ( :umcnmln Hungary Italy Japan Poland Portugal Singapore Smuh Knr~'a Swir£crland Thailand Turkey Ukraine Vietnam l lxllml is n registered trademark of Oxford University Press in the UK and certain other l·nttntrics. Published in the United States of America by Oxfiml University Press 198 M<tdison Avenue, New York, NY 10016 © Oxfi.ml University Press 2013 All rights reserved. No parr of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, without the prior permission in writing of Oxford University Press, or as expressly permitted by law, hy license, nr under terms agreed with the appropriate reproduction rights organization. Inquiries concerning reproduction outside the scope of the above should be sent to the Rights Department, Oxford University Press, at the address above. You must not circulate this work in any other form and you must impose this same condition on any acquirer. Library of Congress Cataloging-in-Publication Data The Oxford handbook of quantitative methods I edited by Todd D.
    [Show full text]
  • Latent Class Cluster Analysis
    1 LATENT CLASS CLUSTER ANALYSIS Jeroen K. Vermunt Tilburg University Jay Magidson Statistical Innovations Inc. INTRODUCTION Kaufman and Rousseeuw (1990) define cluster analysis as the classification of similar objects into groups, where the number of groups, as well as their forms are unknown. The \form of a group" refers to the parameters of cluster; that is, to its cluster-specific means, variances, and covariances that also have a geometrical interpretation. A similar definition is given by Everitt (1993) who speaks about deriving a useful division into a number of classes, where both the number of classes and the properties of the classes are to be determined. These could also be definitions of exploratory LC analysis, in which objects are assumed to belong to one of a set of K latent classes, with the number of classes and their sizes not known a priori. In addition, objects belonging to the same class are similar with respect to the observed variables in the sense that their observed scores are assumed to come from the same probability distributions, whose parameters are, however, unknown quantities to be estimated. Because of the similarity between cluster and exploratory LC analysis, it is not surprising that the latter method is becoming a more and more popular clustering tool. In this paper, we want to describe the state-of-art in the field of LC cluster analysis. Most of the work in this field involves continuous indicators assuming (restricted) multivariate normal distributions within classes. Although authors seldom refer to the work of Gibson (1959) and Lazarsfeld and Henry (1968), actually they are using what these authors called latent profile analysis: that is, latent structure models with a single categorical latent variable and a set of continuous indicators.
    [Show full text]
  • The Utility of Latent Class Analysis to Understand Heterogeneity in Youth’S Coping Strategies
    The Utility of Latent Class Analysis to Understand Heterogeneity in Youth’s Coping Strategies: A Methodological Introduction Karen Nylund-Gibson1, Adam C. Garber1, Jay Singh1, Melissa R. Witkow2 Adrienne Nishina3, Amy Bellmore4 1University of California, Santa Barbara, 2Willamette University, 3University of California, Davis, 4Univerity of Wisconsin Corresponding Author: Karen Nylund-Gibson, [email protected] Preprint published February 9, 2021 Note: This preprint has not been peer reviewed The research reported in this paper was supported in part by the Institute of Education Sciences, U.S. Department of Education, through Grant # R305A170559 to the University of California, Davis. The opinions expressed are those of the authors and do not represent views of the Institute of Education Sciences or the U.S. Department of Education 1 Abstract Latent class analysis (LCA) is a useful statistical approach for understanding heterogeneity in a population. This paper provides a pedagogical introduction to LCA modeling and provides an example of its use to understand youth’s daily coping strategies. The analytic procedures are outlined for choosing the number of classes and integration of the LCA variable within a structural equation model framework, specifically a latent class moderation model, and a detailed table provides a summary of relevant modeling steps. This applied example demonstrates the modeling context when the LCA variable is moderating the association between a covariate and two outcome variables. Results indicate that students’ coping strategies moderate the association between social stress and negative mood, however they do not moderate the social stress-positive mood association. Appendices include R (MplusAutomation) code to automate the enumeration procedure, 3-step auxiliary variable integration, and the generation of figures for visually depicting LCA results.
    [Show full text]
  • Approximating a Similarity Matrix by a Latent Class Model Cajo J.F
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Wageningen University & Research Publications Approximating a similarity matrix by a latent class model Cajo J.F. ter Braak, Yiannis Kourmpetis, Henk A. L. Kiers, Marco C. A. M. Bink This is a "Post-Print" accepted manuscript, which has been published in Computational Statistics and Data Analysis This version is distributed under the Creative Commons Attribution 3.0 Netherlands License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Please cite this publication as follows: ter Braak, C. J. F., Kourmpetis, Y, Kiers, H. A. L, Bink, M. C. A. M. (2009). Approximating a similarity matrix by a latent class model: a reappraisal of additive fuzzy clustering. Computational Statistics and Data Analysis 53 (8) 3183-3193. doi:10.1016/j.csda.2008.10.004 You can download the published version at: http://dx.doi.org/10.1016/j.csda.2008.10.004 NOTICE: this is the author's version of a work that was accepted for publication in Computational Statistics and Data Analysis. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Computational Statistics and Data Analysis 53 (8) 3183-3193 and can be downloaded from: http://dx.doi.org/10.1016/j.csda.2008.10.004 Approximating a similarity matrix by a latent class model Cajo J.F.
    [Show full text]
  • Regularized Latent Class Analysis for Polytomous Item Responses: an Application to SPM-LS Data
    Journal of Intelligence Article Regularized Latent Class Analysis for Polytomous Item Responses: An Application to SPM-LS Data Alexander Robitzsch 1,2 1 IPN—Leibniz Institute for Science and Mathematics Education, D-24098 Kiel, Germany; [email protected] 2 Centre for International Student Assessment (ZIB), D-24098 Kiel, Germany Received: 9 July 2020; Accepted: 10 August 2020; Published: 14 August 2020 Abstract: The last series of Raven’s standard progressive matrices (SPM-LS) test was studied with respect to its psychometric properties in a series of recent papers. In this paper, the SPM-LS dataset is analyzed with regularized latent class models (RLCMs). For dichotomous item response data, an alternative estimation approach based on fused regularization for RLCMs is proposed. For polytomous item responses, different alternative fused regularization penalties are presented. The usefulness of the proposed methods is demonstrated in a simulated data illustration and for the SPM-LS dataset. For the SPM-LS dataset, it turned out the regularized latent class model resulted in five partially ordered latent classes. In total, three out of five latent classes are ordered for all items. For the remaining two classes, violations for two and three items were found, respectively, which can be interpreted as a kind of latent differential item functioning. Keywords: regularized latent class analysis, regularization, fused regularization, fused grouped regularization, distractor analysis 1. Introduction There has been recent interest in assessing the usefulness of short versions of the Raven’s Progressive Matrices. Myszkowski and Storme(2018) composed the last 12 matrices of the Standard Progressive Matrices (SPM-LS) and argued that it could be regarded as a valid indicator of general intelligence g.
    [Show full text]