Technology and Testing
From early answer sheets filled in with number 2 pencils, to tests administered by mainframe computers, to assessments wholly constructed by computers, it is clear that technology is changing the field of educational and psychological measurement. The numerous and rapid advances have immediate impact on test creators, assessment professionals, and those who implement and analyze assessments. This comprehensive new volume brings together leading experts on the issues posed by technological applications in testing, with chapters on game-based assessment, testing with simulations, video assessment, computerized test development, large-scale test delivery, model choice, validity, and error issues. Including an overview of existing literature and groundbreaking research, each chapter considers the technological, practical, and ethical considerations of this rapidly changing area. Ideal for researchers and professionals in testing and assessment, Technology and Testing provides a critical and in-depth look at one of the most pressing topics in educational testing today.

Fritz Drasgow is Professor of Psychology and Dean of the School of Labor and Employment Relations at the University of Illinois at Urbana-Champaign, USA.

The NCME Applications of Educational Measurement and Assessment Book Series

Editorial Board:
Michael J. Kolen, The University of Iowa, Editor
Robert L. Brennan, The University of Iowa
Wayne Camara, ACT
Edward H. Haertel, Stanford University
Suzanne Lane, University of Pittsburgh
Rebecca Zwick, Educational Testing Service

Technology and Testing: Improving Educational and Psychological Measurement
Edited by Fritz Drasgow

Forthcoming:

Meeting the Challenges to Measurement in an Era of Accountability
Edited by Henry Braun

Testing the Professions: Credentialing Policies and Practice
Edited by Susan Davis-Becker and Chad W. Buckendahl

Fairness in Educational Assessment and Measurement
Edited by Neil J. Dorans and Linda L.
Cook

Validation of Score Meaning in the Next Generation of Assessments
Edited by Kadriye Ercikan and James W. Pellegrino

Technology and Testing
Improving Educational and Psychological Measurement

Edited by Fritz Drasgow

First published 2016 by Routledge
711 Third Avenue, New York, NY 10017
and by Routledge
2 Park Square, Milton Park, Abingdon, Oxon, OX14 4RN

Routledge is an imprint of the Taylor & Francis Group, an informa business

© 2016 Taylor & Francis

The right of the editor to be identified as the author of the editorial material, and of the authors for their individual chapters, has been asserted in accordance with sections 77 and 78 of the Copyright, Designs and Patents Act 1988.

All rights reserved. No part of this book may be reprinted or reproduced or utilised in any form or by any electronic, mechanical, or other means, now known or hereafter invented, including photocopying and recording, or in any information storage or retrieval system, without permission in writing from the publishers.

Trademark notice: Product or corporate names may be trademarks or registered trademarks, and are used only for identification and explanation without intent to infringe.

Library of Congress Cataloging-in-Publication Data
Technology and testing : improving educational and psychological measurement / edited by Fritz Drasgow.
pages cm. -- (Ncme applications of educational measurement and assessment book series)
Includes bibliographical references and index.
1. Educational tests and measurements. 2. Educational technology. I. Drasgow, Fritz.
LB3051.T43 2015
371.26--dc23
2015008611

ISBN: 978-0-415-71715-1 (hbk)
ISBN: 978-0-415-71716-8 (pbk)
ISBN: 978-1-315-87149-3 (ebk)

Typeset in Minion by Apex CoVantage, LLC

Contents

List of Contributors
Foreword
Preface

1. Managing Ongoing Changes to the Test: Agile Strategies for Continuous Innovation
CYNTHIA G. PARSHALL AND ROBIN A. GUILLE

2. Psychometrics and Game-Based Assessment
ROBERT J. MISLEVY, SETH CORRIGAN, ANDREAS ORANJE, KRISTEN DICERBO, MALCOLM I. BAUER, ALINA VON DAVIER, AND MICHAEL JOHN

3. Issues in Simulation-Based Assessment
BRIAN E. CLAUSER, MELISSA J. MARGOLIS, AND JEROME C. CLAUSER

4. Actor or Avatar? Considerations in Selecting Appropriate Formats for Assessment Content
ERIC C. POPP, KATHY TUZINSKI, AND MICHAEL FETZER

Commentary on Chapters 1–4: Using Technology to Enhance Assessments
STEPHEN G. SIRECI

5. Using Technology-Enhanced Processes to Generate Test Items in Multiple Languages
MARK J. GIERL, HOLLIS LAI, KAREN FUNG, AND BIN ZHENG

6. Automated Test Assembly
KRISTA BREITHAUPT AND DONOVAN HARE

7. Validity and Automated Scoring
RANDY ELLIOT BENNETT AND MO ZHANG

Commentary on Chapters 5–7: Moving From Art to Science
MARK D. RECKASE

8. Computer-Based Test Delivery Models, Data, and Operational Implementation Issues
RICHARD M. LUECHT

9. Mobile Psychological Assessment
OLEKSANDR S. CHERNYSHENKO AND STEPHEN STARK

10. Increasing the Accessibility of Assessments Through Technology
ELIZABETH STONE, CARA C. LAITUSIS, AND LINDA L. COOK

11. Testing Technology and Its Effects on Test Security
DAVID FOSTER

Commentary on Chapters 8–11: Technology and Test Administration: The Search for Validity
KURT F. GEISINGER

12. From Standardization to Personalization: The Comparability of Scores Based on Different Testing Conditions, Modes, and Devices
WALTER D. WAY, LAURIE L. DAVIS, LESLIE KENG, AND ELLEN STRAIN-SEYMOUR

13. Diagnostic Assessment: Methods for the Reliable Measurement of Multidimensional Abilities
JONATHAN TEMPLIN

14. Item Response Models for CBT
DANIEL BOLT

15. Using Prizes to Facilitate Change in Educational Assessment
MARK D. SHERMIS AND JAISON MORGAN

Commentary on Chapters 12–15: Future Directions: Challenge and Opportunity
EDWARD HAERTEL

Index

Contributors

Malcolm I.
Bauer is a senior managing scientist in the Cognitive Science group at the Educational Testing Service. His research has two main foci: game-based assessment for complex problem solving and non-cognitive skills, and learning progressions for assessment in STEM areas.

Randy Elliot Bennett holds the Norman O. Frederiksen Chair in Assessment Innovation at Educational Testing Service in Princeton, New Jersey. His work concentrates on integrating advances in the learning sciences, measurement, and technology to create new approaches to assessment that measure well and that have a positive impact on teaching and learning.

Daniel Bolt is Professor of Educational Psychology at the University of Wisconsin, Madison and specializes in quantitative methods. His research interests are in item response theory (IRT), including multidimensional IRT and applications. He teaches courses in the areas of test theory, factor analysis, and hierarchical linear modeling, and consults on various issues related to the development of measurement instruments in the social sciences. Dan has been the recipient of a Vilas Associates Award and Chancellor's Distinguished Teaching Award at the University of Wisconsin.

Krista Breithaupt, Ph.D., has 25 years of experience with modern psychometric methods and computerized testing, including assessment in clinical epidemiology, nursing, and population health. Krista served for 10 years as a psychometrician and then as Director of Psychometrics at the American Institute of CPAs (AICPA). She guided research and development to computerize a national licensing exam for CPAs, contributing innovative research in test development, assembly, scoring, and delivery technologies for the national licensing exam administered to 250,000 examinees annually. Krista held the position of Director, Research and Development (retired) at the Medical Council of Canada.
In this post, Krista defined and led national studies to determine the skills, knowledge, and attitudes required for licensure as a physician in Canada. Krista developed and guided studies to validate and streamline the delivery, scoring, and comparability of certification examinations for Canadian and internationally trained physicians in Canada.

Oleksandr S. Chernyshenko received his Ph.D. from the University of Illinois at Urbana-Champaign in 2002 with a major in industrial organizational psychology and minors in quantitative methods and human resource management. He is currently an Associate Professor at Nanyang Technological University in Singapore, conducting research and teaching courses related to personnel selection. Dr. Chernyshenko is an Associate Editor of the journal Personnel Assessment and Decisions and serves on the editorial board of the International Journal of Testing.

Brian E. Clauser has worked for the past 23 years at the National Board of Medical Examiners, where he has conducted research on automated scoring of complex assessment formats, standard setting, and applications of generalizability theory. He led the research project that developed the automated scoring system for a computer-simulation-based assessment of physicians’ patient management skills. The simulation is part of the licensing examination for physicians. Brian is also the immediate past editor of the Journal of Educational Measurement.

Jerome C. Clauser is a Research Psychometrician at the American Board of Internal Medicine. His research interests include standard setting, equating, and innovative item types.

Linda L. Cook is recently retired from Educational Testing Service. While employed by ETS, she served in a number of roles including Executive Director of the Admissions and