CONFERENCE SCHEDULE

Monday, March 4
Pre-Conference Workshops

8:00am to 3:00pm  Conference registration open

9:00am to 4:00pm  Workshop 1 (Day 1): R & RStudio for Reproducible Language Test Analysis, Research, and Reporting
  Leaders: Geoffrey T. LaFlair, Duolingo and Daniel R. Isbell, Michigan State University
  Location: Mary Gay C

9:00am to 4:00pm  Workshop 2: Constructing Measures: An Introduction to Analyzing Proficiency Test Data Using IRT with jMetrik
  Leader: Troy L. Cox, Center for Language Studies
  Location: Henry Oliver

Tuesday, March 5
Pre-Conference Workshops & Events

8:00am to 7:00pm  Conference registration open

9:00am to 4:00pm  Workshop 1 (Day 2): R & RStudio for Reproducible Language Test Analysis, Research and Reporting
  Leaders: Geoffrey T. LaFlair, Duolingo and Daniel R. Isbell, Michigan State University
  Location: Mary Gay C

9:00am to 4:00pm  Workshop 3: Corpus-based development and validation of language tests: Using corpora of and for
  Leaders: Darren Perrett, Cambridge Assessment English and Brigita Séguis, Cambridge Assessment English
  Location: Henry Oliver

9:00am to 1:00pm  Workshop 4: The Civil and Human Rights of Language: Guided Tour of the Atlanta Civil & Human Rights Museum
  Leaders: Elana Shohamy, Tel Aviv University and Tim McNamara, University of Melbourne
  Meeting Location TBD

11:00am to 3:30pm  ILTA Pre-Conference Executive Advisory Board Meeting
  Location: TBD

4:00 to 5:15pm  Newcomer & Strategic Planning Session
  Location: Decatur B

Tuesday, March 5

5:15 to 6:45pm  Alan Davies Lecture
  Bringing tests to justice: can we make a difference?
  Dr. Catherine Elder, University of Melbourne
  Sponsored by the British Council
  Location: Decatur B

7:00 to 8:30pm  Opening Reception
  Historic DeKalb Courthouse (101 East Court Square, Decatur)

Wednesday, March 6

7:15 to 5:00pm  Conference registration open

7:30 to 8:30am  Latin American Association of Language Testing & Assessment Meeting
  Location: Decatur B

Concurrent Sessions: Demonstrations

8:00 to 8:30am
  Mary Gay C: Automated essay scoring: An objective support to human raters
    Matthew Kyle Martin, Matthew Wilcox
  Henry Oliver: Talk2Me Jr: A Digital Language and Literacy Assessment Tool
    Samantha Dawn McCormick, Hyunah Kim, Jeanne Sinclair, Clarissa Lau, Megan Vincett, Chris Barron, Eunice Eunhee Jang
  Decatur A: Personalized language learning as language assessment: A case study of two large learner corpora
    Burr Settles, Masato Hagiwara, Erin Gustafson, Chris Brust

8:30 to 10:35am  Opening Symposium: Local needs and global priorities in ensuring fair test use: synergies and tension in balancing the two perspectives
  Jamie Mark Dunlea, Jessica Wu, Yan Jin, Wei Wang, Haoran Yang, Quynh Nguyen, Barry O'Sullivan
  Location: Decatur B

Wednesday, March 6

10:35 to 10:55am  Coffee Break
  Location: Lobby

Concurrent Sessions: Papers

10:55 to 11:25am
  Mary Gay C: Understanding teacher educators’ language assessment literacy practices while training pre-service English teachers in Chile
    Salomé Villa Larenas
  Henry Oliver: Reading Self-Concept and Reading Achievement in Monolingual and Multilingual Students: A Cross-Panel Multiple-Group SEM Analysis
    Christopher Douglas Barron
  Decatur A: Rater cognition and the role of individual attributes in rating speaking performances
    Kathrin Eberharter
  Decatur B: Test Consequence on Rural Chinese Students: Investigating Learner Washback of National College Entrance English Exam
    Yangting Wang, Mingxia Zhi

11:30am to 12:00pm
  Mary Gay C: Using the Language Assessment Literacy Survey with an Under Researched Population: The Case of Uzbekistan EFL Teachers
    David Lawrence Chiesa
  Henry Oliver: What did the Reading Assessment Miss?
    Elizabeth Lee
  Decatur A: The Effect of Raters’ Perception of Task Complexity on Rater Severity in a Second Language Performance-based Oral Communication Test
    Yongkook Won
  Decatur B: Examining Washback on Learning from a Sociocultural Perspective: The Case of a Graded Approach to English Language Testing in Hong Kong
    Chi Lai Tsang

12:00 to 1:30pm  Lunch Break on your own
  Language Testing Editorial Board Meeting, Location: Henry Oliver

Wednesday, March 6

Concurrent Sessions: Works in Progress
1:30 to 3:00pm
Location: Decatur A

1. “That’s a waste of time, going back and reading the text again!” – Cognitive processes in an integrated summary writing task
   Sonja Zimmermann
2. Studying item difficulty. Insights from a multilingual foreign
   Katharina Karges
3. Development of Scales for the Assessment of Young Learners Functional Writing Proficiency
   Gustaf Bernhard Uno Skar, Lennart Joelle
4. Investigating the Interactiveness of IELTS Academic Writing Tasks and Their Washback on EFL Teachers’ Test Preparation Practices
   Parisa Safaei, Shahrzad Saif
5. The methods dealing with dependent effect sizes in a meta-analysis: a review in reading research area
   Jingxuan Liu, Xiaoyun Zhang, Hongli Li, Xinyuan Yang
6. Towards the Democratisation of the Assessment of English as a Lingua Franca
   Sheryl Cooke
7. Understanding Young Learners’ Spoken Academic Language Development through Analyzing Oral Proficiency Test Responses
   Megan Montee, Mark Chapman
8. Validating the use of a web-based rating system for oral proficiency interviews
   Jing Xu, Anne Clarke, Andrew Mullooly, Claire McCauley
9. Test preparation materials for the Test of Workplace Essential Skills (TOWES): Validating materials for adult literacy and numeracy
   Claire Elizabeth Reynolds
10. Students’ and Teachers’ Perception and Use of Diagnostic Feedback
    Hang Sun
11. Effects of Reader and Task Variables in L2 Reading Comprehension and Speed
    Toshihiko Shiotsu
12. The Effect of Genre on Linguistic Features of Source-Based Essays by Tertiary Learners: Implications for Construct Validity
    Sukran Saygi, Zeynep Aksit
13. Investigation of Social Justice Violation in an English Proficiency Test for PhD Candidates in Iran
    Masood Siyyari, Negar Siyari

Wednesday, March 6

Concurrent Sessions: Works in Progress
1:30 to 3:00pm
Location: Mary Gay C

14. Developing a Digital Simulation to Measure L2 Intercultural, Pragmatic and Interactional Competence: Initial Pilot Results
    Linda Forrest, Ayşenur Sağdıç, Julie Sykes, Margaret Malone
15. Using unscripted spoken texts in college-level L2 Mandarin assessment
    Xian Li
16. LAL: what is it to teachers in their classrooms?
    Sonia Patricia Hernandez-Ocampo
17. A case study of evidence-centered exam design and listening: Washback from placement test development to program tests, KSAs, and pedagogy
    Gerriet Janssen, Olga Inés Gómez
18. Development of an ITA Assessment Instrument based on English as a Lingua Franca
    Heesun Chang
19. Applying a summative assessment speaking test for formative assessment gains: The case of a computerized speaking test in Israel
    Tziona Levi, Ofra Inbar-Lourie
20. Creating a Socially Responsible Language Diagnostic Tool to Support At-Risk Students at a Canadian Technical College
    Nathan J Devos
21. Examining values and potential consequences in argument-based approaches to TOPIK-Speaking validation process
    Soohyeon Park, Gwan-Hyeok Im, Dongil Shin
22. Aviation English Proficiency Test Design for a Group of Brazilian Military Pilots: A Case Study
    Ana Lígia Barbosa de Carvalho e Silva

3:00 to 3:20pm  Coffee Break
  Location: Lobby

3:20 to 5:20pm  Symposium: Language Proficiency Assessment and Social Justice in the US K-12 Educational Context
  Mark Chapman, Margo Gottlieb, Keira Ballantyne, H. Gary Cook, Paula Winke, Todd Ruecker, Micheline Chalhoub-Deville
  Location: Decatur B

Wednesday, March 6

Concurrent Sessions: Papers

3:20 to 3:50pm
  Mary Gay C: Investigating the validity of a writing scale and rubric using a corpus-based analysis of grammatical features
    Susie Kim
  Henry Oliver: Assessing Workplace Listening Comprehension of Thai Undergraduates in English as an Asian Lingua Franca Contexts
    Panjanit Chaipuapae
  Decatur A: Justifying the Use of Scenario-Based Assessment to Measure Complex Constructs of Communicative Language Competence
    Heidi Liu Banerjee

3:55 to 4:25pm
  Mary Gay C: French Learners’ Use of Sources in an Integrated Writing Assessment Task
    Anna Mikhaylova
  Henry Oliver: Item-level analyses of a listening for implicature test: Evidence against an implicature subskill construct?
    Stephen O’Connell
  Decatur A: Differential Item Functioning in GEPT-Kids Listening
    Linyu Liao

4:30 to 5:00pm
  Mary Gay C: Formative Assessment through Automated Corrective Feedback in [...]: A Case Study of Criterion
    Giang Thi Linh Hoang
  Henry Oliver: Content-rich videos in academic L2 listening tests: A validity study
    Roman Olegovich Lesnov
  Decatur A: Raters’ Perceptions and Operationalization of (In)Authenticity in Oral Proficiency Tests
    John Dylan Burton

5:05 to 5:35pm
  Mary Gay C: Individualized Feedback to Raters: Effects on Rating Severity, Inconsistency, and Bias in the Context of Chinese as a Second Language Writing Assessment
    Jing Huang, Gaowei Chen
  Henry Oliver: Examining the effects of foreign-accented lectures on an academic listening test at the item level using differential item functioning analysis
    Sun-Young Shin, Ryan Lidster, Senyung Lee
  Decatur A: Investigating variance sources and score dependability of an ITA speaking test for construct-related validity and fairness: A mixed method G-theory study
    Ji-young Shin

6:45pm  Networking Dinners (advance registration required)
  Meet in Lobby

Thursday, March 7

7:15am to 5:00pm  Conference registration open

Concurrent Sessions: Demonstrations

8:00 to 8:30am
  Mary Gay C: Integrating Social Justice and Student Performance Online
    Noah McLaughlin
  Henry Oliver: Development and Use of an Automatic Scoring Model for Spoken Response Test
    Judson Hart, Troy Cox, Matthew Wilcox
  Decatur A: BEST Plus 3.0: Assessing Speaking Using a Multi-Stage Adaptive Test
    Megan Montee, Daniel Lee

Concurrent Sessions: Symposia

8:35 to 10:35am
  Decatur A: Toward Social Justice in L2 Classroom Assessment Theory and Practice: The Potential of Praxis
    Matthew E. Poehner, Ofra Inbar-Lourie, Constant Leung, Tziona Levi, Luke Harding, Tineke Brunfaut, Remi van Compernolle, Angela Scarino
  Decatur B: Aligning language tests to external proficiency scales: validity issues
    Sha Wu, Lianzhen He, Jianda Liu, Han Yu, Richard J. Tannenbaum, Spiros Papageorgiou, Ching-Ni Hsieh, Shangchao Min, Hongwen Cai, Jie Zhang, Jamie Dunlea, Richard Spiby

10:35 to 10:55am  Coffee Break
  Lobby

11:00am to 12:00pm  Messick Lecture
  Testing and the Public Interest
  Dr. Joan Herman, UCLA/CRESST
  Sponsored by Educational Testing Service
  Location: Decatur B

12:00 to 1:45pm  Lunch Provided (first 100 takers)
  ILTA Annual Business Meeting
  Location: Decatur B

Thursday, March 7

Concurrent Sessions: Posters
1:30 to 3:00pm
Location: Lobby

Do students’ motivation and locus of control impact writing performance through their perceived writing competency?
  Clarissa Lau, Chris Barron

Deconstructing writing and a writing scale: How a decision tree guides raters through a holistic, profile-based rating scale
  Hyunji Park, Xun Yan

Assessing EFL college students’ speaking performance through Google Hangouts
  Yu-Ting Kao

Is it fair to use scores from a test of grammar and vocabulary to refine grade boundary decisions in other skill areas?
  Karen Dunn, Gareth McCray

Test taker characteristics as predictors of holistic score on independent and integrated-skills writing tasks
  Analynn Bustamante, Scott Crossley

An Investigation of the Validity of a New Speaking Assessment for Adolescent EFL Learners
  Becky Huang, Alison Bailey, Shawn Chang, Yangting Wang

Instructors as Agents of Change: A Systematic Approach to Developing Proficiency-Oriented Assessments in Less Commonly Taught Languages
  Shinhye Lee, Ahmet Dursun, Nicholas Swinehart

Multimodality, Social Semiotics, and Literacy: How LESLLA Learners from Refugee Backgrounds Make Meaning in Official U.S. Naturalization Test Study Materials
  Jenna Ann Altherr Flores

Raters’ decision-making processes in an integrated writing test: An eye-tracking study
  Phuong Nguyen

Story of an education system accountable for exam-success but not for learning: A washback study
  Nasreen Sultana

Using multimodal tasks to promote more equitable assessment of English learners in the content areas
  Scott Grapin, Lorena Llosa

Familiarizing standard-setting panelists with the CEFR: A three-step approach to attaining a shared understanding of just-qualified candidates
  Sharon Pearce, Patrick McLain, Tony Clark

Using Machine Learning Techniques in Building and Evaluating Automated Scoring Models for ITAs’ Speaking Performances
  Ziwei Zhou

A systematic review: Ensuring high quality ELP assessments for all
  Jo-Kate Collier

Bridge to Seven (Language Testing and Social Justice)
  Johanna Motteram

Thursday, March 7

Concurrent Sessions: Posters
1:30 to 3:00pm
Location: Lobby

Beyond the Test Score: Developing Listening Test Feedback & Activities to Empower Young Learners and Teachers of English
  Brent Miller, Luke Slisz, Patrick McLain, Rachele Stucker, Renee Saulter

Cyberpragmatics: Assessing Pragmatics through Interactive Email Communication
  Iftikhar Haider

Reverse-engineering L2 reading and listening assessments for sub-score-reporting purposes
  Yeonsuk Cho, Chris Hamill

Scenario-based tasks for a large-scale foreign language assessment: a mixed-methods exploratory study
  Malgorzata Barras, Katharina Karges, Peter Lenz

Developing a local-made English test for Thai EFL grade 6 students: Concurrent validity and fairness issues
  Jirada Wudthayagorn, Chatraporn Piamsai, Pan-gnam Chairaksak

Holistic and analytic scales of a paired oral test for Japanese learners of English
  Rie Koizumi, Yo In’nami, Makoto Fukazawa

Accessibility in testing: generating research from good practice
  Richard David Spiby, Judith Fairbairn

Listening to test-takers’ perspective in the validation process: the case of the Aviation English Proficiency Exam for Brazilian Air Traffic Controllers
  Natalia de Andrade Raymundo

Pre-service and in-service language teachers’ conceptions of LA: towards the construction of LAL knowledge base
  Sonia Patricia Hernandez-Ocampo

Language assessment literacy in Brazil: analyses of undergraduate and graduate courses at federal universities
  Gladys Quevedo-Camargo, Matilde V. R. Scaramucci

Impact of Language Background on Response Similarity Analysis
  James Robert Davis

Japanese EFL Learners’ Speech-in-Noise Listening Comprehension Process: Use of Context Information
  Ryoko Fujita

Certifying language ability for immigration purposes in Switzerland
  Peter Lenz

Comparing rater and score reliability under holistic and analytic rating scales in assessing speech acts in L2 Chinese
  Shuai Li

Exploring raters’ perceptions of Oral Proficiency Interview Tasks as “promotion” or “demotion”
  Jeremy Ray Gevara, Troy Cox, Larissa Grahl, Logan Blackwell

Mapping the Path to Advanced Second Language Literacy in Adults Using Eye-Tracking: A Look at Portuguese
  Troy Cox, Larissa Grahl, Logan Blackwell

Thursday, March 7

3:00 to 3:20pm  Coffee Break
  Location: Lobby

Concurrent Sessions: Papers

3:20 to 3:50pm
  Mary Gay C: Analyzing stakeholders’ voices in the aviation context: a glocal perspective
    Natalia de Andrade Raymundo
  Henry Oliver: Positioning students as active learners: An examination of student-generated question quality in literacy assessment
    Hyunah Kim, Megan Vincett, Samantha Dawn McCormick, Melissa Hunte, Xue Lin
  Decatur A: Enhancing the Interpretability and Usefulness of TEPS Section Scores Through Alignment with CEFR
    Heesung Jun, Euijin Lim, Yong-Won Lee
  Decatur B: Assessing textual sophistication and linguistic complexity in L2 writing
    Jianling Liao

3:55 to 4:25pm
  Mary Gay C: The domain expert perspective on workplace readiness: Investigating the standards set on the writing component of an English language proficiency test for health professionals
    Simon Davidson
  Henry Oliver: Do test accessibility features have the intended effect for K-12 English learners?
    Ahyoung Alicia Kim, Meltem Yumsek, Mark Chapman, H. Gary Cook
  Decatur A: High-stakes tests can improve learning - Reality or wishful thinking?
    Jessica Wu, Judy Lo, Anita Chun-Wen Lin
  Decatur B: Linguistic Tools in Writing Assessment: Their Impact on Test-takers’ Writing Process and Performance
    Saerhim Oh

Thursday, March 7

Concurrent Sessions: Papers

4:30 to 5:00pm
  Mary Gay C: How valid are language tests used in the overseas-trained nurse registration processes?
    Ute Knoch, Sally O’Hagan
  Henry Oliver: Empowering K-12 Teachers to Make Better Use of High-Stakes Summative ELP Assessments
    Alexis Lopez
  Decatur A: Source Use Behavior and Raters’ Judgement in L2 Academic Writing
    Pakize Uludag, Heike Neumann, Kim McDonough
  Decatur B: Unpacking the textual features, vocabulary use, and source integration in integrated listening-to-write assessments for adolescent English language learners
    Renka Ohta, Jui-Teng Liao

5:05 to 5:35pm
  Mary Gay C: Assessing clinical communication on the Occupational English Test: The intersection of cognitive and consequential validity
    Brigita Séguis, Barbara Ying Zhang, Gad Lim
  Henry Oliver: Strategies Used by Young English Learners in an Assessment Context
    Lin Gu, Youngsoon So
  Decatur A: Writing Assessment Training Impact and Mexican EFL University Teachers: A Proposed Categorization
    Elsa Fernanda Gonzalez
  Decatur B: Japanese university students’ paraphrasing strategies in L2 summary writing
    Yasuyo Sawaki, Yutaka Ishii, Hiroaki Yamada

6:30 to 9:30pm  Banquet (Ticket required)
  The Trolley Barn, 963 Edgewood Ave NE, Atlanta, Georgia

Friday, March 8

7:15am to 1:00pm  Conference registration open

7:30 to 8:30am  Language Assessment Literacy Special Interest Group
  Location: Henry Oliver

Concurrent Sessions: Papers

8:30 to 9:00am
  Mary Gay C: Investigating raters’ scoring processes and strategies in paired speaking assessment
    Soo Jung Youn, Shi Chen
  Henry Oliver: Multilingual Assessment Reflecting Multilingual Educational Policy: Toward Assessment for Justice
    Elana Goldeberg Shohamy, Michal Tannenbaum, Anna Gani
  Decatur A: Developing lists of empirical English word difficulties specific to each L1
    Steve Lattanzio, Alistair Van Moere, Jeff Elmore

9:05 to 9:35am
  Mary Gay C: Rater behavior in a high-stakes L2 examination: Does test takers’ perceived first language matter?
    Ari Huhta, Sari Ohranen, Mia Halonen, Tuija Hirvelä, Reeta Neittaanmäki, Sari Ahola, Riikka Ullakonoja
  Henry Oliver: Social justice and washback in language testing in Norway
    Marte Monsen
  Decatur A: Exploring the Impact of Bilingual Education Types on DIF: Implications for Vocabulary Test Development
    Suchada Sanonguthai

9:40 to 10:10am
  Mary Gay C: Not Unwarranted Concordances But Warranted Convergences: Approaches to Standard Setting and Maintenance Using Subject Experts
    Gad Lim, Barbara Ying Zhang, Brigita Séguis
  Henry Oliver: Intended and unintended consequences of reforming a national school-leaving exam and their role for validation
    Benjamin Kremmel, Carol Spoettl, Veronika Schwarz
  Decatur A: A Knowledge-based Vocabulary List (KVL): German, Spanish, and Chinese Results
    [...], Barry O’Sullivan, Laurence Anthony, Karen Dunn, Benjamin Kremmel

Friday, March 8

8:30 to 10:30am  Symposium: Transformative teacher-researcher partnerships in language assessment

Beverly Baker, José Manuel Martínez, Ni-La Lê, Erika B. Kraus, Azad Hassan, India C. Plough, Xun Yan, Ha Ram (Hannah) Kim, John Kotnarowski, Hyunji (Hayley) Park, Jamie L. Schissel, Mario López-Gopar, Constant Leung, Julio Morales, James R. Davis

Location: Decatur B

10:35 to 11:00am  Coffee Break and Group Photo
  Lobby

Concurrent Sessions: Papers

11:00 to 11:30am
  Mary Gay C: Understanding Writing Process of Adult EFL Learners in a Writing Assessment Context
    Ikkyu Choi
  Henry Oliver: How do raters learn to rate? Many-facet Rasch modeling of rater performance over the course of a rater certification program
    Xun Yan, Hyunji Park
  Decatur A: Language assessment and student performance in South African higher education: The case of Stellenbosch University
    Kabelo Wilson Sebolai
  Decatur B: The Impact of an External Standardized Test on Teaching and Learning for Young Learners: A Year 1 Baseline Study in Turkey
    Mikyung Kim Wolf, Alexis Lopez, Jeremy Lee

11:35am to 12:05pm
  Mary Gay C: What aspects of speech contribute to the perceived intelligibility of L2 speakers?
    Willam Bonk, Saerhim Oh
  Henry Oliver: Establishing a Validity Argument for a Rating Scale Developed for Ongoing Diagnostic Assessment in an EFL University Writing Classroom: A Mixed Methods Study
    Apichat Khamboonruang
  Decatur A: Exploring teacher understandings and beliefs as a basis for benchmarking assessments for university foreign language programs
    Noriko Iwashita
  Decatur B: Investigating the consequential validity of the Hanyu Shuiping Kaoshi (Chinese proficiency test) by using an Argument-based framework
    Shujiao Wang

Friday, March 8

12:05 to 1:35pm  Lunch Break on your own
  Language Assessment Quarterly Editorial Board Meeting, Location: Henry Oliver

Concurrent Sessions: Papers

1:35 to 2:05pm
  Mary Gay C: Examination of test-taking strategies used for two item types during L2 listening assessment
    Ruslan Suvorov
  Henry Oliver: Academic language or disciplinary practices? Reconciling perspectives of language and content educators when assessing English learners’ language proficiency in the content classroom
    Lorena Llosa, Scott Grapin
  Decatur A: Placement Testing: One test, two tests, three tests? How many tests are sufficient?
    Kathryn Hille, Yeonsuk Cho
  Decatur B: Developmental frameworks for writing in Denmark, Norway, and the US: A Cross-national comparison
    Jill V. Jeffery, Nikolaj Elf, Gustaf Bernhard Uno Skar, Kristen Campbell Wilcox

2:10 to 2:40pm
  Mary Gay C: Exploring the relationships between test value, motivation, anxiety and test performance: The case of a high-stakes English proficiency test
    Jason Fan, Yan Jin
  Henry Oliver: The role of feedback in the design of a testing model for social justice
    Slobodanka Dimova
  Decatur A: Mitigating rater bias in L2 English speaking assessment through controlled pairwise comparisons
    Masato Hagiwara, Burr Settles, Angela DiCostanzo, Cynthia M. Berger
  Decatur B: Examining the Structure, Scale, and Instructor Perceptions of the ACTFL Can-Do Statements for Spoken Proficiency
    Sonia Magdalena Tigchelaar

Friday, March 8

Concurrent Sessions: Papers

2:45 to 3:15pm
  Mary Gay C: Establishing appropriate cut-scores of standardized tests for a local placement context
    Gary J. Ockey, Sonca Vo, Shireen Baghestani
  Henry Oliver: Towards social justice for item writers: Empowering item writers through language assessment literacy training
    Olena Rossi, Tineke Brunfaut
  Decatur A: Use of automated scoring technology to predict difficult-to-score speaking responses
    Larry Davis, Edward Wolfe
  Decatur B: Building a Partial Validity Argument for the Global Test of English Communication
    Payman Vafaee, Yuko Kashimada

3:15 to 3:35pm  Coffee Break
  Lobby

3:35 to 5:00pm  Distinguished Achievement Award Lecture
  Context, Language Knowledge, and Language Use: Current Understandings
  Dan Douglas, Iowa State University
  Sponsored by Cambridge ESOL/ILTA
  Location: Decatur B

5:00 to 5:30pm  Wrap up & Thanks
  Location: Decatur B

Saturday, March 9

1:30 to 3:30pm  Joint AAAL/ILTA Invited Colloquium
  Assessing lingua franca competence
  Location: Atlanta Sheraton, Capitol North Ballroom