Generating Code-Switched Text for Lexical Learning

Total Page:16

File Type:pdf, Size:1020Kb

Generating Code-Switched Text for Lexical Learning Generating Code-switched Text for Lexical Learning Igor Labutov Hod Lipson Cornell University Cornell University [email protected] [email protected] Abstract nique relies on a phenomenon that elicits a nat- ural simulation of L1-like vocabulary learning in A vast majority of L1 vocabulary acqui- adults — significantly closer to L1 learning for L2 sition occurs through incidental learning learners than any model studied previously. By in- during reading (Nation, 2001; Schmitt et fusing foreign words into text in the learner’s na- al., 2001). We propose a probabilistic ap- tive tongue into low-surprisal contexts, the lexi- proach to generating code-mixed text as cal acquisition process is facilitated naturally and an L2 technique for increasing retention non-obtrusively. Incidentally, this phenomenon in adult lexical learning through reading. occurs “in the wild” and is termed code-switching Our model that takes as input a bilingual or code-mixing, and refers to the linguistic pattern dictionary and an English text, and gener- of bilingual speakers swapping words and phrases ates a code-switched text that optimizes a between two languages during speech. While this defined “learnability” metric by construct- phenomenon had received significant attention ing a factor graph over lexical mentions. from both a socio-linguistic (Milroy and Muysken, Using an artificial language vocabulary, 1995) and theoretical linguistic perspectives (Be- we evaluate a set of algorithms for gener- lazi et al., 1994; Bhatt, 1997) (including some ating code-switched text automatically by computational studies), only recently has it been presenting it to Mechanical Turk subjects hypothesizes that “code-switching” is a marking and measuring recall in a sentence com- of bilingual proficiency, rather than deficiency pletion task. (Genesee, 2001). 1 Introduction Until recently it was widely believed that inci- dental lexical acquisition through reading can only Today, an adult trying to learn a new language is occur for words that occur at sufficient density likely to embrace an age-old and widely accepted in a single text, so as to elicit the “noticing” ef- practice of learning vocabulary through curated fect needed for lexical acquisition to occur (Cobb, word lists and rote memorization. Yet, it is not 2007). Recent neurophysiological findings, how- uncommon to find yourself surrounded by speak- ever, indicate that even a single incidental expo- ers of a foreign language and instinctively pick up sure to a novel word in a sufficiently constrained words and phrases without ever seeing the defini- context is sufficient to trigger an early integra- tion in your native tongue. Hearing “pass le sale tion of the word in the brain’s semantic network please” at the dinner table from your in-laws vis- (Borovsky et al., 2012). iting from abroad, is unlikely to make you think An approach explored in this paper, and moti- twice about passing the salt. Humans are extraor- vated by the above findings, exploits “constrain- dinarily good at inferring meaning from context, ing” contexts in text to introduce novel words. A whether this context is your physical surround- state-of-the-art approach for generating such text ing, or the surrounding text in the paragraph of the is based on an expert annotator whose job is to word that you don’t yet understand. decide which words to “switch out” with novel Recently, a novel method of L2 language teach- foreign words (from hereon we will refer to the ing had been shown effective in improving adult “switched out” word as the source word and to the lexical acquisition rate and retention 1. This tech- “switched in” word as the target word). Conse- 1authors’ unpublished work quently the process is labor-intensive and leads to 562 Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, pages 562–571, Baltimore, Maryland, USA, June 23-25 2014. c 2014 Association for Computational Linguistics a “one size fits all solution” that is insensitive to the disciplines of sociolinguistics, theoretical and the learner’s skill level or vocabulary proficiency. psycholinguistics and even literary and cultural This limitation is also cited in literature as a sig- studies (predominantly in the domain of Spanish- nificant roadblock to the widespread adaptation English code-switching) (Lipski, 2005). of graded reading series (Hill, 2008). A reading- Code-switching that occurs naturally in bilin- based tool that follows the same principle, i.e. by gual populations, and especially in children, has systematic exposure of a learner to an incremen- for a long time been considered a marking of tally more challenging text, will result in more ef- incompetency in the second language. A more fective learning (Lantolf and Appel, 1994). recent view on this phenomenon, however, sug- To address the above limitation, we develop an gests that due to the underlying syntactic com- approach for automatically generating such “code- plexity of code-switching, code-switching is ac- switched” text with an explicit goal of maximizing tually a marking of bilingual fluency (Genesee, the lexical acquisition rate in adults. Our method 2001). More recently, the idea of employing is based on a global optimization approach that code-switching in the classroom, in a form of incorporates a “knowledge model” of a user with conversation-based exercises, has attracted the the content of the text, to generate a sequence of attention of multiple researchers and educators lexical “switches”. To facilitate the selection of (Moodley, 2010; Macaro, 2005), yielding promis- “switch points”, we learn a discriminative model ing results in an elementary school study in South- for predicting switch point locations on a corpus Africa. that we collect for this purpose (and release to the community). Below is a high-level outline of this 2.2 Computational Approaches to paper. Code-switching We formalize our approach within a prob- Additionally, there has been a limited number • abilistic graphical model framework, infer- of studies of the computational approaches to ence in which yields “code-switched” text code-switching, and in particular code-switched that maximizes a surrogate to the acquisition text generation. Solorio and Liu (2008), record rate objective. and transcribe a corpus of Spanish-English code- mixed conversation to train a generative model We compare this global method to sev- • (Naive Bayes) for the task of predicting code- eral baseline techniques, including the strong switch points in conversation. Additionally they “high-frequency” baseline. test their trained model in its ability to generate code-switched text with convincing results. Build- We analyze the operating range in which • ing on their work, (Adel et al., 2012) employ ad- our model is effective and motivate the near- ditional features and a recurrent network language future extension of this approach with the model for modeling code-switching in conversa- proposed improvements. tional speech. Adel and collegues (2011) propose 2 Related Work a statistical machine translation-based approach for generating code-switched text. We note, how- Our proposed approach to the computational gen- ever, that the primary goal of these methods is in eration of code-switched text, for the purpose of the faithful modeling of the natural phenomenon L2 pedagogy, is influenced by a number of fields of code-switching in bilingual populations, and that studied aspects of this phenomenon from dis- not as a tool for language teaching. While useful tinct perspectives. In this section, we briefly de- in generating coherent, syntactically constrained scribe a motivation from the areas of socio- and code-switched texts in its own right, none of these psycho- linguistics and language pedagogy re- methods explicitly consider code-switching as a search that indicate the promise of this approach. vehicle for teaching language, and thus do not take on an optimization-based view with an ob- 2.1 Code-switching as a natural phenomenon jective of improving lexical acquisition through Code-switching (or code-mixing) is a widely stud- the reading of the generated text. More recently, ied phenomenon that received significant attention and concurrently with our work, Google’s Lan- over the course of the last three decades, across guage Immersion app employs the principle of 563 code-switching for language pedagogy, by gener- “noticing” effect — a prerequisite to lexical acqui- ating code-switched web content, and allowing its sition (Schmidt and Schmidt, 1995; Cobb, 2007). users to tune it to their skill level. It does not, how- ever, seem to model the user explicitly, nor is it 3 Model clear if it performs any optimization in generating 3.1 Overview the text, as no studies have been published to date. The formulation of our model is primarily moti- vated by two hypotheses that have been validated 2.3 Computational Approaches to Sentence experimentally in the cognitive science literature. Simplification We re-state these hypotheses in the language of Although not explicitly for teaching language, “surprisal”: computational approaches that facilitate accessi- 1. Inserting a target word into a low surprisal bility to texts that might otherwise be too difficult context increases the rate of that word’s inte- for its readers, either due to physical or learning gration into a learner’s lexicon. disabilities, or language barriers, are relevant. In the recent work of (Kauchak, 2013), for example 2. Multiple exposures to the word in low sur- demonstrates an approach to increasing readability prisal contexts increases rate of that word’s of texts by learning from unsimplified texts. Ap- integration. proaches in this area span methods for simplify- Hypothesis 1 is supported by evidence from ing lexis (Yatskar et al., 2010; Biran et al., 2011), (Borovsky et al., 2012; Frank et al., 2013), and hy- syntax (Siddharthan, 2006; Siddharthan et al., pothesis 2 is supported by evidence from (Schmidt 2004), discourse properties (Hutchinson, 2005), and Schmidt, 1995).
Recommended publications
  • Incidental Vocabulary Acquisition from an Authentic Novel: Do Things Fall Apart?
    Reading in a Foreign Language April 2010, Volume 22, No. 1 ISSN 1539-0578 pp. 31–55 Incidental vocabulary acquisition from an authentic novel: Do Things Fall Apart? Ana Pellicer-Sánchez and Norbert Schmitt University of Nottingham United Kingdom Abstract Nation (2006) has calculated that second language (L2) learners require much more vocabulary than previously thought to be functional with language (e.g., 8,000–9,000 word families to read independently). This level is far beyond the highest graded reader, and would be difficult to explicitly teach. One way for learners to be exposed to mid- frequency vocabulary is to read authentic materials. The original A Clockwork Orange study (Saragi, Nation, & Meister, 1978) showed impressive amounts of incidental vocabulary learning with first language (L1) readers, but subsequent studies with L2 learners (using graded readers or simplified materials) showed only modest gains. This study explores the degree to which relatively advanced L2 readers can acquire spelling, word class, and recognition and recall of meaning from reading the unmodified authentic novel Things Fall Apart. After more than 10 exposures, the meaning and spelling could be recognized for 84% and 76% of the words respectively, while the meaning and word class could be recalled for 55% and 63%. Keywords: vocabulary acquisition, incidental learning, English as a foreign language (EFL), extensive reading, word knowledge In a landmark study, Nation (2006) calculated the amount of vocabulary necessary to function in English, using a 98% coverage figure (Hu & Nation, 2000). He found that it took much more vocabulary than previous estimates: 6,000–7,000 word families for spoken discourse and 8,000– 9,000 families for written discourse.
    [Show full text]
  • Language Difficulty and Prior Learning Influence Foreign Vocabulary Acquisition
    languages Article Language Difficulty and Prior Learning Influence Foreign Vocabulary Acquisition Sayuri Hayakawa *, James Bartolotti, Aimee van den Berg and Viorica Marian Department of Communication Sciences and Disorders, Northwestern University, Evanston, IL 60208, USA; [email protected] (J.B.); [email protected] (A.v.d.B.); [email protected] (V.M.) * Correspondence: [email protected] Received: 4 November 2019; Accepted: 22 December 2019; Published: 27 December 2019 Abstract: When learning a foreign language, words that are the hardest to learn are often the easiest to forget. Yet, there is also evidence that more challenging learning contexts can lead to greater long-term retention. Here, we investigate the effect of language difficulty on vocabulary retention by teaching participants novel words that varied in both imageability and similarity to a known language over a period of four weeks. We found that easier words (high-imageability and familiar) were generally retained better than harder words (low-imageability and unfamiliar). However, when words were fully learned during training, the more difficult unfamiliar words were later recalled with higher accuracy than easier familiar words. The effect of language difficulty on vocabulary retention therefore varies depending on how well words were initially encoded. We conclude that greater challenges can reap greater long-term rewards so long as learners establish a strong foundation during initial acquisition. Keywords: foreign language acquisition; desirable difficulties; imageability; language similarity; vocabulary 1. Introduction When learning a new skill, whether it be computer programming, tennis, or a foreign language, many of us have experienced a ‘honeymoon period,’ where the transformation from complete novice to competent newcomer prompts a rush of excitement and confidence.
    [Show full text]
  • An Introduction to Applied Linguistics This Page Intentionally Left Blank an Introduction to Applied Linguistics
    An Introduction to Applied Linguistics This page intentionally left blank An Introduction to Applied Linguistics edited by Norbert Schmitt Orders: please contact Bookpoint Ltd, 130 Milton Park, Abingdon, Oxon OX14 4SB. Telephone: (44) 01235 827720. Fax: (44) 01235 400454. Lines are open from 9.00 to 5.00, Monday to Saturday, with a 24-hour message answering service. You can also order through our website www.hoddereducation.co.uk If you have any comments to make about this, or any of our other titles, please send them to [email protected] British Library Cataloguing in Publication Data A catalogue record for this title is available from the British Library ISBN: 978 0 340 98447 5 First Edition Published 2002 This Edition Published 2010 Impression number 10 9 8 7 6 5 4 3 2 1 Year 2014, 2013, 2012, 2011, 2010 Copyright © 2010 Hodder & Stoughton Ltd All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher or under licence from the Copyright Licensing Agency Limited. Further details of such licences (for reprographic reproduction) may be obtained from the Copyright Licensing Agency Limited, of Saffron House, 6–10 Kirby Street, London EC1N 8TS. Hachette UK’s policy is to use papers that are natural, renewable and recyclable products and made from wood grown in sustainable forests. The logging and manufacturing processes are expected to conform to the environmental regulations of the country of origin.
    [Show full text]
  • Second Language Word Association: Processes, Methodologies, and Models
    Second Language Word Association: Processes, Methodologies, and Models John Paul Racine A thesis submitted to Cardiff University in fulfilment of the degree of Doctor of Philosophy 2018 DECLARATION This work has not been submitted in substance for any other degree or award at this or any other university or place of learning, nor is being submitted concurrently in candidature for any degree or other award. Signed …………………………………………………… Date: September 28, 2018 STATEMENT 1 This thesis is being submitted in partial fulfillment of the requirements for the degree of Ph.D. Signed ………………………………………….………… Date: September 28, 2018 STATEMENT 2 This thesis is the result of my own independent work/investigation, except where otherwise stated, and the thesis has not been edited by a third party beyond what is permitted by Cardiff University’s Policy on the Use of Third Party Editors by Research Degree Students. Other sources are acknowledged by explicit references. The views expressed are my own. Signed ……………………………………….……….…… Date: September 28, 2018 STATEMENT 3 I hereby give consent for my thesis, if accepted, to be available online in the University’s Open Access repository and for inter-library loan, and for the title and summary to be made available to outside organisations. Signed ……………………………………………..…..….. Date: September 28, 2018 For Moka May you dream big and always believe in yourself Acknowledgements I wish to express my deepest thanks to my supervisors, Professors Alison Wray and Tess Fitzpatrick, and Dr. Michelle Aldridge-Waddon. Without their wisdom and guidance, this dissertation would not have been possible. My research has also benefitted greatly from discussions with many other distinguished academics over the years.
    [Show full text]
  • Challenging Learners in Their Individual Zone of Proximal Development Using Pedagogic Developmental Benchmarks of Syntactic Complexity
    Challenging Learners in Their Individual Zone of Proximal Development Using Pedagogic Developmental Benchmarks of Syntactic Complexity Xiaobin Chen and Detmar Meurers LEAD Graduate School and Research Network Department of Linguistics Eberhard Karls Universitat¨ Tubingen,¨ Germany {xiaobin.chen,detmar.meurers}@uni-tuebingen.de Abstract and identify reading material individually chal- lenging learners, essentially instantiating the next This paper introduces an Intelligent Com- stage of acquisition as captured by Krashen’s con- puter Assisted Language Learning system cept of i+1 (Krashen, 1981) or relatedly, but em- designed to provide reading input for lan- phasizing the social perspective, Vygotsky’s Zone guage learners based on the syntactic com- of Proximal Development (ZPD; Vygotsky, 1976). plexity of their language production. The In terms of structure of the paper, we first locate system analyzes the linguistic complexity our approach in terms of the Complexity, Accu- of texts produced by the user and of texts racy, and Fluency (CAF) framework in SLA re- in a pedagogic target language corpus to search. Then we review approaches adopted by identify texts that are well-suited to foster earlier studies in developmental complexity re- acquisition. These texts provide develop- search, including problems they pose for a peda- mental benchmarks offering an individu- gogical approach aimed at offering developmental ally tailored language challenge, making benchmarks. We propose and justify a solution, ideas such as Krashen’s i+1 or Vygotsky’s before presenting the architecture and functional- Zone of Proximal Development concrete ity of the SyB system. and empirically explorable in terms of a broad range of complexity measures in all 2 Development of Syntactic Complexity dimensions of linguistic modeling.
    [Show full text]
  • Polish and Finnish Teenagers' Motivation to Learn
    MOUTON EuJAL 2016; 4(2): 277–300 Janina Iwaniec* and Riikka Ullakonoja Polish and Finnish teenagers’ motivation to learn English: The role of context DOI 10.1515/eujal-2015-0034 Abstract: Language learning motivation has been investigated in a number of contexts (Dörnyei and Ushioda 2009). However, there has been only one attempt (Taguchi et al. 2009) at a systematic comparison of models of motivation to learn English by students from different countries. The aim of this study is to investigate the role of context by comparing motivational characteristics and SEM (Structural Equation Modelling) models of language learning motivation of 15–16 year-old students from Finland and Poland. The data from both countries was collected using the same motivational questionnaire with a total of 351 students completing it. A number of differences were identified between the two contexts. Finnish learners scored significantly higher on scales of the ideal L2, intrinsic motivation, knowledge orientation, motivated behaviour and peer pressure, whereas Polish learners reported higher levels of international orientation and parental encour- agement. The comparison of models of language learning motivation shows differences in the role of parents and peers, endorsement of goals between the two cohorts. Finally, we discuss the potential reasons behind the differences in motivation between the two contexts and conclude that contexts not only affect motivational characteristics but also the structure of language learning motiva- tion. Keywords: motivation, Polish context, Finnish context Zusammenfassung: Sprachlernmotivation wurde bereits in unterschiedlichen Kontexten untersucht (Dörnyei und Ushioda 2009). Allerdings gab es bisher lediglich eine Untersuchung (Taguchi et al., 2009) eines systematischen Ver- gleichs, in der versucht wurde, Motivationsmodelle beim Spracherwerb der eng- lischen Sprache von Lernern aus verschiedenen Ländern zu vergleichen.
    [Show full text]
  • An Analysis of Lexical Errors in the English Compositions of Thai Learners
    An analysis of lexical errors in the English compositions of Thai learners SAENGCHAN HEMCHUA – Srinakharinwirot University, Bangkok NORBERT SCHMITT – The University of Nottingham ABSTRACT The importance of vocabulary in second language (L2) writing is widely accepted, but there has been relatively little research into the lexical errors learners produce when writing in their second language. Moreover, the error categorisation frameworks used in some previous studies have addressed only a relatively limited number of lexical error categories. In this paper the authors used a more comprehensive error taxonomy based on James (1998), with some additions from Leech’s semantics (1981), to analyse Thai third-year university students’ English compositions for lexical errors. The analysis revealed that (a) ‘near synonyms’ were the most numerous errors, followed by ‘preposition partners’ and ‘suffixes’, (b) the students had more difficulty with semantics than the forms of words, and (c) the identified sources of errors were mainly from L2 intrinsic difficulty rather than the first language (L1) transfer. The findings from the Thai students’ written lexical errors have impli- cations for L2 vocabulary teaching and learning, which should be of interest to wider English as a Second Language (ESL)/English as a Foreign Language (EFL) contexts. Research into lexical errors The language produced by foreign language (FL) learners almost inevitably contains errors of various types. This is the process of learning a language. Empirical evidence suggests that lexical errors are the most frequently occurring category of errors in written English (for example, Grauberg 1971; Meara 1984; Lennon 1991). They significantly affect the quality of academic writing (Astika 1993; Ellis, R 1994; Engber 1995) and native speakers consider them the most irritating (Santos 1988).
    [Show full text]
  • 090Ad 1376381204.Pdf
    ! ! " ! " ! # ! ! ! #$ ! % ! # " !! ! ! ! &' (# ) ! #" !! &' # # ! !!) * ) + ,- ) #. /0" // # # !! ) - + " #. " # ( # !! "# ! "# ! ! " # $ !% ! & $ ' ' ($ ' # % % ) %* %' $ ' + " % & ' ! # $, ( $ - . ! "- ( % . % % % % $ $ $ - - - - // $$$ 0 1"1"#23." "0" )*4/ +) * !5 !& 6!7%66898& % ) - 2 : ! * & /- ;9<8="0" )*4/ +) "3 " & 9<8= Cohesive Argumentative Writing Through Controlled, Guided, and Free writing Activities Abbas Ali Zarei, Assistant professor, Imam Khomeini International University, Qazvin, Iran Ramin Rahmany Assistant professor, Islamic Azad University, Takestan, Iran Maryam dodangeh M.A., Islamic Azad University, Takestan, Iran TABLE OF CONTENTS LIST OF TABLES............................................................................................5 LISTOFGRAPHS……………………………………………………………..6 CHAPTER I: Preliminaries 1.1 Introduction………………..……………………………………………...7 1.2 Statement of the Problem..........................................................................10
    [Show full text]
  • Vli.V08.1.2187-2759 VLI Editorial Team
    Vocabulary Learning and Instruction Volume 8, Number 1, March 2019 doi: http://dx.doi.org/10.7820/vli.v08.1.2187-2759 VLI Editorial Team Editorial Board: Jeffrey Stewart, Raymond Stubbe, Aaron Gibson Reviewers: Paul Meara, Norbert Schmitt, John A. Read, Stuart Webb, John P. Racine, Tomoko Ishii, Tim Stoeckel, Phil Bennett, Dale Brown, Jon Clenton, Stuart McLean, Peter Thwaites, Tatsuya Nakata, Kiwamu Kasahara, Masumi Kojima, James Rogers, Yuko Hoshino, Vivienne Rogers, Yu Kanazawa, Stephen Cutler, Ian Munby, Mark Maby, Brandon Kramer, Makoto Yoshii, Anna C. S. Chang, Brent Culligan Copy Editors: Alex Cameron, Andrew Gallacher, Peter Harold, Mark Howarth, Linda Joyce, Tim Pritchard, Andrew Thompson, Alonzo Williams, Ian Mumby, Pete Westbrook, Melanie González, Stephen Cutler, Michael McGuire, Shusaku Kida, Jenifer Larson-Hall, Shin Ishikawa, Magda Kitano. The Editorial Team expresses a sincere thank you to Mana Ikawa, who designed the cover for the print version of VLI. Copyright © 2019 Vocabulary Learning and Instruction, ISSN: Online 2187-2759; Print 2187-2767. All articles are copyrighted by their respective authors. Vocabulary Learning and Instruction Volume 8, Number 1, March 2019 doi: http://dx.doi.org/10.7820/vli.v08.1.2187-2759 Table of Contents Articles Page Letter from the Editor iv Raymond Stubbe Seventh Annual JALT Vocabulary SIG Symposium Morning Session: Vocabulary and Corpus Linguistics A Reconsideration of the Construct of “A Vocabulary for Japanese Learners of English”: A Critical Comparison of the JACET Wordlists and New General Service Lists 1 Shin’ichiro Ishikawa Validating the Construct of Readability in EFL Contexts: A Proposal for Criteria 8 Geoffrey G. Pinchbeck Toward Written Error Correction with a Japanese-English Parallel Corpus: Data-Driven Learning in the Japanese EFL Classroom 17 Michael McGuire Evaluating Corpora with Word Lists and Word Difficulty 29 Brent A.
    [Show full text]
  • Mouton De Gruyter | IRAL 49-1 (2011) | 1St Proofs Scott Crossley and Thomas Lee Salsbury / [email protected], [email protected]
    Mouton de Gruyter | IRAL 49-1 (2011) | 1st proofs Scott Crossley and Thomas Lee Salsbury / [email protected], [email protected] The development of lexical bundle accuracy and production Please note: page- and linebreaks are temporary; overlong lines, orphans/widows, and split examples will be dealt with after your corrections have been incorporated; there is no need to mark split examples or bad page breaks. Typesetter’s notes/queries /1/ Article adapted to IRAL style sheet. There is a huge number of minor changes and corrections, please carefully check the file for errors. Preliminary page and line breaks 1-iral-49-1 — 2011/1/29 18:46—1— #2—ce Mouton de Gruyter The development of lexical bundle accuracy and production in English second language speakers SCOTT CROSSLEY AND THOMAS LEE SALSBURY Abstract Six adult, second language (L2) English learners were observed over a period of one year to explore the development of lexical bundles (i.e., bigrams) in nat- urally produced, oral English. Total bigrams produced by the L2 learners over the year of observation that were shared with native speakers were compared using a frequency index to explore L2 learner’s accuracy of use. The results of the study support the notion that bigram accuracy increases as a function of time spent learning English. This finding lends credence to the notion that the production of lexical bundles by L2 learners begins to develop in paral- lel with the frequency of lexical bundles used by native speakers. Like native speakers, L2 learners also begin to produce lexical bundles that serve prag- matic and syntactic functions.
    [Show full text]
  • Challenging Learners in Their Individual Zone of Proximal Development Using Pedagogic Developmental Benchmarks of Syntactic Complexity
    Challenging Learners in Their Individual Zone of Proximal Development Using Pedagogic Developmental Benchmarks of Syntactic Complexity Xiaobin Chen and Detmar Meurers LEAD Graduate School and Research Network Department of Linguistics Eberhard Karls Universitat¨ Tubingen,¨ Germany fxiaobin.chen,[email protected] Abstract and identify reading material individually chal- lenging learners, essentially instantiating the next This paper introduces an Intelligent Com- stage of acquisition as captured by Krashen’s con- puter Assisted Language Learning system cept of i+1 (Krashen, 1981) or relatedly, but em- designed to provide reading input for lan- phasizing the social perspective, Vygotsky’s Zone guage learners based on the syntactic com- of Proximal Development (ZPD; Vygotsky, 1976). plexity of their language production. The In terms of structure of the paper, we first locate system analyzes the linguistic complexity our approach in terms of the Complexity, Accu- of texts produced by the user and of texts racy, and Fluency (CAF) framework in SLA re- in a pedagogic target language corpus to search. Then we review approaches adopted by identify texts that are well-suited to foster earlier studies in developmental complexity re- acquisition. These texts provide develop- search, including problems they pose for a peda- mental benchmarks offering an individu- gogical approach aimed at offering developmental ally tailored language challenge, making benchmarks. We propose and justify a solution, ideas such as Krashen’s i+1 or Vygotsky’s before presenting the architecture and functional- Zone of Proximal Development concrete ity of the SyB system. and empirically explorable in terms of a broad range of complexity measures in all 2 Development of Syntactic Complexity dimensions of linguistic modeling.
    [Show full text]
  • Eurosla 27 – Reading, 30Th August
    Version 31st July 2017 Eurosla 27 – Reading, 30th August – 2nd September 2017 Wednesday 30th August doctoral workshops Palmer Building Room 1.02 9.00-9.30 Linda Evenstad Emilsen Acquisition of gender in Norwegian L2 9.30-10.00 Ting Yan Rachel Kan Acquisition of Cantonese classifiers in majority and minority language contexts 10.00-10.30 Shi Zhang The effect of L2 exposure on processing and interpreting reflexive binding and aspect marking in Mandarin-English bilinguals break 10.45-11.15 Eloi Puig Mayenco Investigating the grammatical knowledge of Negative Quantifiers and Negative Polarity Items in L3 English 11.15-11.45 George Pontikas, Ian Processing of English wh-questions by Cunnings, Ianthi Maria sequential bilingual children: a visual-world Tsimpli and Theo paradigm study Marinis 11.45-12.15 Gabriella Notarianni Tense and aspect in the acquisition of Italian Burk as a foreign language Palmer Building Room 1.03 9.00-9.30 Vincent DeLuca Changing our Brains and Minds: Exploring the effect of the bilingual experience on cognition, neural structure and function. 9.30-10.00 Kim Collewaert Referential mechanisms in oral narratives of learners of Spanish 10.00-10.30 Anders Agebjörn The form and function of (double) definiteness in second language Swedish break 10.45-11.15 Xuemei Chen Don’t you find this funny? Humour appreciation among Chinese L2 users of English 11.15-11.45 Audrey Bonvin Discourse cohesion in a Romance-Germanic languages contact environment: lexical and syntactic preferences for additive linking in Switzerland 11.45-12.15
    [Show full text]