Download Preprint

Preprint version International Journal of Learner Corpus Research, accepted for publication 23 July 2021 [email protected] Bert Le Bruyn and Magali Paquot (Eds.), Learner Corpus Research Meets Second Language Acquisition. Cambridge: Cambridge University Press, 2021. xiii + 275 pp. ISBN 9781108442299. [Cambridge Applied Linguistics Series] Reviewed by Kevin McManus (Penn State University, USA) Understanding the ways in which speakers use an additional language and how that ability emerges and changes over time constitutes a major focus of applied linguistics research to date. A dominant approach to investigating this question has involved studies of production, including studies that have documented how speakers use specific linguistic features (e.g., articles), multi-word combinations, as well as broader linguistic and/or discourse-level patterns. One particularly fruitful method for studying L2 usage has involved corpus linguistic analyses of learner corpora, defined as “systematic collections of authentic, continuous and contextualized language use (spoken or written) by L2 learners stored in electronic format” (Callies & Paquot, 2015, p. 1). Learner corpus research (LCR) thus holds considerable potential for informing and better grounding current conceptualizations of L2 learning as developed in the field of second language acquisition (SLA). However, as many commentators have noted, the fields of LCR and SLA have not always benefited from one another as much as they could (see Myles, 2005, 2015). By providing an up-to-date account of cutting-edge work at the intersection of LCR and SLA, 1 Preprint version International Journal of Learner Corpus Research, accepted for publication 23 July 2021 [email protected] the current volume shows how recent advances in LCR and SLA have brought these fields closer together to provide robust and innovative accounts of L2 use and development. In this review, I provide a short summary of each chapter in the volume, followed by a concise evaluation of the volume as a whole and its contribution to the field. Le Bruyn and Paquot’s introduction situates and provides a broad contextualization for the volume, noting three main topics addressed in the nine empirical studies: universal tendencies and crosslinguistic influence, proficiency and time, and corpus analysis and development. The editors note that the volume “provides a fair impression of how the fields of LCR and SLA are currently interacting” (p. 3), achieved by drawing on a broad range of corpora, theories of language and learning, and analytical approaches. The book concludes with commentaries from Sylviane Granger and Florence Myles on the volume’s contribution to LCR and SLA. The first three empirical chapters focus on crosslinguistic influence. Ionin and Díez- Bedmar investigated article usage by Russian- and Spanish-speaking learners of English. Their study examined the extent to which the predictions formulated from prior experimental SLA work about article usage were borne out in LCR using the Cambridge Learner Corpus (see Cambridge University Press, 2021). In general, their comparisons indicate a similar patterning of results in both approaches: article usage was influenced by prior linguistic knowledge and proficiency. At the same time, the authors note that each approach brings its own specific contribution for studying article usage, thus cementing claims about the importance of methodological triangulation in research design. 2 Preprint version International Journal of Learner Corpus Research, accepted for publication 23 July 2021 [email protected] Understanding crosslinguistic influence in terms of Present Perfect and Simple Past usage among German and Chinese speakers is the focus of Werner, Fuchs and Götz’s study. Their analysis examined “if/how learners from two different L1 backgrounds deviate from native usage” (p. 58) by comparing usage differences in a variety of corpora: for the L2 speakers, the Louvain International Database of Spoken English Interlanguage (LINDSEI, see Gilquin et al., 2010) and the International Corpus of Learner English (ICLE, see Granger et al., 2009); and for the L1 speakers, the Louvain Corpus of Native English Conversation (LOCNEC, see De Cock, 2004) and the Louvain Corpus of Native English Essays (LOCNESS, see Granger & Tyson, 1996). Consistent with previous research, results indicated both that L2 and L1 speakers use these tense forms in different ways and that increased L2 proficiency shapes usage towards target-like norms. The authors suggest that L1 transfer explanations do not account well for their findings. In the last of the crosslinguistic influence chapters, Meriläinen examined embedded inversion and preposition omission in L2 English among learners from a variety of L1 backgrounds as well as with L1 speakers using the ICLE corpus and the Corpus of Matriculation Examination compositions for L2 speakers and LOCNESS for L1 speakers. The author suggests that prior linguistic knowledge plays an important role in accounting for usage patterns in the corpus data. Polio and Yoon offer a refreshing take on understandings and operationalizations of accuracy in L2 research. They investigated to what extent multi-word combinations can function as measures of accuracy in L2 writing, using the Corpus of Contemporary American English (COCA, see Davies, 2008) and a variety of L2 corpora (e.g., the MSU 3 Preprint version International Journal of Learner Corpus Research, accepted for publication 23 July 2021 [email protected] corpus, see Connor-Linton & Polio, 2014). In so doing, they propose new ways to think about accuracy, drawing on usage-based conceptualizations of language that move beyond judgements from data coders and L1 speakers. In addition, the authors present and discuss some of the ways that accuracy coding has the potential to be less labor-intensive by using corpus linguistic techniques. The volume also includes three chapters that use longitudinal learner corpora to understand development and change. Paquot, Haets, and Gries investigated phraseological complexity development in L2 English using written data from the Longitudinal Database of Learner English project (LONGDALE, see Meunier et al., 2016). The study examined how L2 usage changed over time and to what extent L2 proficiency helped understand development. Their findings suggest a close relationship between general L2 proficiency and development in writing. The authors also draw attention to important task effects on L2 performance. In the case of writing, this includes the extent to which different essay prompts can elicit different types of responses (see also Verspoor et al.). These insights on can influence conclusions and claims about L2 development in important ways. Using oral and written data in French and Spanish from the Languages and Social Networks Abroad Project (LANGSNAP, see Mitchell et al., 2017), Tracy-Ventura, Huensch, and Mitchell investigated changes in L2 lexical diversity during study abroad and four years later. In contrast to much LCR research, this learner corpus includes considerable meta-data to understand and contextualize usage. In their study, social network data were used to understand the extent to which learners continued to use and/or 4 Preprint version International Journal of Learner Corpus Research, accepted for publication 23 July 2021 [email protected] receive exposure to the L2. The findings indicated L2 exposure provided an important explanation for changes in L2 lexical diversity. Taking a longitudinal multiple case study approach, Verspoor, Lowie, and Wieling investigated L2 writing development over 23 weeks. The analyses examined lexical and syntactic changes over the course of the 23 weeks, with data collected each week to provide fine-grained data points for studying development. Their findings indicated improvement over time on a range of measures, but development was non-linear. Variation among individuals was also evident. The authors call for more multiple case study approaches to better understand the longitudinal trajectories of L2 development. The last two empirical studies of the volume focus on methodology and research design. Wulff and Gries make a case for studying individual differences and variation in SLA using the MuPDAR(F) (multifactorial prediction and deviation analysis using regression/random forests) statistical technique (see Gries & Adelman, 2014; Gries & Deshors, 2014). This technique can address the limitations of frequency comparisons of overuse and underuse by focusing on probabilistic differences resulting from usage. In so doing, this method compares linguistic choices between speakers rather than counting the number usage instances between speakers. The usefulness of this method for LRC and SLA was demonstrated by analyzing genitive alternation (of vs. ’s) among Chinese- and German-speaking learners of English (from ICLE) and L1 English speakers (from ICE). The results show that individual variation plays a major role in our understanding of usage. In line with Verspoor et al., while group-level analyses can be insightful, when used alone they provide a partial account only. 5 Preprint version International Journal of Learner Corpus Research, accepted for publication 23 July 2021 [email protected] Bell, Collins, and Marsden offer a review of methodological issues associated with the creation of learner corpora and present analyses from pilot data involving school-aged children. Their analyses highlight the importance of piloting before finalizing

Download Preprint

The Spoken BNC2014: Designing and Building a Spoken Corpus Of

E-Language: Communication in the Digital Age Dawn Knight1, Newcastle University

Investigating Vocabulary in Academic Spoken English

SELECTING and CREATING a WORD LIST for ENGLISH LANGUAGE TEACHING by Deny A

FERSIWN GYMRAEG ISOD the National Corpus of Contemporary

The National Corpus of Contemporary Welsh 1. Introduction

A Corpus-Assisted Critical Discourse Analysis of the Reporting on Corporate Fraud by UK Newspapers 2004 - 2014

The Spoken BNC2014 Designing and Building a Spoken Corpus of Everyday Conversations

Classical and Modern Arabic Corpora: Genre and Language Change

The Spoken British National Corpus 2014

EUROCALL Conference 2019 “CALL and Complexity”

Corpus Linguistics and Pragmatics Christoph Rühlemann, University of Paderborn Brian Clancy, University of Limerick