Copyrighted Material
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Pattern Language for Teaching in a Foreign Language - Part 2
A Pattern Language for Teaching in a Foreign Language - Part 2 Christian K¨oppe and Mari¨elle Nijsten Hogeschool Utrecht, Institute for Information & Communication Technology, Postbus 182, 3500 AD Utrecht, Netherlands {christian.koppe,marielle.nijsten}@hu.nl http://www.hu.nl Abstract. Teaching a technical subject in a foreign language is not just switching to the foreign language. There are specific problems related to the integration of content and learn- ing. This paper is part of ongoing work with mining of patterns which address these problems and intends to offer practical help to teachers by working towards a pattern language for technical instructors who teach students in a second language, and who are not trained in language pedagogy. This is the pre-conference version for the PLoP'12 conference. Note to workshop participants: for the PLoP 2012 conference we would like to focus on following patterns: Content-Compatible Language, Commented Action, and Language Monitor. We also would like to work on the list of related educational patterns. However, feedback on the other patterns is more than welcome, they are therefore all included in this workshop version of the paper. 1 Introduction The set of patterns introduced in this paper is aimed at instructors who occasionally want | or have | to teach a class in a foreign language instead of their mother tongue. The patterns are meant to support these instructors in preparing for these classes in such a way that a foreign language will not be a barrier to students' understanding of the course content. This set of patterns is part of a larger project for developing a pattern language for teaching in a foreign language. -
Aurora Tsai's Review of the Diagnosis of Reading in a Second Or Foreign
Reading in a Foreign Language April 2015, Volume 27, No. 1 ISSN 1539-0578 pp. 117–121 Reviewed work: The Diagnosis of Reading in a Second or Foreign Language. (2015). Alderson, C., Haapakangas, E., Huhta, A., Nieminen, L., & Ullakonoja, R. New York and London: Routledge. Pp. 265. ISBN 978-0-415-66290-1 (paperback). $49.95 Reviewed by Aurora Tsai Carnegie Mellon University United States http://www.amazon.com In The Diagnosis of Reading in a Second or Foreign Language, Charles Alderson, Eeva-Leena Haapakangas, Ari Huhta, Lea Nieminen, and Riikka Ullakonoja discuss prominent theories concerning the diagnosis of reading in a second or foreign language (SFL). Until now, the area of diagnosis in SFL reading has received little attention, despite its increasing importance as the number of second language learners continues to grow across the world. As the authors point out, researchers have not yet established a theory of how SFL diagnosis works, which makes it difficult to establish reliable procedures for accurately diagnosing and helping students in need. In this important contribution to reading and diagnostic research, Alderson et al. illustrate the challenges involved in carrying out diagnostic procedures, conceptualize a theory of diagnosis, provide an overview of SFL reading, and highlight important factors to consider in diagnostic assessment. Moreover, they provide detailed examples of tests developed specifically for diagnosis, most of which arise from their most current research projects. Because this book covers a wide variety of SFL reading and diagnostic topics, researchers in applied linguistics, second language acquisition, language assessment, education, and even governmental organizations or military departments would consider it an enormously helpful and enlightening resource on SFL reading. -
Segmentability Differences Between Child-Directed and Adult-Directed Speech: a Systematic Test with an Ecologically Valid Corpus
Report Segmentability Differences Between Child-Directed and Adult-Directed Speech: A Systematic Test With an Ecologically Valid Corpus Alejandrina Cristia 1, Emmanuel Dupoux1,2,3, Nan Bernstein Ratner4, and Melanie Soderstrom5 1Dept d’Etudes Cognitives, ENS, PSL University, EHESS, CNRS 2INRIA an open access journal 3FAIR Paris 4Department of Hearing and Speech Sciences, University of Maryland 5Department of Psychology, University of Manitoba Keywords: computational modeling, learnability, infant word segmentation, statistical learning, lexicon ABSTRACT Previous computational modeling suggests it is much easier to segment words from child-directed speech (CDS) than adult-directed speech (ADS). However, this conclusion is based on data collected in the laboratory, with CDS from play sessions and ADS between a parent and an experimenter, which may not be representative of ecologically collected CDS and ADS. Fully naturalistic ADS and CDS collected with a nonintrusive recording device Citation: Cristia A., Dupoux, E., Ratner, as the child went about her day were analyzed with a diverse set of algorithms. The N. B., & Soderstrom, M. (2019). difference between registers was small compared to differences between algorithms; it Segmentability Differences Between Child-Directed and Adult-Directed reduced when corpora were matched, and it even reversed under some conditions. Speech: A Systematic Test With an Ecologically Valid Corpus. Open Mind: These results highlight the interest of studying learnability using naturalistic corpora Discoveries in Cognitive Science, 3, 13–22. https://doi.org/10.1162/opmi_ and diverse algorithmic definitions. a_00022 DOI: https://doi.org/10.1162/opmi_a_00022 INTRODUCTION Supplemental Materials: Although children are exposed to both child-directed speech (CDS) and adult-directed speech https://osf.io/th75g/ (ADS), children appear to extract more information from the former than the latter (e.g., Cristia, Received: 15 May 2018 2013; Shneidman & Goldin-Meadow,2012). -
Multimedia Corpora (Media Encoding and Annotation) (Thomas Schmidt, Kjell Elenius, Paul Trilsbeek)
Multimedia Corpora (Media encoding and annotation) (Thomas Schmidt, Kjell Elenius, Paul Trilsbeek) Draft submitted to CLARIN WG 5.7. as input to CLARIN deliverable D5.C3 “Interoperability and Standards” [http://www.clarin.eu/system/files/clarindeliverableD5C3_v1_5finaldraft.pdf] Table of Contents 1 General distinctions / terminology................................................................................................................................... 1 1.1 Different types of multimedia corpora: spoken language vs. speech vs. phonetic vs. multimodal corpora vs. sign language corpora......................................................................................................................................................... 1 1.2 Media encoding vs. Media annotation................................................................................................................... 3 1.3 Data models/file formats vs. Transcription systems/conventions.......................................................................... 3 1.4 Transcription vs. Annotation / Coding vs. Metadata ............................................................................................. 3 2 Media encoding ............................................................................................................................................................... 5 2.1 Audio encoding ..................................................................................................................................................... 5 2.2 -
Gold Standard Annotations for Preposition and Verb Sense With
Gold Standard Annotations for Preposition and Verb Sense with Semantic Role Labels in Adult-Child Interactions Lori Moon Christos Christodoulopoulos Cynthia Fisher University of Illinois at Amazon Research University of Illinois at Urbana-Champaign [email protected] Urbana-Champaign [email protected] [email protected] Sandra Franco Dan Roth Intelligent Medical Objects University of Pennsylvania Northbrook, IL USA [email protected] [email protected] Abstract This paper describes the augmentation of an existing corpus of child-directed speech. The re- sulting corpus is a gold-standard labeled corpus for supervised learning of semantic role labels in adult-child dialogues. Semantic role labeling (SRL) models assign semantic roles to sentence constituents, thus indicating who has done what to whom (and in what way). The current corpus is derived from the Adam files in the Brown corpus (Brown, 1973) of the CHILDES corpora, and augments the partial annotation described in Connor et al. (2010). It provides labels for both semantic arguments of verbs and semantic arguments of prepositions. The semantic role labels and senses of verbs follow Propbank guidelines (Kingsbury and Palmer, 2002; Gildea and Palmer, 2002; Palmer et al., 2005) and those for prepositions follow Srikumar and Roth (2011). The corpus was annotated by two annotators. Inter-annotator agreement is given sepa- rately for prepositions and verbs, and for adult speech and child speech. Overall, across child and adult samples, including verbs and prepositions, the κ score for sense is 72.6, for the number of semantic-role-bearing arguments, the κ score is 77.4, for identical semantic role labels on a given argument, the κ score is 91.1, for the span of semantic role labels, and the κ for agreement is 93.9. -
A Massively Parallel Corpus: the Bible in 100 Languages
Lang Resources & Evaluation DOI 10.1007/s10579-014-9287-y ORIGINAL PAPER A massively parallel corpus: the Bible in 100 languages Christos Christodouloupoulos • Mark Steedman Ó The Author(s) 2014. This article is published with open access at Springerlink.com Abstract We describe the creation of a massively parallel corpus based on 100 translations of the Bible. We discuss some of the difficulties in acquiring and processing the raw material as well as the potential of the Bible as a corpus for natural language processing. Finally we present a statistical analysis of the corpora collected and a detailed comparison between the English translation and other English corpora. Keywords Parallel corpus Á Multilingual corpus Á Comparative corpus linguistics 1 Introduction Parallel corpora are a valuable resource for linguistic research and natural language processing (NLP) applications. One of the main uses of the latter kind is as training material for statistical machine translation (SMT), where large amounts of aligned data are standardly used to learn word alignment models between the lexica of two languages (for example, in the Giza?? system of Och and Ney 2003). Another interesting use of parallel corpora in NLP is projected learning of linguistic structure. In this approach, supervised data from a resource-rich language is used to guide the unsupervised learning algorithm in a target language. Although there are some techniques that do not require parallel texts (e.g. Cohen et al. 2011), the most successful models use sentence-aligned corpora (Yarowsky and Ngai 2001; Das and Petrov 2011). C. Christodouloupoulos (&) Department of Computer Science, UIUC, 201 N. -
The Relationship Between Transitivity and Caused Events in the Acquisition of Emotion Verbs
Love Is Hard to Understand: The Relationship Between Transitivity and Caused Events in the Acquisition of Emotion Verbs The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters. Hartshorne, Joshua K., Amanda Pogue, and Jesse Snedeker. 2014. Citation Love Is Hard to Understand: The Relationship Between Transitivity and Caused Events in the Acquisition of Emotion Verbs. Journal of Child Language (June 19): 1–38. Published Version doi:10.1017/S0305000914000178 Accessed January 17, 2017 12:55:19 PM EST Citable Link http://nrs.harvard.edu/urn-3:HUL.InstRepos:14117738 This article was downloaded from Harvard University's DASH Terms of Use repository, and is made available under the terms and conditions applicable to Open Access Policy Articles, as set forth at http://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of- use#OAP (Article begins on next page) Running head: TRANSITIVITY AND CAUSED EVENTS Love is hard to understand: The relationship between transitivity and caused events in the acquisition of emotion verbs Joshua K. Hartshorne Massachusetts Institute of Technology Harvard University Amanda Pogue University of Waterloo Jesse Snedeker Harvard University In press at Journal of Child Language Acknowledgements: The authors wish to thank Timothy O’Donnell for assistance with the corpus analysis as well as Alfonso Caramazza, Susan Carey, Steve Pinker, Mahesh Srinivasan, Nathan Winkler- Rhoades, Melissa Kline, Hugh Rabagliati, members of the Language and Cognition workshop, and three anonymous reviewers for comments and discussion. This material is based on work supported by a National Defense Science and Engineering Graduate Fellowship to JKH and a grant from the National Science Foundation to Jesse Snedeker (0623845). -
An Introduction to Applied Linguistics This Page Intentionally Left Blank an Introduction to Applied Linguistics
An Introduction to Applied Linguistics This page intentionally left blank An Introduction to Applied Linguistics edited by Norbert Schmitt Orders: please contact Bookpoint Ltd, 130 Milton Park, Abingdon, Oxon OX14 4SB. Telephone: (44) 01235 827720. Fax: (44) 01235 400454. Lines are open from 9.00 to 5.00, Monday to Saturday, with a 24-hour message answering service. You can also order through our website www.hoddereducation.co.uk If you have any comments to make about this, or any of our other titles, please send them to [email protected] British Library Cataloguing in Publication Data A catalogue record for this title is available from the British Library ISBN: 978 0 340 98447 5 First Edition Published 2002 This Edition Published 2010 Impression number 10 9 8 7 6 5 4 3 2 1 Year 2014, 2013, 2012, 2011, 2010 Copyright © 2010 Hodder & Stoughton Ltd All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher or under licence from the Copyright Licensing Agency Limited. Further details of such licences (for reprographic reproduction) may be obtained from the Copyright Licensing Agency Limited, of Saffron House, 6–10 Kirby Street, London EC1N 8TS. Hachette UK’s policy is to use papers that are natural, renewable and recyclable products and made from wood grown in sustainable forests. The logging and manufacturing processes are expected to conform to the environmental regulations of the country of origin. -
The Correlation Between Self- Assessment Reports and CEFR Levels According to Standardized Assessment
The Correlation between Self- Assessment Reports and CEFR Levels according to Standardized Assessment The DIALANG Vocabulary Test and the Cambridge General English Online Test in relation to two Self-Assessment Procedures: Likert Scale Ratings and the Employment of CEFR Descriptors Master’s Thesis – MA Linguistics: Language and Communication Coaching Radboud University, Nijmegen Student: Laura Buuts Student Number: 4423194 Date of Submission: 21-06-2020 Supervisor: J. Klatter-Folmer Second Reader: E. Krikhaar ABSTRACT This thesis concerns an investigation into the correlation between self-assessment reports regarding vocabulary knowledge and actual levels of vocabulary knowledge according to two standardized assessments. The experiment in this study was conducted by means of a questionnaire consisting of a language testing component and a self-assessment component. The participants in this study are speakers of Dutch as a native language. With the help of regression analyses, it has been shown that the result of this study is that the Common European Framework of Reference (CEFR) levels of vocabulary knowledge according to both the DIALANG Vocabulary Test and the Cambridge General English Online Test correlate with the estimated levels of the self-assessment component. There are differences in the correlation coefficients of both tests and the self-assessment components. Unfortunately, the sample size impeded the confirmation of demographic discrepancies, such as gender, educational background, and use of English in daily life and in the workplace or educational context. Nonetheless, the fact that in general the language testing component correlates with the self- assessment component indicates that in terms of vocabulary, participants are generally able to predict their language skills. -
Distributional Properties of Verbs in Syntactic Patterns Liam Considine
Early Linguistic Interactions: Distributional Properties of Verbs in Syntactic Patterns Liam Considine The University of Michigan Department of Linguistics April 2012 Advisor: Nick Ellis Acknowledgements: I extend my sincerest gratitude to Nick Ellis for agreeing to undertake this project with me. Thank you for cultivating, and partaking in, some of the most enriching experiences of my undergraduate education. The extensive time and energy you invested here has been invaluable to me. Your consistent support and amicable demeanor were truly vital to this learning process. I want to thank my second reader Ezra Keshet for consenting to evaluate this body of work. Other thanks go out to Sarah Garvey for helping with precision checking, and Jerry Orlowski for his R code. I am also indebted to Mary Smith and Amanda Graveline for their participation in our weekly meetings. Their presence gave audience to the many intermediate challenges I faced during this project. I also need to thank my roommate Sean and all my other friends for helping me balance this great deal of work with a healthy serving of fun and optimism. Abstract: This study explores the statistical distribution of verb type-tokens in verb-argument constructions (VACs). The corpus under investigation is made up of longitudinal child language data from the CHILDES database (MacWhinney 2000). We search a selection of verb patterns identified by the COBUILD pattern grammar project (Francis, Hunston, Manning 1996), these include a number of verb locative constructions (e.g. V in N, V up N, V around N), verb object locative caused-motion constructions (e.g. -
The Field of Phonetics Has Experienced Two
The field of phonetics has experienced two revolutions in the last century: the advent of the sound spectrograph in the 1950s and the application of computers beginning in the 1970s. Today, advances in digital multimedia, networking and mass storage are promising a third revolution: a movement from the study of small, individual datasets to the analysis of published corpora that are thousands of times larger. These new bodies of data are badly needed, to enable the field of phonetics to develop and test hypotheses across languages and across the many types of individual, social and contextual variation. Allied fields such as sociolinguistics and psycholinguistics ought to benefit even more. However, in contrast to speech technology research, speech science has so far taken relatively little advantage of this opportunity, because access to these resources for phonetics research requires tools and methods that are now incomplete, untested, and inaccessible to most researchers. Our research aims to fill this gap by integrating, adapting and improving techniques developed in speech technology research and database research. The intellectual merit: The most important innovation is robust forced alignment of digital audio with phonetic representations derived from orthographic transcripts, using HMM methods developed for speech recognition technology. Existing forced-alignment techniques must be improved and validated for robust application to phonetics research. There are three basic challenges to be met: orthographic ambiguity; pronunciation variation; and imperfect transcripts (especially the omission of disfluencies). Reliable confidence measures must be developed, so as to allow regions of bad alignment to be identified and eliminated or fixed. Researchers need an easy way to get a believable picture of the distribution of transcription and measurement errors, so as to estimate confidence intervals, and also to determine the extent of any bias that may be introduced. -
Evaluation, Testing and Assessment
Case Study 4 – Evaluation, testing and assessment “May I help you, madam?” – English for office communication in an adult education centre The use of current evaluation, assessment and testing instruments in a VOLL course Anthony Fitzpatrick with Inge-Anna Koleff, Wiener Volkshochschulen, Austria, and Manfred Thönicke, Hamburger Institut für Berufliche Bildung – HIBB, Germany 1. Introduction Within the context of the February 2009 workshop, a sub-group dealt with evaluation, assessment and testing, basing its work on some of the tools referred to and made available by the animator. The group took as its point of departure the situation of one of the workshop participants, who was in the process of setting up a (VOLL) English language course for employees in her organisation. The group members explored the different assessment systems available to decide on their potential for language testing and assessment procedures for the envisaged target group. They then set up a step-by-step system for evaluating the proficiency of course participants from entry level to achievement of course objectives. By documenting the steps taken, they hope to give colleagues working in VOLL a practical introduction to available tools which they found useful for testing, assessment and evaluation, and guidelines as to how and when to deploy those tools. 2. The setting The language department of the Viennese Adult Education Association (Wiener Volkshochschulen) had decided to set up and offer a new English course for employees in the administration of its 16+ member institutes throughout the city. The course was to address the needs of those operatives, all native speakers of German, who had contact with potential learners/customers in face-to-face or telephone settings.