Lexical Intuitions and Collocation Patterns in Corpora

Total Page:16

File Type:pdf, Size:1020Kb

Lexical Intuitions and Collocation Patterns in Corpora Lexical Intuitions and Collocation Patterns in Corpora by Iain David McGee Centre for Language and Communication Research School o f English, Communication and Philosophy Cardiff University Thesis submitted for degree of Ph.D. Cardiff University June 2006 UMI Number: U584816 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. Dissertation Publishing UMI U584816 Published by ProQuest LLC 2013. Copyright in the Dissertation held by the Author. Microform Edition © ProQuest LLC. All rights reserved. This work is protected against unauthorized copying under Title 17, United States Code. ProQuest LLC 789 East Eisenhower Parkway P.O. Box 1346 Ann Arbor, Ml 48106-1346 DECLARATION This work has not previously been accepted in substance for any degree and is not being concurrently submitted in candidature for any degree. Signed.. Date STATEMENT 1 This thesis is the result of my own investigations, except where otherwise stated. Other sources are acknowledged by footnotes giving explicit references. A bibliography is appended. Signed Date 2 -& . STATEMENT 2 I hereby give consent for my thesis, if accepted, to be available for photocopying and for inter-library loan, and for the title and summary to be made available to outside organizations. Signed Date Thesis Abstract Language teachers are often called upon by their students to provide examples of vocabulary usage in the classroom. Drawing on their experience of language, these teachers model lexical combinations and collocations, not only in their classes, but also in materials writing. However, corpus linguists have claimed that native speaker intuitions about the typical collocates of words are not reliable, because they do not align with the patterns observed in large corpora. These claims are critically evaluated, and an alternative explanation for the mismatch, the possibility that the corpora might not be representative of actual language in use, is also examined. Various linguistic and psycholinguistic explanations for the disparity between corpus data and elicited data are examined, and theories dealing with the mental representation of collocations are also discussed. Data from word frequency estimate research, and word association research are also analysed for relevant information on the subject. Five experiments are then reported, investigating the ability of native speakers (students and EFL teachers) and non­ native speakers (Arab university teachers) to rank, recognize and spontaneously produce frequent adjective-noun collocations. The results indicate that a key factor affecting the ‘quality’ of lexical intuitions may be the employment of an ‘availability heuristic’ in judgements of frequency. It is argued that some collocates of words may be more hidden from memory searches than others, and that there may be a systematic bias in the respondants’ lexical intuitions based on how words are stored in the mental lexicon. Conclusions are drawn that reflect the many facets of research relevant to the questions under discussion: corpus linguistics, frequency theory, word association research, learning theory and theories of lexical storage. The thesis ends in applying some of the key findings to language teachers. Acknowledgments I would like to thank Prof. Alison Wray for her help and support during the writing of the thesis. She has shown great patience and given much encouragement along the way: she has been a model supervisor. I would also like to thank Gaetanelle Gilquin for alerting me to some relevant research on elicited data and corpus data research. The libraiy staff at King Fahd University of Petroleum and Minerals, Saudi Arabia, have been very helpful in obtaining copies of hard-to-come-by journal articles and books - I am thankful for their help and hard work. Without the respondents giving me their time, this research could not have been conducted and I gratefully acknowledge their willingness to participate in the experiments which are reported in this thesis. Last, but not least, I should like to thank my wife and children for their patience and support, and for giving me the time to put in the many hours that have been spent on this project. This thesis is dedicated to them. Table of Contents Declaration Thesis Abstract Acknowledgments Chapter 1 The Claims of Corpus Linguists against Lexical Intuitions ............ 1. Introduction 2. Background 3. Specific claims 3.1. Collocation 3.2. Frequency 3.3. Semantic prosody and pragmatics 3.4. Phraseology 3.5. The true place for intuitions 4. Explanations for data differences 4.1. Connotation, denotation and delexicalisation 4.2. Saliency 5. Summary Chapter 2 Corpus Representativeness: Establishing the Parameters of Usage 1. Introduction 2. Objection 1: Different registers different data 2.1. Word frequencies 2.2. Word meanings 2.3. Collocations 2.4. Spoken and written English 2.5. National Englishes 3. Objection 2: Representativeness 3.1. Size 3.2. Content 3.3. The BNC 4. Objection 3: The Internet as a super-corpus 5. Objection 4: The dating of a corpus 6. Objection 5: Corpus material and significance 7. Objection 6: Different corpora different data 7.1. Comparisons o f ‘traditional’ corpora 7.2. Comparisons of ‘traditional’ corpora and the Internet. 7.2.1. Frequencies of words 7.2.2. Frequencies of multi-word items 8. Summary Page No. Chapter 3 Collocation Classification and Psycholinguistic Representation .................................63 1. Introduction 2. Collocation: definition and classification 2.1. Co-occurrence of words 2.2. Frequency 2.3. Restrictedness 2.4. Selectional restrictions and embedded collocations 3. Categorising adjectives 4. Collocation representation: psycholinguistic perspectives 4.1. Background 4.2. Psycholinguistic explanations for data differences 4.3. Wray’s formulaic language model 4.4. Evidence for the existence of fused language or formulaic language 5. Summary Chapter 4 Word Frequency Estimation and Frequency Estimation Theory ...............................94 1. Introduction 2. Frequency estimation research 2.1. Word frequency 2.1.1. Methodological issues 2.1.2. Different corpora different results? 2.1.3. Priming effects 2.2. Collocation frequency 2.3. Miscellaneous 3. The coding of and access to frequency information 3.1. Coding of frequency information 3.1.1. Indirect and direct coding 3.1.2. The automatic encoding of frequency 3.2. Access to frequency information 4. Summary Chapter 5 Ranking Restricted and Free Collocations ...................................................................116 1. Introduction 2. Statistical measures of collocation strength and raw frequency co-occurrence data 3. Experiment 1 3.1. Rationale for experiment and hypotheses 3.2. Experiment design 3.2.1. Choice of adjectives and collocations 3.2.2. Differences in frequency between test items 3.2.3. Collocation frequency and raw noun frequency 3.2.4. The sets for testing 3.3. Methodology 3.4. Subjects 3.5. Results and analyses 3.6. Discussion: testing the hypotheses 3.6.1. Evidence of embedding effects 3.6.2. Evidence of saliency effects 3.6.3. Miscellaneous explanations 4. Summary Chapter 6 Insights from Word Association Studies ............................................. 1. Introduction 2. Review of word association testing 2.1. Background 2.2. Methodological issues 2.2.1. Controlled association and free association 2.2.2. Single and multiple responses 2.3. Types of stimuli and response classification 2.3.1. Stimulus type and its effect on response type 2.3.2. Response types: paradigmatic, syntagmatic, clang 2.4. Native speaker and non-native speaker responses 2.4.1. Native speakers 2.4.2. Non-native speakers 2.4.3. Developmental issues 3. Adjective-noun collocations in word association tests 3.1 Introduction and research questions 3.2. Analysis of Moss & Older (1996) data 3.2.1. Adjective stimuli and responses 3.2.1.1. Analysing the data 3.2.1.2. Results 3.2.1.3. Discussion 3.2.2. Noun stimuli and responses 3.2.2.1. Analysing the data 3.2.2.2. Results 3.2.2.3. Discussion 4. Summary Page No. Chapter 7 Testing Productive and Receptive Knowledge of Noun Collocates of Frequent Adjectives ..................................................................192 1. Introduction 2. Experiment 2 2.1. Main research question and hypothesis 2.2. Subjects 2.2.1. Group 1 - Native speakers (NSs) 2.2.2. Group 2 - Non-native speakers (NNSs) 2.3. Designing the experiment 2.3.1. Justification for methodology adopted 2.3.2. Criteria affecting the choice of word stimuli 2.3.3. The issue of lemmatisation 2.3.4. The adjective stimuli 2.4. Method 2.5. Quantitative analyses - justification for choice of test 2.6. Results 2.6.1. Native speakers 2.6.1.1. Analysis 1 - Excluding ties 2.6.1.2. Analysis 2 - Retaining ties 2.6.2. Non-native speakers 2.6.2.1. Analysis 1 - Excluding ties 2.6.2.2. Analysis 2 - Retaining ties 2.6.3. Comparison of native speaker and non-native speaker responses 2.6.3.1. Quantitative analyses 2.6.3.1.1. Number of different responses 2.6.3.1.2. Comparison of NS and NNS group ranks 2.6.3.2. Qualitative analyses 2.6.4. The dominant responses 2.7. Discussion 3. Experiment 3 3.1. Main research interest and hypothesis 3.2. Designing the experiment 3.3. Method 3.4. Subjects 3.5. Results 3.5.1. Native speakers 3.5.1.1. Analysis 1 (Majority choice) 3.5.1.2. Analysis 2 (Votes) 3.5.2. Non-native speakers 3.5.2.1. Analysis 1 (Majority choice) 3.5.2.2. Analysis 2 (Votes) Page No. 3.6. Discussion 4. Summary Chapter 8 Testing Productive and Receptive Knowledge of Adjective Collocates of Frequent Nouns ..................................................................252 1. Introduction 2. Experiment 4 2.1. Rationale for experiment 2.2. Research questions 2.3.
Recommended publications
  • The Impact of Corpus-Based Collocation Instruction on Iranian EFL Learners' Collocation Learning
    Universal Journal of Educational Research 2(6): 470-479, 2014 http://www.hrpub.org DOI: 10.13189/ ujer.2014.020604 The Impact of Corpus-Based Collocation Instruction on Iranian EFL Learners' Collocation Learning Shabnam Ashouri*, Masoume Arjmandi, Ramin Rahimi Department of English, Guilan Science and Research Branch, Islamic Azad University, Guilan, Iran *Corresponding Author: [email protected] Copyright © 2014 Horizon Research Publishing All rights reserved. Abstract Over the past decades, studies of EFL/ESL drawn to vocabulary teaching and its collocation. Afterward vocabulary acquisition have identified the significance of they came to this truth that lack of knowledge of collocation collocations in language learning. Due to the fact that can prevent learners from inferring, and they also collocations have been regarded as one of the major concerns misunderstand native speakers. Wrong use of collocation can of both EFL teachers and learners for many years, the present cause native speakers not to comprehend what the learners study attempts to shed light on the impact of corpus-based say. According to Zarie and Gholami (2007), another collocation on EFL learners' collocation learning and problem of second language learners is the function of awareness. 60 Iranian EFL learners, who participated in this near-synonyms, particularly their collocations. study, were chosen randomly based on their scores in an Near-synonyms are pairs of words with parallel meanings, OPT exam. There were two groups, experimental and control but diverse collocations. Strong and powerful are two cases ones. The study examined the effects of direct corpus-based of near-synonyms. Tea can be strong, but not powerful.
    [Show full text]
  • English Collocations in Use Intermediate Book with Answers
    McCarthy and O’Dell McCarthy and ENGLISH COLLOCATIONS IN USE Collocations are combinations of words, which frequently appear together. Using Intermediate them makes your English sound more natural. Knowledge of collocations is often tested in examinations such as Cambridge FCE, CAE, CPE and IELTS. This book is suitable for ENGLISH students at good intermediate level and above. Using collocations will improve your style of written and spoken English: ENGLISH • Instead of ‘a big amount’, say ‘a substantial amount’ • Instead of ‘think about the options’, say ‘consider the options’ COLLOCATIONS • Using collocations will make your English sound more natural: • Instead of ‘get ill’, say ‘fall ill’ COLLOCATIONS • Instead of ‘a bigCURRENT fine’, say ‘a BCC heavy TOO fine’ LONG Using collocationsFOR will helpNEW you DESIGN avoid common learner errors: How words work • Instead of ‘do a choice’, say ‘make a choice’ together for fluent • Instead of ‘make your homework’, say ‘do your homework’ IN USE and natural English English Collocations in Use Intermediate Self-study and • 60 easy-to-use two-page units: collocations are presented and explained IN USE on left-hand pages with a range of practice exercises on right-hand pages. classroom use • Presents and explains approximately 1,500 collocations in typical contexts Second Edition using short texts, dialogues, tables and charts. Also available • Contains a comprehensive answer key and full index for easy reference. CAMBRIDGE LEARNER’S DICTIONARY• FOURTHHighlights EDITION register to help students choose the appropriate language for ENGLISH VOCABULARY IN USE UPPER-INTERMEDIATEparticular situations. Intermediate ENGLISH PRONUNCIATION IN USE INTERMEDIATE • Informed by the Cambridge English Corpus to ensure that the most frequently used collocations are presented.
    [Show full text]
  • Collocation As Source of Translation Unacceptabilty: Indonesian Students’ Experiences
    International Journal of English Linguistics; Vol. 3, No. 5; 2013 ISSN 1923-869X E-ISSN 1923-8703 Published by Canadian Center of Science and Education Collocation as Source of Translation Unacceptabilty: Indonesian Students’ Experiences Syahron Lubis1 1 Postgraduate Studies of Linguistics, University of Sumatera Utara, Indonesia Correspondence: Syahron Lubis, Postgraduate Studies of Linguistics, University of Sumatera Utara, Indonesia. E-mail: [email protected] Received: July 24, 2013 Accepted: August 19, 2013 Online Published: September 23, 2013 doi:10.5539/ijel.v3n5p20 URL: http://dx.doi.org/10.5539/ijel.v3n5p20 Abstract The aims of the present study are to explore wrong English collocations made by some Indonesian English learners and to find out the causes of the wrong collocations. Twenty seven wrong English lexical collocations and nine wrong grammatical collocations collected from the students’ translation and writing assignments have been examined. After comparing the patterns of English collocations and the Indonesian collocations it is found out that the erroneous English collocations are attributed to four causes: (1) learners’ lack of knowledge of collocation, (2) differences of collocations between English and Bahasa Indonesia, (3) learners’ low mastery of vocabulary and (4) strong interferences of the learners’ native language.It is suggested that the students should be informed that collocation, like grammar, is one aspect of not only English but all other languages that should be learnt if they want their English sound natural and mastery of vocabulary can contribute to the constructing of correct collocations. Keywords: collocation, translation, vocabulary, interference 1. Introduction In learning a foreign language a learner is faced with a variety of problems.
    [Show full text]
  • Towards a Model for Investigating Predicate-Intensifier Collocations
    WORK IN PROGRESS Towards a Model for Investigating Predicate-Intensifier Collocations. Silvia Cacchiani Post-doctoral student Dipartimento di Anglistica, Università di Pisa Via Santa Maria 67,1 - 56126 Pisa (PI) [email protected] Abstract Adverbial intensifiers express the semantic role of degree. Here, we shall focus on English upgrading intensifiers like very, absolutely, extremely, impossibly. Specifically, what we have mainly aspire at is to develop and apply a simple but efficient model that investigates the motivations behind choosing from among competing intensifiers in a non-haphazard way. Such a model is meant to work as a "combinatory chart" that allows for fair comparison ofnear-synonymic intensifiers with respect to a number ofparameters ofvariations (or textual preferences) on the morpho-syntactic, lexico-semantic and discourse-pragmatic levels. Its ultimate lexicographic contribution to the issue of predicate-intensifier collocations will be building a combinatory dictionary of English intensifiers - and, later on, a bilingual combinatory dictionary of English and Italian intensifiers. 1. Corpus Data and Methodology bi the current paper we want to depict a model for investigating predicate-intensifier collocations.1 Upgrading intensifiers constitute an extremely varied lexico-functional category. They boost a quality ab:eady present in their predicate (i.e. head) along an imaginary scale of degree of intensity. The modification introduced cannot be objectively measured (e.g. deadgorgeous as agaiastfully developed countries). The corpus consulted was the BNC (http://www.natcorp.ox.ac.uk/index.html). We only searched the www in the case ofpoorly or not represented at all intensifiers. Since most intensifiers are polyfimctional words ambiguous between different interpretations, one of which is precisely intensification (e.g.
    [Show full text]
  • A Corpus-Based Study of the Effects of Collocation, Phraseology
    A CORPUS-BASED STUDY OF THE EFFECTS OF COLLOCATION, PHRASEOLOGY, GRAMMATICAL PATTERNING, AND REGISTER ON SEMANTIC PROSODY by TIMOTHY PETER MAIN A thesis submitted to the University of Birmingham for the degree of DOCTOR OF PHILOSOPHY Department of English Language and Applied Linguistics College of Arts and Law The University of Birmingham December 2017 University of Birmingham Research Archive e-theses repository This unpublished thesis/dissertation is copyright of the author and/or third parties. The intellectual property rights of the author or third parties in respect of this work are as defined by The Copyright Designs and Patents Act 1988 or as modified by any successor legislation. Any use made of information contained in this thesis/dissertation must be in accordance with that legislation and must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the permission of the copyright holder. Abstract This thesis investigates four factors that can significantly affect the positive-negative semantic prosody of two high-frequency verbs, CAUSE and HAPPEN. It begins by exploring the problematic notion that semantic prosody is collocational. Statistical measures of collocational significance, including the appropriate span within which evaluative collocates are observed, the nature of the collocates themselves, and their relationship to the node are all examined in detail. The study continues by examining several semi-preconstructed phrases associated with HAPPEN. First, corpus data show that such phrases often activate evaluative modes other than semantic prosody; then, the potentially problematic selection of the canonical form of a phrase is discussed; finally, it is shown that in some cases collocates ostensibly evincing semantic prosody occur in profiles because of their occurrence in frequent phrases, and as such do not constitute strong evidence of semantic prosody.
    [Show full text]
  • Lexical Collocation Errors in Literary Translation
    Dil Dergisi Sayı/Number: 170/1 Ocak/January Gönderildiği tarih: 15.11.2018 Kabul edildiği tarih: 10.01.2019 DOI: 10.33690/dilder.528981 Keywords Anahtar Sözcükler Lexical Collocation Translation, L1 Influence, Action, Verb, Russian, Turkish, Semantic Restriction On Collocability, Translations Of Literary Texts LEXICAL COLLOCATION ERRORS IN LITERARY TRANSLATION YAZINSAL METİNLERİN ÇEVİRİSİNDE SÖZCÜKSEL EŞDİZİM HATALARI • Özgür Şen Bartan Dr. Öğr. Üyesi, Kırıkkale Üniversitesi, Fen-Edebiyat Fakültesi, Batı Dilleri ve Edebiyatları Bölümü, Mütercim Tercümanlık A.B.D., [email protected] Abstract Öz This study aims to explore lexical collocation errors in Bu çalışmanın amacı Türkçe-İngilizce yazınsal metinlerin Turkish-English translations of literary texts. To identify, çevirisinde sözcüksel eşdizim hatalarını araştırmaktır. Türkçe describe and explain the errors (Ellis, 1985), a written corpus yazınsal metinlerin İngilizce çevirilerinde yapılan hataları of Turkish literary texts and their English translation texts belirlemek, tanımlamak ve açıklamak (Ellis, 1985) için İngilizce (ETT) was compiled from students of the Department of Mütercim Tercümanlık Anabilim dalındaki öğrencilerin English Translation and Interpretation. Benson et al.’s (1997) çevirilerinden oluşan yazılı derlem oluşturulmuştur. Bunun classifications for lexical collocations have been used in order yanı sıra, en sık yapılan sözcüksel eşdizim hataları (eylem+ad) to analyse lexical collocation errors for this study. Also, the kısıtlı eşdizimlilik ve anadilin etkisi açısından
    [Show full text]
  • Semantic Prosody and Intensifier Variation in Academic Speech
    SEMANTIC PROSODY AND INTENSIFIER VARIATION IN ACADEMIC SPEECH by ALLISON REBECCA WACHTER (Under the Direction of Chad Howe) ABSTRACT The study of English intensifiers has been of interest in sociolinguistic research. This paper analyzes the variation of common intensifiers very and really in a corpus of Academic English and the language-internal and –external factors that predict this variation. All factors are related to a speaker’s evaluation, specifically to the semantic notion of positive, negative, or neutral prosody. Using a variationst approach, this paper furthers insight to the grammaticalization of intensifiers and how the effect of delexicalization can predict the semantic properties of a modified adjective. The significant factors predicting very/really variation were academic setting, semantic prosody, academic discipline, and gender. The study further develops a methodological framework for operationalizing semantic prosody and producing quantitative results. A second analysis concerning the distinctions between other modifiers, namely reinforcers and attenuators, are also analyzed in the academic corpus. Finally, the paper discusses its support for the use of smaller corpora to examine and compare linguistic effects in more specific registers. INDEX WORDS: semantic prosody, intensifiers, language variation SEMANTIC PROSODY AND INTENSIFIER VARIATION IN ACADEMIC SPEECH by ALLISON REBECCA WACHTER BA, University of Michigan, 2008 A Thesis Submitted to the Graduate Faculty of The University of Georgia in Partial Fulfillment of the Requirements for the Degree MASTER OF ARTS ATHENS, GEORGIA 2012 © 2012 Allison Rebecca Wachter All Rights Reserved SEMANTIC PROSODY AND INTENSIFIER VARIATION IN ACADEMIC SPEECH by ALLISON REBECCA WACHTER Major Professor: Chad Howe Committee: Anya Lunden Paula Schwanenflugel Electronic Version Approved: Maureen Grasso Dean of the Graduate School The University of Georgia May 2012 ACKNOWLEDGEMENTS This thesis would not have been possible without the guidance of my advisor, Dr.
    [Show full text]
  • Collocations As a Language Resource a Functional and Cognitive Study in English Phraseology
    Collocations as a language resource A functional and cognitive study in English phraseology Sonja Poulsen Ph.d. dissertation Institute of Language and Communication University of Southern Denmark 2005 Supervisors Associate professor Fritz Larsen, University of Southern Denmark Associate professor John M. Dienhart, University of Southern Denmark Associate professor Alex Klinge, Copenhagen Business School Front cover Peter Harder’s analogy (1996a: 91) between the functional role that wings play for the survival of birds in biology and the functionality of linguistic constructions inspired me to use birds for my front cover illustration. My thanks to Kurt Normann for letting me use parts of his etching of flying rooks. Sonja Poulsen Institute of Language and Communication University of Southern Denmark DK-5230 Odense M Tel.: +45 6550 2188 E-mail: [email protected] © Sonja Poulsen 2005 Contents: Typographical conventions 3 Abbreviations 3 Tables 4 Figures 4 English résumé 6 Preface and acknowledgements 10 PART ONE 1. Introduction 11 1.1. Phraseology: the traditional approach 12 1.2. Definitions of ‘collocation’ 14 1.3. Methodology and classification in phraseology 17 1.4. Motivation for a functional and cognitive approach 23 1.5. What should a theory of collocations be able to account for? 28 1.6. Overview 33 PART TWO 2. The foundations of a traditional approach to phraseology 34 2.1. Theoretical influences on phraseology 35 2.1.1. A practical concern: teaching English as a foreign language 35 2.1.2. Firthian linguistics 38 2.1.3. Underlying assumptions 48 2.1.3.1. Structuralist dichotomies 48 2.1.3.2.
    [Show full text]
  • Verb Collocations in Medical English
    US-China Foreign Language, ISSN 1539-8080 August 2013, Vol. 11, No. 8, 609-618 D DAVID PUBLISHING Verb Collocations in Medical English Bionote Evelina Miščin College of Business and Management, Zagreb, Croatia The aim of this paper was to investigate the most frequent verb collocations which occur in medical texts and to check the competence of the first-year students of medical English in using them. Merck’s Manual of Medical Information (2004) was used as a corpus to find the most frequent collocations. It was analyzed by several programmes like Simple Concordance, Collocation Extract, TermeX. The corpus analysis established that the nouns “function” and “infection” occur with most verbs (30). They are followed by “pain” (28 verbs), “muscle” (24 verbs). Three hundred and sixty-two verbs occur with nouns and among them the most frequent are “cause”, “have”, “develop”, “treat”, “prevent”, and “produce”. After that, the test was devised to see which collocations students use with the most competence. Two hundred and ninety-seven first-year students of School of Medicine in Zagreb were tested. There were four types of exercises—multiple choice, gap-fill, translation from English into Croatian, and vice versa. The average result in multiple choice was 9.8 with the s = 2.0 as a standard deviation, in gap-fill 5.0 with s = 2.17, in translation into Croatian 6.7 with s = 2.10, and in translation into English 5.2 with s = 2.53. Then, glossary was made which should help future users of medical English. Keywords: collocations, collocational competence, corpus, collocational mistakes, glossary Introduction This paper deals with verb collocations in medical English.
    [Show full text]
  • Collocations Or Free Variation? Empirical Study on the Use of Adverbs in Pre-Verb Position by German ESL Learners
    Philipps-Universität Marburg Institut für Anglistik und Amerikanistik – Englische Sprachwissenschaft SS 2004 10 038 SE Foreign Language Vocabulary Prof. Dr. Rüdiger Zimmermann Collocations or Free Variation? Empirical Study on the Use of Adverbs in Pre-Verb Position by German ESL Learners Dirk Hovy Hirschberg 9 35037 Marburg Subjects: DsuL, Spw. Angl., Ling. Eng. MA Matr.Nr.: 1420461 Table of Content 1. INTRODUCTION.......................................................................................................................3 2. RESEARCH QUESTIONS .......................................................................................................3 3. PREVIOUS RESEARCH ..........................................................................................................6 4. METHODOLOGY .....................................................................................................................9 5. RESULTS...................................................................................................................................14 5.1 PARTICIPANTS .......................................................................................................................14 5.2 ADVERBS...............................................................................................................................15 Sentence 1 ..............................................................................................................................15 Sentence 2 ..............................................................................................................................16
    [Show full text]
  • Induction of Syntactic Collocation Patterns from Generic Syntactic Relations
    Induction of Syntactic Collocation Patterns from Generic Syntactic Relations Violeta Seretan University of Geneva Language Technology Laboratory 2 Rue de Candolle, Geneva, CH-1211, Switzerland [email protected] Abstract 2 Syntactic Patterns for Collocations Syntactic configurations used in collocation extrac- In the most permissive case (when no extraction patterns are tion are highly divergent from one system to an- defined), any pair of words is regarded as a valid collocation other, this questioning the validity of results and candidate. This results in much noise, since the most fre- making comparative evaluation difficult. We de- quent word combinations are also the least interesting (e.g., scribe a corpus-driven approach for inferring an ex- “of the”, “in the”). Therefore, many extraction systems per- haustive set of configurations from actual data by form the linguistic analysis of text and apply a linguistic filter finding, with a parser, all the productive syntactic on collocation candidates (very often, they perform POS tag- associations, then by appealing to human expertise ging in order to filter out the pairs involving function words2). for relevance judgements. There is unfortunately much disagreement with respect to the accepted syntactic configurations for collocations (often as a consequence of the lack of consensus in the understand- 1 Introduction ing of the notion of collocation). There is much divergence, The term collocation, often used in different senses in the first, with respect to the POS of participating words. Some literature, is understood here as in the following statement: authors consider open-class words only ([Justeson and Katz, “The term collocation refers to the idiosyncratic syntagmatic 1995; Hausmann, 1989]), while most of them allow for func- combination of lexical items and is independent of word tion words too (as in “agree on”).
    [Show full text]
  • Idioms, Collocations, and Structure Conventionalized Expressions and the Analysis of Ditransitives
    Idioms, Collocations, and Structure Conventionalized Expressions and the Analysis of Ditransitives Benjamin Bruening (University of Delaware) rough draft, January 15, 2018; comments welcome Abstract Noncompositional phrasal idioms have been used as evidence in syntactic theorizing for decades. A standard assumption, occasionally made explicit (e.g., Larson 2017), is that phrasal idioms differ signifi- cantly from fully compositional collocations in the kinds of syntactic structures they can be built from. I show with a detailed empirical study that this is false. In fact, the syntactic constraints on idioms and col- locations are identical. They should therefore be treated the same, as a broad class of conventionalized expressions. I show that the selection theory of idioms proposed in Bruening (2010) extends straight- forwardly to cover literal collocations. A minor modification accounts for patterns of conventionalized expressions that are striking in their absence. Furthermore, a detailed analysis of the patterns of conven- tionalized expressions in ditransitive constructions shows that they are systematic and decide between theoretical proposals for the structure of ditransitive constructions in English (contra Larson 2017). Only the ApplP analysis developed in Bruening (2010) accounts for the patterns. 1 Introduction There is a long tradition in generative syntax of examining the syntactic patterns of non-literal, convention- alized expressions, and using those patterns to inform our theories of syntax, or even to decide between competing analyses of some phenomenon. As an example, patterns of non-literal phrasal idioms have been used as arguments for the proper analysis of ditransitive constructions (e.g., Green 1974, Larson 1988, Richards 2001, Harley 2002, Bruening 2010).
    [Show full text]