Phraseology and Epistemology in Scientific Writing: a Corpus-Driven Approach

Total Page:16

File Type:pdf, Size:1020Kb

Phraseology and Epistemology in Scientific Writing: a Corpus-Driven Approach PHRASEOLOGY AND EPISTEMOLOGY IN SCIENTIFIC WRITING: A CORPUS-DRIVEN APPROACH BY Garry Lee Plappert A thesis submitted to The University of Birmingham for the degree of DOCTOR OF PHILOSOPHY Department of English School of English, Drama, Canadian and American Studies July 2012 Abstract This thesis uses the tools and methods of corpus linguistics to study the process of knowledge encoding in a corpus of texts from the scientific discipline of genetics. It is argued here that the approach taken fits into the tradition of corpus-driven approaches to linguistic questions in that no assumption is made about the linguistic form that this knowledge encoding will take. Instead the study proceeds by identifying a set of keywords using the concept of lexical chains to identify items of terminology. The investigation of these uses the cluster function of WordSmith Tools (Scott 2004) and is qualitative, following Sinclair (1991; 2004) in attempting to develop a picture of the typical linguistic nature of the patterns surrounding these clusters inductively through a process of studying collocation and colligation patterns and identifying phraseology. It is argued here that such an approach is required to discover linguistic aspects of epistemic encoding that have as yet not been identified by those working in the related fields of discourse analysis or corpus linguistics. 2 Table of Contents Table of Contents Chapter 1: Introduction .......................................................................................................... 8 1.1 Introduction .................................................................................................................................. 8 1.2 Stating the research problem(s) ............................................................................................ 9 1.3 Justifying the research ............................................................................................................ 12 1.4 Outline of the study .................................................................................................................. 15 1.5 Some assumptions and limitations of this research .................................................... 17 Chapter 2: Epistemology, social epistemology and the philosophy of science 22 2.1 Traditional Epistemology ...................................................................................................... 23 2.1.1 Epistemology in analytic philosophy ....................................................................................... 23 2.1.2 A crisis in the traditional view of knowledge: the Gettier case ..................................... 23 2.2 The move towards Social epistemology ........................................................................... 25 2.3 The analytic tradition of the philosophy of science ..................................................... 30 2.4 Conclusion ................................................................................................................................... 33 Chapter 3: The contribution of linguistics .................................................................... 35 3.1 The study of discourse ............................................................................................................ 36 3.2 Non-corpus study of scientific texts ................................................................................... 39 3.3 Corpus Linguistics: concordances, collocation and keywords ................................. 44 3.4 Applications of Corpus Linguistics ..................................................................................... 47 3.5 Corpus linguistics, scientific texts and academic writing .......................................... 49 3.6 The potential of corpus-based and corpus driven approaches ............................... 54 3.6.1 The corpus-based/corpus-driven distinction ...................................................................... 55 3.6.2 Epistemic signalling and the corpus-driven approach ..................................................... 56 3.7 Conclusion ................................................................................................................................... 61 Chapter 4: Methodology ...................................................................................................... 63 4.1 Introduction ............................................................................................................................... 63 4.2 Pilot study ................................................................................................................................... 63 4.2.1 The pilot corpus ................................................................................................................................ 63 4.2.2 Extracting an item for further investigation ......................................................................... 67 4.2.3 Exploring an item using the techniques of corpus linguistics ....................................... 72 4.2.4 Investigating an item: The synchronic perspective .......................................................... 73 4.2.5 The synchronic perspective continued: qualitative analysis of expanded contexts ............................................................................................................................................................................. 78 4.2.6 Summary .............................................................................................................................................. 82 4.3 Final corpus construction ...................................................................................................... 83 4.3.1 Purpose ................................................................................................................................................ 84 4.3.2 Genre ..................................................................................................................................................... 84 4.3.3 Size ......................................................................................................................................................... 85 4.3.4 Representativeness ......................................................................................................................... 86 4.3.5 Corpus annotation ........................................................................................................................... 87 4.3.6 Reference corpus .............................................................................................................................. 88 4.4 Data collection and the corpus ............................................................................................ 89 4.5 Extracting useful items for study ........................................................................................ 89 3 4.5.1 whole corpus keywords ................................................................................................................ 90 4.5.2 Extracting discourse objects using lexical chains ............................................................... 97 4.6 Conclusion ................................................................................................................................... 99 Chapter 5: Results ................................................................................................................ 101 5.1 The clusters .............................................................................................................................. 102 5.1.1 Problems with the initial cluster list ..................................................................................... 108 5.1.2 Final list of clusters for investigation in expanded contexts. ...................................... 110 5.2 Clusters containing cells ...................................................................................................... 115 5.2.1 wild type cells .................................................................................................................................. 115 5.2.2 embryonic stem cells ..................................................................................................................... 119 5.3 Clusters containing gene ...................................................................................................... 122 5.3.1 gene expression data ................................................................................................................... 122 5.3.2 gene expression patterns ............................................................................................................. 125 5.3.3 gene expression profiles .............................................................................................................. 131 5.3.4 changes in gene expression......................................................................................................... 136 5.4 Clusters containing genes ........................................................................................................ 138 5.4.1 X linked genes .................................................................................................................................. 138 5.5 Clusters containing expression .............................................................................................. 150 5.6 Clusters containing cell .......................................................................................................
Recommended publications
  • Lectures on English Lexicology
    МИНИСТЕРСТВО ОБРАЗОВАНИЯ И НАУКИ РОССИЙСКОЙ ФЕДЕРАЦИИ ГОУ ВПО «Татарский государственный гуманитарно-педагогический университет» LECTURES ON ENGLISH LEXICOLOGY Курс лекций по лексикологии английского языка Казань 2010 МИНИСТЕРСТВО ОБРАЗОВАНИЯ И НАУКИ РОССИЙСКОЙ ФЕДЕРАЦИИ ГОУ ВПО «Татарский государственный гуманитарно-педагогический университет» LECTURES ON ENGLISH LEXICOLOGY Курс лекций по лексикологии английского языка для студентов факультетов иностранных языков Казань 2010 ББК УДК Л Печатается по решению Методического совета факультета иностранных языков Татарского государственного гуманитарно-педагогического университета в качестве учебного пособия Л Lectures on English Lexicology. Курс лекций по лексикологии английского языка. Учебное пособие для студентов иностранных языков. – Казань: ТГГПУ, 2010 - 92 с. Составитель: к.филол.н., доцент Давлетбаева Д.Н. Научный редактор: д.филол.н., профессор Садыкова А.Г. Рецензенты: д.филол.н., профессор Арсентьева Е.Ф. (КГУ) к.филол.н., доцент Мухаметдинова Р.Г. (ТГГПУ) © Давлетбаева Д.Н. © Татарский государственный гуманитарно-педагогический университет INTRODUCTION The book is intended for English language students at Pedagogical Universities taking the course of English lexicology and fully meets the requirements of the programme in the subject. It may also be of interest to all readers, whose command of English is sufficient to enable them to read texts of average difficulty and who would like to gain some information about the vocabulary resources of Modern English (for example, about synonyms
    [Show full text]
  • Approach to Analysing Phraseology and Collocation in ESP Texts
    ASp la revue du GERAS 59 | 2011 Varia The ‘lexicogrammar’ approach to analysing phraseology and collocation in ESP texts Christopher Gledhill Electronic version URL: http://journals.openedition.org/asp/2169 DOI: 10.4000/asp.2169 ISBN: 978-2-8218-0416-6 ISSN: 2108-6354 Publisher Groupe d'étude et de recherche en anglais de spécialité Printed version Date of publication: 1 March 2011 Number of pages: 5-23 ISSN: 1246-8185 Electronic reference Christopher Gledhill, « The ‘lexicogrammar’ approach to analysing phraseology and collocation in ESP texts », ASp [Online], 59 | 2011, Online since 01 March 2014, connection on 10 December 2020. URL : http://journals.openedition.org/asp/2169 ; DOI : https://doi.org/10.4000/asp.2169 This text was automatically generated on 10 December 2020. Tous droits réservés The ‘lexicogrammar’ approach to analysing phraseology and collocation in ESP ... 1 The ‘lexicogrammar’ approach to analysing phraseology and collocation in ESP texts Christopher Gledhill 1. Introduction 1 The aim of this paper1 is to examine the notions of phraseology and collocation in the field of English for Specific Purposes (ESP) and to recast these terms from the point of view of Systemic Functional Linguistics (SFL). Broadly speaking, phraseology involves the study of formulaic sequences of words, including idiomatic phrases and proverbial expressions, which stand in contrast to other more prosaic constructions in the language in that they have a highly conventionalised form and frame of reference. For example, the rhetorical impact of the phrase (to) cut (one’s) losses (cited in sample text T1 in the Appendix) cannot quite be captured by paraphrases such as: accept what one has lost and move on, stop doing something in order not to make a bad situation worse, etc.
    [Show full text]
  • Introduction: WHY, WHAT and HOW in Phraseology
    Jezikoslovlje 6.1 (2005): 1-5 ɱ 1 0 ÿlanci – Articles - Artikel Marija Omaziü Faculty of Philosophy Josip Juraj Strossmayer University Osijek Introduction: WHY,WHAT and HOW in phraseology In the past 20 years there has been a growing interest in what was traditionally known as ‘idioms’ in the American and British traditions, but what is also called ‘multi-word combinations or units’, ‘fixed expressions’, ‘formulaic language’, ‘coselection of words’, ‘phrasal lexemes’ and ‘phrasemes’ or ‘phraseological units’. Those units are studied in a variety of disciplines of theoretical and ap- plied linguistics: lexicology, lexicography, discourse analysis, corpus analysis, first and second language acquisition, foreign language teaching, cognitive lin- guistics, psycholinguistics, sociolinguistics, semantics, rhetoric and stylistics. This expansion reflects a keener awareness that phraseology is pervasive in speech and writing, and that it plays an important role in first and second lan- guage acquisition and speech production. Despite being undoubtedly fascinating, phraseology has been relatively ne- glected even in lexical studies and semantics. It is only in the past decade that the spectrum of perspectives broadened to include different aspects of the phe- nomenon, thus shaping the new and more widely acknowledged discipline: phraseology. One of the reasons for the increased interest in phraseological units is the tendency in modern linguistics to study language as a mental phenome- non, and the tendency towards interdisciplinary studies. In the logical approach to language, PUs were treated as exceptions to the rule, as special lists of lexi- cally fossilized phrases with specific meanings that language users consulted only if they could not interpret the expression literally.
    [Show full text]
  • English Language and Literature Majοr
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Khazar University Institutional Repository KHAZAR UNIVERSITY Faculty: Schοοl οf Humanities and Sοcial Sciences Department: English Language and Literature Majοr: Linguistics MA THESIS Theme “Analysis of phraseological units in linguistics” Master Student: Aydan Kerimova Supervisοr: Eldar Shahgaldiyev INTRODUCTION CHAPTER I. CHARACTERISTIC FEATURES OF LANGUAGE PHRASEOLOGY 1.1. PHRASEOLOGY - AS A LINGUISTIC DISCIPLINE 1.2 EQUIVALENCE PHRASEOLOGY WORD 1.3. THE CORRELATION OF PHRASEOLOGY AND WORDS CHAPTER II. VOLUME PHRASEOLOGY IN THE CONCEPTS OF NATIONAL SCIENTISTS 2.1 CLASSIFICATION PHRASEOLOGICAL UNITS 2.2 EXPANSION OF PHRASEOLOGICAL UNITS 2.3 THE PROBLEM OF MODELING CHAPTER III. ANALYSIS OF THE MEANS OF EXPRESSION AS PART OF PHRASEOLOGY 3.1 THEORY STYLISTIC CONTEXT 3.2 LEXICAL AND SYNTACTIC MEANS OF EXPRESSION 3.3 PHONETIC MEANS OF EXPRESSION / ALLITERATION, RHYME/ 3.4 CONVERGENCE MEANS OF EXPRESSION CONCLUSION LIST ΟF USED LITERATURE Introduction Thesis is devoted to research of phraseological units Problems of phraseology are traditionally the focus of linguists, there were different opinions regarding the criteria for determining phraseology, status phraseological unit, the characteristics of the system of relations in phraseology, and so on. The number of concepts in linguistics is so great that some of the linguists involuntarily lost faith in the ability to find a way out of the situation, and some linguists especially among new phraseologists even tend to cast doubt on the idea of combining different types of phraseological expressions. The criteria for determining phraseology in linguistics is called in various combinations stability, integrity values are not output from the sum of the values of its constituent words separate formation, the possibility of structural variants or new growths, reproducibility, equivalent words, untranslatability into other languages.
    [Show full text]
  • Terminology, Phraseology, and Lexicography 1. Introduction Sinclair
    Terminology, Phraseology, and Lexicography1 Patrick Hanks Institute of Formal and Applied Linguistics, Charles University in Prague This paper explores two aspects of word use and word meaning in terms of Sinclair's (1991, 1998) distinction between the open-choice principle (or terminological tendency) and the idiom principle (or phraseological tendency). Technical terms such as strobilation are rare, highly domain-specific, and of little phraseological interest, although the texts in which such word occur do tend to contain interesting clusters of domain-specific terminology. At the other extreme, it is impossible to know the meaning of ordinary common words such as the verb blow without knowing the phraseological context in which the word is used. Many words have both a terminological tendency and a phraseological tendency. In some cases the two tendencies are in harmony; in other cases there is tension between them. The relationship between these two tendencies is investigated, using examples from the British National Corpus. 1. Introduction Sinclair (1991) makes a distinction between two aspects of meaning in texts, which he calls the open-choice principle and the idiom principle. In this paper, I will try to show why it is of great importance, not only for understanding meaning in text, but also for lexicography, to try to take account of both principles even-handedly and assess the balance between them when analyzing the meaning and use of a word. The open-choice principle (alternatively called the terminological tendency) is: a way of seeing language as the result of a very large number of complex choices. At each point where a unit is complete (a word or a phrase or a clause), a large range of choices opens up and the only restraint is grammaticalness.
    [Show full text]
  • Motivation and Etymology of Phraseological Units in English and Albanian Language
    E-ISSN 2039-2117 Mediterranean Journal of Social Sciences Vol 5 No 1 ISSN 2039-9340 MCSER Publishing, Rome-Italy January 2014 Motivation and Etymology of Phraseological Units in English and Albanian Language Lediana Beshaj PhD Candidate, Lecturer at Tirana University Albania [email protected] Doi:10.5901/mjss.2014.v5n1p189 Abstract “The notion of phraseology implies much more than inventories of idioms and systems of lexical patterns. Phraseology is a dimension of language use in which patterns of wording (lexicons- grammatical patterns) encode semantic views of the world, and at a higher level idioms and lexical phrases have rhetorical and textual roles within a specific discourse. Phraseology is at once a pragmatic dimension of linguistic analysis, and a system of organization which encompasses more local lexical relationships, namely collocation and lexicon grammar” (Gledhill 2000, p.202). The reason why I decided to write this paper is the fact that apart from investigating the phraseological units thoroughly, of a great challenge would be the discovery of their meaning, their etymology and their motivation, specifically in the Albanian language, a language which according to the “Fjalorit të sotëm te gjuhës shqipe” – “The Dictionary of the today’s Albanian language”, over 50% of the word have more than one meaning, i.e. they are poly-semantic words. On the other hand we have the English language which is spoken by 400 million people as a mother tongue and it is a worldwide international language where phraseological units play an integral part of the language .In this paper we aim to see the similarities and the differences between the two languages.
    [Show full text]
  • Constructive Linguistics for Computational Phraseology: the Esperanto Case
    Constructive Linguistics for Computational Phraseology: the Esperanto Case Federico Gobbo1,2[0000−0003−1748−4921] 1 University of Amsterdam, the Netherlands 2 University of Turin, Italy [email protected] http://uva.nl/profile/f.gobbo/ Abstract. This paper presents the application of the constructive ad- positional grammars (CxAdGrams) to phraseological units, through the special case study of Esperanto. Constructive linguistics is an approach to human language analysis that considers constructions, themselves being paradigms of language-in-use, as the first units. Unlike other construc- tional approaches, constructive linguists apply formalisms in understand- ing linguistic phenomena. The adpositional paradigm is the most devel- oped formalism in constructive linguistics, which is understandable by humans and machine-readable at the same time. The term ‘constructive’ should also be understood in formal terms, as the adpositional paradigm is based on constructive mathematics, and in particular on topos-theory. From a theoretical perspective, CxAdGrams describe human languages in terms of constructions, described adpositional trees (in short, adtrees). This paper aims to explain why such an interpretation of constructions in terms of adtrees can be useful for a deeper understanding of phrase- ology. Esperanto is the case study chosen so to give an empirical base to CxAdGrams. In particular, we illustrate the problematisation of its phraseology as well as the advantages of Esperanto in setting up workable prototypes in a short time. Keywords: Constructive Linguistics Computational Phraseology Ad- · · positional Grammars Adpositional Argumentation Esperanto. · · 1 Introduction Understanding how language is structured is one of the most fascinating and challenging endevour that human beings have ever done.
    [Show full text]
  • Phraseology in the Language, in the Dictionary, and in the Computer
    DOI 10.1515/phras-2012-0003 YoP 2012; 3: 31–56 Igor Mel’čuk Phraseology in the language, in the dictionary, and in the computer Abstract : Two main families of phrasemes (= non-free phrases) are distinguish ed: lexical phrasemes and semantic-lexical phrasemes; the phrasemes of the first family are constrained only in their form (their meaning being free), those of the second family are constrained both in their meaning and in their form. Two basic concepts are introduced: compositionality of complex linguistic signs and the pivot of a meaning. Three major classes of phrasemes are presented: non- compositional idioms and compositional collocations and clichés . A new type of general dictionary is proposed, and the lexicographic presentation of the three classes of phrasemes is illustrated. To show how the proposed approach to phraseology can be used in Automatic Language Processing, three fully-fledged examples are examined in detail. Keywords: phraseology; compositionality; pivot (of a meaning); idioms; colloca- tions; clichés; lexicography; automatic language processing. Correspondence address: [email protected] 1 Introduction There is no need to insist on the importance of phraseology for linguistic studies; on this point the linguistic community is in agreement. But, curiously and unfor- tunately, there is no agreement on either the exact content of the notion ‘phra- seology’, nor on the way phraseological expressions should be described, nor on how they should be treated in linguistic applications, in particular, in lexi- cography
    [Show full text]
  • Phraseological Meaning As a Mechanism of Cultural Memory1
    422 Irina V. Zykova Phraseological Meaning as a Mechanism of Cultural Memory 423 of science such as sociology, philosophy, psychology, computer science, semiotics, cognitive science, anthropology and some others. As a result, phraseology has been equipped with new theories and new successfully applied methods of research, with many different kinds of data prompt- Irina V. Zykova ing a new outlook on classical or traditional issues in phraseology as well Institute of Linguistics of the Russian Academy of Sciences as casting light on the problems that remained controversial and / or Russia unresolved for quite a long time. Phraseology has always been developing as a discipline of many inter- connected approaches and the 1990s saw the rise of a new, very promising Phraseological Meaning but at the same time quite challenging approach to the investigation of as a Mechanism of Cultural Memory1 phraseologisms – the linguoculturological study of phraseology. Nowa- days the research on phraseology as a culture-bound phenomenon unites linguists from all over the world: Piirainen (2008, 2011), Colson (2008), Sab- ban (2008), Pamies (2011), Szerszunowicz (2011), Holandi (2011), Al Jallad abstract: One of the questions that arise within the framework of linguocultu- (2011), Arboe (2011) and many others. Recent international phraseologi- rological studies is what enables phraseologisms to be a store of a huge amount of cultural knowledge or cultural information. Proceeding from the notion of cal conferences held by EUROPHRAS are quite indicative of the growing “cultural memory” we set out to show that the phraseological meaning is a com- interest in the approach in question, for instance, “Cross-linguistic and plex structure including semantic information and its conceptual basis.
    [Show full text]
  • Stylistic Use of Phraseological Units in Discourse
    Stylistic Use of Phraseological Units in Discourse Anita Naciscione John Benjamins Publishing Company Stylistic Use of Phraseological Units in Discourse Stylistic Use of Phraseological Units in Discourse Anita Naciscione John Benjamins Publishing Company Amsterdam / Philadelphia TM The paper used in this publication meets the minimum requirements of 8 American National Standard for Information Sciences – Permanence of Paper for Printed Library Materials, ansi z39.48-1984. doi: 10.1075/z.159 Library of Congress Cataloging-in-Publication Data Naciscione, Anita. Stylistic use of phraseological units in discourse / Anita Naciscione. p. cm. Includes bibliographical references and index. 1. English language--Rhetoric. 2. English language--Style. 3. English language--Terms and phrases. 4. Discourse analysis. I. Title. PE1442.N35 2010 808'.0014--dc22 2010028218 isbn 978 90 272 1176 7 (hb; alk. paper) isbn 978 90 272 8769 4 (Eb) An electronic version of this book is freely available, thanks to the support of libraries working with Knowledge Unlatched. KU is a collaborative initiative designed to make high quality books Open Access for the public good. The Open Access isbn for this book is 978 90 272 8769 4. © 2010 – John Benjamins B.V. This e-book is licensed under a Creative Commons CC BY-NC-ND license. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc-nd/4.0/. For any use beyond this license, please contact the publisher. John Benjamins Publishing Co. • P.O. Box 36224 • 1033 me Amsterdam • The Netherlands https://benjamins.com TM The paper used in this publication meets the minimum requirements of 8 American National Standard for Information Sciences – Permanence of Paper for Printed Library Materials, ansi z39.48-1984.
    [Show full text]
  • English Phraseology: Cognitive, Symbolic and Terminological Aspects (Based on Idioms with Colour Adjectives Black/White/Red)
    Arab World English Journal (AWEJ) Volume. 8 Number 4 December 2017 Pp. 137 -147 DOI: https://dx.doi.org/10.24093/awej/vol8no4.9 English Phraseology: Cognitive, Symbolic and Terminological Aspects (Based on Idioms with Colour Adjectives Black/White/Red) Natalya N. Zerkina Foreign Languages for Engineering Chair, Institute of Humanities, Nosov Magnitogorsk State Technical University, Magnitogorsk, Russia Yekaterina A. Lomakina Linguistics and Literature Chair, Institute of Humanities, Nosov Magnitogorsk State Technical University, Magnitogorsk, Russia Natalja V. Kozhushkova Department of Sociology, Document Science and Archive Science, Institute of Humanities, Nosov Magnitogorsk State Technical University Magnitogorsk, Russia Abstract The authors focus on idioms as means of categorization of the world and means of keeping temporal and spatial cultural-historical data transmission. Special attention is given to the symbolic meaning of idiom components. Aspects of English phraseology are analyzed by emphasizing phraseological antonymy as an important linguistic universal that is pointed out as binary structures playing an important role in cognition by participating in cognitive and structural processes. The analysis considers explicit and implicit representation of the concept of ‘white’, its semiotic and symbolic meanings and its psychical effect on a human being. The article focuses on the process of phrase-forming as a language phenomenon and an efficient means of term-formation in the English language. The authors discusses possibility of using and reproducing idioms with the phraseological model adjective + noun and forecasts further term-formation according to the phraseological model. The article is aimed at showing interaction of linguistic and extra linguistic aspects within an idiom, as a linguistic unit, and the way the linguistic aspects transform into extra linguistic ones and vice versa.
    [Show full text]
  • Etymology of English Idioms S.M
    ETYMOLOGY OF ENGLISH IDIOMS S.M. Bobrovnyk Національний технічний університет України “КПІ” The article deals with the etymology of English idioms. It reveals the notion of this phenomenon and its reflection in the English language. It researches the fields of this concept, defines the meaning of an idiom and the meaning of the term “etymology” and the origin of English idioms. Idioms are social and cultural phenomena that reflect social psychological characteristic of behavior, they relate to the communicative human behavior which implies norms, rules and traditions of communication of this or that lingua-cultural community. One of the most important aspects in the study of the English language is the study of English idioms. In any language there are expressions in which the individual words that make up this expression lose their original meaning. Everyone strives to express ideas and thoughts expressively and it is impossible without idioms which are present in every language of the world. They strengthen the speech and improve understanding cultural norms of the community. This article is undoubtedly relevant as it contains idioms of the modern English language which reveal the beauty of the language. And a great significance is given to the etymology of the idioms’ origin, as it is easier to learn and understand the idiom when you know the origin of it. Key words: idiom, etymology, origin, expression, meaning, community, activity. Introduction. One of the most important aspects in the study of the English language is the study of English idioms. In any language there are expressions in which the individual words that make up this expression lose their original meaning.
    [Show full text]