Verbs of Letting in Germanic and Romance Languages a Quantitative Investigation Based on a Parallel Corpus of Film Subtitles
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
An Automatic Part-Of-Speech Tagger for Middle Low German
An automatic part-of-speech tagger for Middle Low German Mariya Koleva, Melissa Farasyn, Bart Desmet, Anne Breitbarth and Véronique Hoste Ghent University Syntactically annotated corpora are highly important for enabling large-scale diachronic and diatopic language research. Such corpora have recently been developed for a variety of historical languages, or are still under development. One of those under development is the fully tagged and parsed Corpus of Historical Low German (CHLG), which is aimed at facilitating research into the highly under-researched diachronic syntax of Low German. The present paper reports on a crucial step in creating the corpus, viz. the creation of a part-of-speech tagger for Middle Low German (MLG). Having been transmitted in several non-standardised written varieties, MLG poses a challenge to standard POS taggers, which usually rely on normalized spelling. We outline the major issues faced in the creation of the tagger and present our solutions to them. Keywords: historical linguistics, part-of-speech tagging, conditional random fields, feature selection, normalization 1. Introduction Corpora of historical texts annotated with different levels of grammatical information, such as parts of speech, (inflectional) morphology, syntactic chunks, clausal syntax, provide an important resource for studies of diachronic syntactic variation and change (e.g. Kroch et al. 2000, Rögnvaldsson & Helgadóttir 2011). They enable the automatic extraction of syntactic information from historical texts (more than is manually possible), and allow making statistically valid observations. Apart from reducing the amount of time required for data retrieval, an important advantage is that they make research testable and replicable. The Corpus of Historical Low German (CHLG) (Breitbarth et al. -
Review of "The Romance Languages" by R. Posner
Swarthmore College Works Linguistics Faculty Works Linguistics 1998 Review Of "The Romance Languages" By R. Posner Donna Jo Napoli Swarthmore College, [email protected] Follow this and additional works at: https://works.swarthmore.edu/fac-linguistics Part of the Linguistics Commons Let us know how access to these works benefits ouy Recommended Citation Donna Jo Napoli. (1998). "Review Of "The Romance Languages" By R. Posner". Journal Of Linguistics. Volume 34, Issue 1. 302-303. https://works.swarthmore.edu/fac-linguistics/10 This work is brought to you for free by Swarthmore College Libraries' Works. It has been accepted for inclusion in Linguistics Faculty Works by an authorized administrator of Works. For more information, please contact [email protected]. The Romance Languages by Rebecca Posner Review by: Donna Jo Napoli Journal of Linguistics, Vol. 34, No. 1 (Mar., 1998), pp. 302-303 Published by: Cambridge University Press Stable URL: http://www.jstor.org/stable/4176472 . Accessed: 17/07/2014 15:00 Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at . http://www.jstor.org/page/info/about/policies/terms.jsp . JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. Cambridge University Press is collaborating with JSTOR to digitize, preserve and extend access to Journal of Linguistics. http://www.jstor.org This content downloaded from 130.58.65.13 on Thu, 17 Jul 2014 15:00:57 PM All use subject to JSTOR Terms and Conditions JOURNAL OF LINGUISTICS cliticin objectposition when quantification is involvedand the presenceof the resumptiveclitic when thereis no quantification,even if a wh-phraseis involved. -
Gramley 2009
1 The Origins of English. 1. The origins of human language. According to some calculations the capacity for language – which is surely one of the most clearly human features we have – emerged approximately 145,000 years ago (± 70,000) (Bickerton 1990: 175). The emergence of human speech depended on both suitable physiological change in what were to become the organs of speech and on changes in the structure of the brain to allow humans to work with the complexity of language neurologically (ibid.: chap. 8). Furthermore, widespread opinion (e.g. Bickerton 1990: 4; Bloomfield 1933: 3; Chomsky 1968: 100; Diamond 1992: 141; Sapir 1921: 23) sees the acquisition of language as a unique phenomenon. As such it was then passed on to the descendants of the first group of speakers. Just how this mooted first language may have looked in detail is unknown, but the multiplicity and diversity of languages spoken in today’s world indicate one of the unchanging principles of human language – change: out of one many have developed. One of these many languages is English, itself a grouping of often very different varieties spoken all over the world by both native and non-native speakers. (Just how many speakers is a widely debated question, as is the question of what a native and what a non-native speakers is. See the discussion in chapters 7 and 13.) It is the aim of this book to explore how English came into being and developed the enormous amount of diversity which the label English covers. 2. Divergence, change, and the family model. -
Germanic and Romance. Probing the Similarities and Differences
Germanic and Romance. Probing the similarities and differences There is a large literature in the field of comparative and historical syntax of drawing comparison between Germanic and Romance varieties. This includes a particular tradition which argues that the earlier languages were more alike than their present day counterparts (see in particular Adams 1989; Fontana 1993; Mathieu 2007 and Franco 2009). The most prominent example of this similarity is the Verb Second constraint, which is characteristic of many early and contemporary Germanic languages (see Vikner 1995 and Walkden 2014) and argued to be operative in Medieval Romance (Thurneysen 1892 et seq.). The workshop will aim to develop a more nuanced understanding of both the parallels and points of contrast between these two families, through synchronic comparison of phenomena in previous historical stages and diachronic consideration of the relevant pathways of change. The time is right for such an exercise on several grounds. First, research in recent decades has equipped the historical linguist with a range of large-scale corpora for both Germanic and Romance (see, for example, in the case of Romance the Tycho Brahe Parsed Corpus of Historical Portuguese, the Base de Français Médiéval and the Opera del Vocabolario Italiano alongside the Penn Parsed Corpus of Historical English, the Icelandic Parsed Historical Corpus and the Corpus of Historical Low German for Germanic). This affords a methodologically more robust basis for comparison than has previously been possible empirically. Second, a more nuanced understanding has been reached in recent years of previously little-reported variation amongst the early Germanic and Romance varieties (see Walkden 2014 and Wolfe 2015), which has so far not been extensively exploited for comparison between Germanic and Romance. -
CHAPTER SEVENTEEN History of the German Language 1 Indo
CHAPTER SEVENTEEN History of the German Language 1 Indo-European and Germanic Background Indo-European Background It has already been mentioned in this course that German and English are related languages. Two languages can be related to each other in much the same way that two people can be related to each other. If two people share a common ancestor, say their mother or their great-grandfather, then they are genetically related. Similarly, German and English are genetically related because they share a common ancestor, a language which was spoken in what is now northern Germany sometime before the Angles and the Saxons migrated to England. We do not have written records of this language, unfortunately, but we have a good idea of what it must have looked and sounded like. We have arrived at our conclusions as to what it looked and sounded like by comparing the sounds of words and morphemes in earlier written stages of English and German (and Dutch) and in modern-day English and German dialects. As a result of the comparisons we are able to reconstruct what the original language, called a proto-language, must have been like. This particular proto-language is usually referred to as Proto-West Germanic. The method of reconstruction based on comparison is called the comparative method. If faced with two languages the comparative method can tell us one of three things: 1) the two languages are related in that both are descended from a common ancestor, e.g. German and English, 2) the two are related in that one is the ancestor of the other, e.g. -
The Germanic Third Weak Class
THE GERMANIC THIRD WEAK CLASS JAY H. JASANOFF Harvard University Germanic verbs of the 3rd weak class form presents characterized by an alternation between predesinential *ai (e.g. Go. 3sg. habai1» and *a (e.g. Ipl. habam). These verbs are usually compared with the 'e-verbs' of Italic and Balto Slavic, but no IE present built on the stative suffix *-e- will account phonologi cally for the form of the suffix in Germanic. Instead, it can be shown that the characteristic Germanic paradigm results from the 'activization' of an older middle paradigm in which a 3sg. in *-ai « IE *-oi; cf. Skt. duh~ 'milks') was further suffixed by the productive active ending *-Pi « IE *-ti). 1. The inflection of the third class of weak verbs (exemplified by the verb 'to have': Gothic haban, Old Norse hafa,! Old High German hab~, Old Saxon hebbian, Old English habban) presents one of the classic problems in the historical morphology of Germanic. Not only do the verbs of this class show peculiarities in all the older Germanic languages, but they differ remarkably in their conjugation from one language to another, so that it is not at all obvious how the Common Germanic paradigm should be reconstructed. Given this diversity of forms, we will do well to begin with a short review of the morphological facts themselves. The situation is simplest in Old High German. The entire conjugation of hab~n is athematic (to the extent that this term still has any meaning), and is based on the single stem hab~-: 1sg. hab~, 3sg. hab~t, 3pl. -
Teaching English-Spanish Cognates Using the Texas 2X2 Picture Book Reading Lists
TEACHING ENGLISH-SPANISH COGNATES USING THE TEXAS 2X2 PICTURE BOOK READING LISTS JOSÉ A. MONTELONGO, ANITA C. HERNÁNDEZ, & ROBERTA J. HERTER ABSTRACT English-Spanish cognates are words that possess identical or nearly identical spellings and meanings in both English and Spanish as a result of being derived mainly from Latin and Greek. Of major importance is the fact that many of the more than 20,000 cognates in English are academic vocabulary words, terms essential for comprehending school texts. The Texas 2x2 Reading List is a list of recommended reading books for children ranging in ages from pre-school to the early primary grades. The list is published yearly by the Children’s Round Table, a division of the Texas Library Association. The books that comprise the Texas 2x2 Reading List are a rich source of vocabulary and contain many English-Spanish cognates. Teachers can use the Texas 2x2 picture books to create a cognate vocabulary lesson that can be taught as a companion to a picture book read- aloud. The purpose of this paper is to present some of the different types of cognate vocabulary lessons that may be created to accompany a picture book read-aloud. The lessons are based on the morphological and spelling regularities between English and Spanish cognates and can be used to teach students how to convert words from one language to another. Examples of the different types of regularities and the Texas 2x2 books that contain them are included, as is an example cognate vocabulary lesson plan to accompany the picture book, Oddrey (Whamond, 2012). -
A Penn-Style Treebank of Middle Low German
A Penn-style Treebank of Middle Low German Hannah Booth Joint work with Anne Breitbarth, Aaron Ecay & Melissa Farasyn Ghent University 12th December, 2019 1 / 47 Context I Diachronic parsed corpora now exist for a range of languages: I English (Taylor et al., 2003; Kroch & Taylor, 2000) I Icelandic (Wallenberg et al., 2011) I French (Martineau et al., 2010) I Portuguese (Galves et al., 2017) I Irish (Lash, 2014) I Have greatly enhanced our understanding of syntactic change: I Quantitative studies of syntactic phenomena over time I Findings which have a strong empirical basis and are (somewhat) reproducible 2 / 47 Context I Corpus of Historical Low German (‘CHLG’) I Anne Breitbarth (Gent) I Sheila Watts (Cambridge) I George Walkden (Konstanz) I Parsed corpus spanning: I Old Low German/Old Saxon (c.800-1050) I Middle Low German (c.1250-1600) I OLG component already available: HeliPaD (Walkden, 2016) I 46,067 words I Heliand text I MLG component currently under development 3 / 47 What is Middle Low German? I MLG = West Germanic scribal dialects in Northern Germany and North-Eastern Netherlands 4 / 47 What is Middle Low German? I The rise and fall of (written) Low German I Pre-800: pre-historical I c.800-1050: Old Low German/Old Saxon I c.1050-1250 Attestation gap (Latin) I c.1250-1370: Early MLG I c.1370-1520: ‘Classical MLG’ (Golden Age) I c.1520-1850: transition to HG as in written domain I c.1850-today: transition to HG in spoken domain 5 / 47 What is Middle Low German? I Hanseatic League: alliance between North German towns and trade outposts abroad to promote economic and diplomatic interests (13th-15th centuries) 6 / 47 What is Middle Low German? I LG served as lingua franca for supraregional communication I High prestige across North Sea and Baltic regions I Associated with trade and economic prosperity I Linguistic legacy I Huge amounts of linguistic borrowings in e.g. -
Old Frisian, an Introduction To
An Introduction to Old Frisian An Introduction to Old Frisian History, Grammar, Reader, Glossary Rolf H. Bremmer, Jr. University of Leiden John Benjamins Publishing Company Amsterdam / Philadelphia TM The paper used in this publication meets the minimum requirements of 8 American National Standard for Information Sciences — Permanence of Paper for Printed Library Materials, ANSI Z39.48-1984. Library of Congress Cataloging-in-Publication Data Bremmer, Rolf H. (Rolf Hendrik), 1950- An introduction to Old Frisian : history, grammar, reader, glossary / Rolf H. Bremmer, Jr. p. cm. Includes bibliographical references and index. 1. Frisian language--To 1500--Grammar. 2. Frisian language--To 1500--History. 3. Frisian language--To 1550--Texts. I. Title. PF1421.B74 2009 439’.2--dc22 2008045390 isbn 978 90 272 3255 7 (Hb; alk. paper) isbn 978 90 272 3256 4 (Pb; alk. paper) © 2009 – John Benjamins B.V. No part of this book may be reproduced in any form, by print, photoprint, microfilm, or any other means, without written permission from the publisher. John Benjamins Publishing Co. · P.O. Box 36224 · 1020 me Amsterdam · The Netherlands John Benjamins North America · P.O. Box 27519 · Philadelphia pa 19118-0519 · usa Table of contents Preface ix chapter i History: The when, where and what of Old Frisian 1 The Frisians. A short history (§§1–8); Texts and manuscripts (§§9–14); Language (§§15–18); The scope of Old Frisian studies (§§19–21) chapter ii Phonology: The sounds of Old Frisian 21 A. Introductory remarks (§§22–27): Spelling and pronunciation (§§22–23); Axioms and method (§§24–25); West Germanic vowel inventory (§26); A common West Germanic sound-change: gemination (§27) B. -
The Romance Advantage — the Significance of the Romance Languages As a Pathway to Multilingualism
ISSN 1799-2591 Theory and Practice in Language Studies, Vol. 8, No. 10, pp. 1253-1260, October 2018 DOI: http://dx.doi.org/10.17507/tpls.0810.01 The Romance Advantage — The Significance of the Romance Languages as a Pathway to Multilingualism Kathleen Stein-Smith Fairleigh Dickinson University, Metropolitan Campus, Teaneck, NJ, USA Abstract—As 41M in the US speak a Romance language in the home, it is necessary to personally and professionally empower L1 speakers of a Romance language through acquisition of one or more additional Romance languages. The challenge is that Romance language speakers, parents, and communities may be unaware of both the advantages of bilingual and multilingual skills and also of the relative ease in developing proficiency, and even fluency, in a second or third closely related language. In order for students to maximize their Romance language skills, it is essential for parents, educators, and other language stakeholders to work together to increase awareness, to develop curriculum, and to provide teacher training -- especially for Spanish-speakers, who form the vast majority of L1 Romance language speakers in the US, to learn additional Romance languages. Index Terms—romance languages, bilingual education, multilingualism, foreign language learning, romance advantage I. INTRODUCTION The Romance languages, generally considered to be French, Spanish, Italian, Portuguese, and Romanian, and in addition, regional languages including Occitan and Catalan, developed from Latin over a significant period of time and across a considerable geographic area. The beginnings of the Romance languages can be traced to the disappearance of the Roman Empire, along with Latin, its lingua franca. -
Latin Language and Latin Culture
Latin Language and Latin Culture from ancient to modern times Joseph Farrell University of Pennsylvania ab published by the press syndicate of the university of cambridge The Pitt Building, Trumpington Street, Cambridge, United Kingdom cambridge university press The Edinburgh Building, Cambridge cb22ru, UK www.cup.cam.ac.uk 40 West 20th Street, New York, ny 10011±4211, USA www.cup.org 10 Stamford Road, Oakleigh, Melbourne 3166, Australia Ruiz de AlarcoÂn 13, 28014 Madrid, Spain ( Cambridge University Press 2001 This book is in copyright. Subject to statutory exception and to the provisions of relevant collective licensing agreements, no reproduction of any part may take place without the written permission of Cambridge University Press. First published 2001 Printed in the United Kingdom at the University Press, Cambridge 1 Typeset in 92 /12 Times New Roman and New Hellenic Greek on `3B2' [ao] A catalogue record for this book is available from the British Library isbn 0 521 77223 0 hardback isbn 0 521 77663 5 paperback Contents Preface page xi 1 The nature of Latin culture 1 2 The poverty of our ancestral speech 28 3 The gender of Latin 52 4 The life cycle of dead languages 84 5 The voices of Latin culture 113 Appendix: Nepos fr. 59 in the edition of Marshall (1977) 134 Bibliography 135 Index 141 ix CHAPTER 1 The nature of Latin culture Coming to Latin culture At the end of Virgil's Aeneid there occurs an episode in which the goddess Juno ®nally agrees to stop ®ghting. Her position, however, is far from abject. -
Romance Languages (Bryn Mawr) 1
Romance Languages (Bryn Mawr) 1 Spanish ROMANCE LANGUAGES (BRYN SPAN B102, SPAN B120. Four courses at the 200 MAWR) level. Two courses at the 300 level. Second Language and Literature Department Website: French https://www.brynmawr.edu/romance FREN B101-FREN B102 or FREN B101-FREN B105; or FREN B005-FREN B102 or FREN B005-FREN B105. The Departments of French and Francophone Two literature courses at the 200 level. FREN B260 Studies, Italian, and Spanish cooperate in offering (BMC) or FREN H212 (HC). One course at the 300 a major in Romance Languages that requires level. advanced work in at least two Romance languages and literatures. Additional work in a third language Italian and literature is suggested. ITAL B101, ITAL B102. Two literature courses at the 200 level. Two literature courses at the 300 level. Major Requirements Spanish The requirements for the major are a minimum SPAN B102, SPAN B120. Two courses at the 200 of nine courses, including the Senior Conference level. Two courses at the 300 level. and/or Senior Essay, described below, in the first language and literature and six courses in the In addition to the coursework described above, when second language and literature, including the the first language and literature is Spanish, majors Senior Conference in French, if French is selected as in Romance Languages must enroll in SPAN B398 * second. Students should consult with their advisors (Senior Seminar). no later than their sophomore year in order to select courses in the various departments that complement When French is chosen as either the first or second each other.