A Corpus-Assisted Discourse Study of Kulturkampf in German, Polish and English
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Talk Bank: a Multimodal Database of Communicative Interaction
Talk Bank: A Multimodal Database of Communicative Interaction 1. Overview The ongoing growth in computer power and connectivity has led to dramatic changes in the methodology of science and engineering. By stimulating fundamental theoretical discoveries in the analysis of semistructured data, we can to extend these methodological advances to the social and behavioral sciences. Specifically, we propose the construction of a major new tool for the social sciences, called TalkBank. The goal of TalkBank is the creation of a distributed, web- based data archiving system for transcribed video and audio data on communicative interactions. We will develop an XML-based annotation framework called Codon to serve as the formal specification for data in TalkBank. Tools will be created for the entry of new and existing data into the Codon format; transcriptions will be linked to speech and video; and there will be extensive support for collaborative commentary from competing perspectives. The TalkBank project will establish a framework that will facilitate the development of a distributed system of allied databases based on a common set of computational tools. Instead of attempting to impose a single uniform standard for coding and annotation, we will promote annotational pluralism within the framework of the abstraction layer provided by Codon. This representation will use labeled acyclic digraphs to support translation between the various annotation systems required for specific sub-disciplines. There will be no attempt to promote any single annotation scheme over others. Instead, by promoting comparison and translation between schemes, we will allow individual users to select the custom annotation scheme most appropriate for their purposes. -
Arabic Corpus and Word Sketches
Journal of King Saud University – Computer and Information Sciences (2014) 26, 357–371 King Saud University Journal of King Saud University – Computer and Information Sciences www.ksu.edu.sa www.sciencedirect.com arTenTen: Arabic Corpus and Word Sketches Tressy Arts a, Yonatan Belinkov b, Nizar Habash c,*, Adam Kilgarriff d, Vit Suchomel e,d a Chief Editor Oxford Arabic Dictionary, UK b MIT, USA c New York University Abu Dhabi, United Arab Emirates d Lexical Computing Ltd, UK e Masaryk Univ., Czech Republic Available online 7 October 2014 KEYWORDS Abstract We present arTenTen, a web-crawled corpus of Arabic, gathered in 2012. arTenTen con- Corpora; sists of 5.8-billion words. A chunk of it has been lemmatized and part-of-speech (POS) tagged with Lexicography; the MADA tool and subsequently loaded into Sketch Engine, a leading corpus query tool, where it Morphology; is open for all to use. We have also created ‘word sketches’: one-page, automatic, corpus-derived Concordance; summaries of a word’s grammatical and collocational behavior. We use examples to demonstrate Arabic what the corpus can show us regarding Arabic words and phrases and how this can support lexi- cography and inform linguistic research. The article also presents the ‘sketch grammar’ (the basis for the word sketches) in detail, describes the process of building and processing the corpus, and considers the role of the corpus in additional research on Arabic. Ó 2014 Production and hosting by Elsevier B.V. on behalf of King Saud University. 1. Introduction ber of the TenTen Corpus Family (Jakubı´cˇek et al., 2013). -
An Ethnographic Study of the Second Generation German
AN ETHNOGRAPHIC STUDY OF THE SECOND GENERATION GERMAN AMERICANS LEAVING THEIR MARK ON U.S. HIGHER EDUCATION A dissertation submitted by Nicole Ruscheinski Herion to Benedictine University in partial fulfillment of the requirements for the degree of Doctor of Education in Higher Education and Organizational Change Benedictine University April 2016 Copyright by Nicole Ruscheinski Herion, 2016 All rights reserved ACKNOWLEDGEMENTS When I began contemplating what I would write for my dissertation, I wanted to write something that would contribute to the field of higher education and make a lasting footprint on my cultural background. The words “never forget where you came from” kept ringing in my ears. I must thank Dr. Antonina Lukenchuk for helping me define and focus a concept that flourished and came to life over the past three years. I want to thank God for showing his grace and mercy during times of confusion, trouble, and misunderstanding throughout a long and laborious dissertation process. God is good and he truly allowed for this dream to become a reality. It will go down as one of the biggest accomplishments in my life. The hours that it takes to fine tune and go through a project like this are unimaginable to some; however, Dr. Sunil Chand, Dr. Kathy Sexton-Radek, and Dr. Antonina Lukenchuk were the best of mentors to me and spent countless hours helping me so I could produce a product I would be proud of. I will never forget the time and energy you devoted to getting me to this place. During the second year of this program, both my Omi and Tata passed away. -
Twenty-Four Conservative-Liberal Thinkers Part I Hannes H
Hannes H. Gissurarson Twenty-Four Conservative-Liberal Thinkers Part I Hannes H. Gissurarson Twenty-Four Conservative-Liberal Thinkers Part I New Direction MMXX CONTENTS Hannes H. Gissurarson is Professor of Politics at the University of Iceland and Director of Research at RNH, the Icelandic Research Centre for Innovation and Economic Growth. The author of several books in Icelandic, English and Swedish, he has been on the governing boards of the Central Bank of Iceland and the Mont Pelerin Society and a Visiting Scholar at Stanford, UCLA, LUISS, George Mason and other universities. He holds a D.Phil. in Politics from Oxford University and a B.A. and an M.A. in History and Philosophy from the University of Iceland. Introduction 7 Snorri Sturluson (1179–1241) 13 St. Thomas Aquinas (1225–1274) 35 John Locke (1632–1704) 57 David Hume (1711–1776) 83 Adam Smith (1723–1790) 103 Edmund Burke (1729–1797) 129 Founded by Margaret Thatcher in 2009 as the intellectual Anders Chydenius (1729–1803) 163 hub of European Conservatism, New Direction has established academic networks across Europe and research Benjamin Constant (1767–1830) 185 partnerships throughout the world. Frédéric Bastiat (1801–1850) 215 Alexis de Tocqueville (1805–1859) 243 Herbert Spencer (1820–1903) 281 New Direction is registered in Belgium as a not-for-profit organisation and is partly funded by the European Parliament. Registered Office: Rue du Trône, 4, 1000 Brussels, Belgium President: Tomasz Poręba MEP Executive Director: Witold de Chevilly Lord Acton (1834–1902) 313 The European Parliament and New Direction assume no responsibility for the opinions expressed in this publication. -
Collection of Usage Information for Language Resources from Academic Articles
Collection of Usage Information for Language Resources from Academic Articles Shunsuke Kozaway, Hitomi Tohyamayy, Kiyotaka Uchimotoyyy, Shigeki Matsubaray yGraduate School of Information Science, Nagoya University yyInformation Technology Center, Nagoya University Furo-cho, Chikusa-ku, Nagoya, 464-8601, Japan yyyNational Institute of Information and Communications Technology 4-2-1 Nukui-Kitamachi, Koganei, Tokyo, 184-8795, Japan fkozawa,[email protected], [email protected], [email protected] Abstract Recently, language resources (LRs) are becoming indispensable for linguistic researches. However, existing LRs are often not fully utilized because their variety of usage is not well known, indicating that their intrinsic value is not recognized very well either. Regarding this issue, lists of usage information might improve LR searches and lead to their efficient use. In this research, therefore, we collect a list of usage information for each LR from academic articles to promote the efficient utilization of LRs. This paper proposes to construct a text corpus annotated with usage information (UI corpus). In particular, we automatically extract sentences containing LR names from academic articles. Then, the extracted sentences are annotated with usage information by two annotators in a cascaded manner. We show that the UI corpus contributes to efficient LR searches by combining the UI corpus with a metadata database of LRs and comparing the number of LRs retrieved with and without the UI corpus. 1. Introduction Thesaurus1. In recent years, such language resources (LRs) as corpora • He also employed Roget’s Thesaurus in 100 words of and dictionaries are being widely used for research in the window to implement WSD. -
A Language That Forgot Itself Tomasz Kamusella*
Journal on Ethnopolitics and Minority Issues in Europe Vol 13, No 4, 2014, 129-138 Copyright © ECMI 2014 This article is located at: http://www.ecmi.de/fileadmin/downloads/publications/JEMIE/2014/Kamusella.pdf A Language that Forgot Itself Tomasz Kamusella* University of St Andrews In this essay, as a background, I reflect on how the German language was liquidated in post-1945 Poland’s region of Upper Silesia where nowadays the country’s German minority is concentrated. But the main focus is on the irony that neither this language, nor a genuine German minority education system has been revived during the last quarter of a century that has elapsed since the fall of communism in 1989, despite promises to the contrary. The mystery persists. In the entry devoted to Poland in the respectable reference on the languages of the world, the Ethnologue, the number of native speakers of German living in the country is conservatively estimated at half a million.1 Until well into the 1990s German sources spoke of one million or a million and a half Germans in Poland. It is noted that the region where they live in compact areas of settlement is the countryside of Upper Silesia. But in the region the local Germans overwhelmingly communicate in the Silesian language. Silesia Superioris, Oberschlesien, Haute-Silésie, Horní Slezsko, Górny Śląsk, Felső-Szilézia—it is known by so many names, as many homelands in Central Europe were before the powers-that-be minced and fitted this part of the continent into the unbecomingly tight and too-small pantyhose of national polities, each so pure, homogenous through and through, so painfully monolingual. -
Estonian Yiddish and Its Contacts with Coterritorial Languages
DISSERT ATIONES LINGUISTICAE UNIVERSITATIS TARTUENSIS 1 ESTONIAN YIDDISH AND ITS CONTACTS WITH COTERRITORIAL LANGUAGES ANNA VERSCHIK TARTU 2000 DISSERTATIONES LINGUISTICAE UNIVERSITATIS TARTUENSIS DISSERTATIONES LINGUISTICAE UNIVERSITATIS TARTUENSIS 1 ESTONIAN YIDDISH AND ITS CONTACTS WITH COTERRITORIAL LANGUAGES Eesti jidiš ja selle kontaktid Eestis kõneldavate keeltega ANNA VERSCHIK TARTU UNIVERSITY PRESS Department of Estonian and Finno-Ugric Linguistics, Faculty of Philosophy, University o f Tartu, Tartu, Estonia Dissertation is accepted for the commencement of the degree of Doctor of Philosophy (in general linguistics) on December 22, 1999 by the Doctoral Committee of the Department of Estonian and Finno-Ugric Linguistics, Faculty of Philosophy, University of Tartu Supervisor: Prof. Tapani Harviainen (University of Helsinki) Opponents: Professor Neil Jacobs, Ohio State University, USA Dr. Kristiina Ross, assistant director for research, Institute of the Estonian Language, Tallinn Commencement: March 14, 2000 © Anna Verschik, 2000 Tartu Ülikooli Kirjastuse trükikoda Tiigi 78, Tartu 50410 Tellimus nr. 53 ...Yes, Ashkenazi Jews can live without Yiddish but I fail to see what the benefits thereof might be. (May God preserve us from having to live without all the things we could live without). J. Fishman (1985a: 216) [In Estland] gibt es heutzutage unter den Germanisten keinen Forscher, der sich ernst für das Jiddische interesiere, so daß die lokale jiddische Mundart vielleicht verschwinden wird, ohne daß man sie für die Wissen schaftfixiert -
The German Identity Op Mennonite Brethren Immigrants in Canada, 1930-1960
THE GERMAN IDENTITY OP MENNONITE BRETHREN IMMIGRANTS IN CANADA, 1930-1960 by BENJAMIN WALL REDEKOP B.A., Fresno Pacific College, 1985 A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF HISTORY in THE FACULTY OF GRADUATE STUDIES DEPARTMENT OF HISTORY We accept this thesis as conforming to the required standard THE UNIVERSITY OF BRITISH COLUMBIA September 1990 ©BENJAMIN WALL REDEKOP, 1990 In presenting this thesis in partial fulfilment of the requirements for an advanced degree at the University of British Columbia, I agree that the Library shall make it freely available for reference and study. I further agree that permission for extensive copying of this thesis for scholarly purposes may be granted by the head of my department or by his or her representatives. It is understood that copying or publication of this thesis for financial gain shall not be allowed without my written permission. Department of l4i£4p/' The University of British Columbia Vancouver, Canada Date DE-6 (2/88) ii ABSTRACT Little scholarly research has been done on the function of Germanism among Mennonites who immigrated to Canada from Russia in the 1920's, and what has been done often relies on an oversimplified "desire for separation" to explain the phenomenon. At the same time, it has been argued that the enthusiasm for Nazi Germany among Mennonite immigrants in Canada is to be understood as part of a larger "Volks-German awakening". In fact, the Mennonite experience of brutal treatment during the Bolshevik Revolution, the economic conditions of the Great Depression, and assinflationist pressures from Canadian society put them in a naturally receptive position for the cultural, political and ethnic ideas associated with the "new Germany". -
From CHILDES to Talkbank
From CHILDES to TalkBank Brian MacWhinney Carnegie Mellon University MacWhinney, B. (2001). New developments in CHILDES. In A. Do, L. Domínguez & A. Johansen (Eds.), BUCLD 25: Proceedings of the 25th annual Boston University Conference on Language Development (pp. 458-468). Somerville, MA: Cascadilla. a similar article appeared as: MacWhinney, B. (2001). From CHILDES to TalkBank. In M. Almgren, A. Barreña, M. Ezeizaberrena, I. Idiazabal & B. MacWhinney (Eds.), Research on Child Language Acquisition (pp. 17-34). Somerville, MA: Cascadilla. Recent years have seen a phenomenal growth in computer power and connectivity. The computer on the desktop of the average academic researcher now has the power of room-size supercomputers of the 1980s. Using the Internet, we can connect in seconds to the other side of the world and transfer huge amounts of text, programs, audio and video. Our computers are equipped with programs that allow us to view, link, and modify this material without even having to think about programming. Nearly all of the major journals are now available in electronic form and the very nature of journals and publication is undergoing radical change. These new trends have led to dramatic advances in the methodology of science and engineering. However, the social and behavioral sciences have not shared fully in these advances. In large part, this is because the data used in the social sciences are not well- structured patterns of DNA sequences or atomic collisions in super colliders. Much of our data is based on the messy, ill-structured behaviors of humans as they participate in social interactions. Categorizing and coding these behaviors is an enormous task in itself. -
Jazykovedný Ústav Ľudovíta Štúra
JAZYKOVEDNÝ ÚSTAV ĽUDOVÍTA ŠTÚRA SLOVENSKEJ AKADÉMIE VIED 2 ROČNÍK 70, 2019 JAZYKOVEDNÝ ČASOPIS ________________________________________________________________VEDEcKÝ ČASOPIS PRE OTáZKY TEóRIE JAZYKA JOURNAL Of LINGUISTIcS ScIENTIfIc JOURNAL fOR ThE Theory Of LANGUAGE ________________________________________________________________ hlavná redaktorka/Editor-in-chief: doc. Mgr. Gabriela Múcsková, PhD. Výkonní redaktori/Managing Editors: PhDr. Ingrid Hrubaničová, PhD., Mgr. Miroslav Zumrík, PhD. Redakčná rada/Editorial Board: PhDr. Klára Buzássyová, CSc. (Bratislava), prof. PhDr. Juraj Dolník, DrSc. (Bratislava), PhDr. Ingrid Hrubaničová, PhD. (Bra tislava), doc. Mgr. Martina Ivanová, PhD. (Prešov), Mgr. Nicol Janočková, PhD. (Bratislava), Mgr. Alexandra Jarošová, CSc. (Bratislava), prof. PaedDr. Jana Kesselová, CSc. (Prešov), PhDr. Ľubor Králik, CSc. (Bratislava), PhDr. Viktor Krupa, DrSc. (Bratislava), doc. Mgr. Gabriela Múcsková, PhD. (Bratislava), Univ. Prof. Mag. Dr. Stefan Michael Newerkla (Viedeň – Ra- kúsko), Associate Prof. Mark Richard Lauersdorf, Ph.D. (Kentucky – USA), prof. Mgr. Martin Ološtiak, PhD. (Prešov), prof. PhDr. Slavomír Ondrejovič, DrSc. (Bratislava), prof. PaedDr. Vladimír Patráš, CSc. (Banská Bystrica), prof. PhDr. Ján Sabol, DrSc. (Košice), prof. PhDr. Juraj Vaňko, CSc. (Nitra), Mgr. Miroslav Zumrík, PhD. (Bratislava), prof. PhDr. Pavol Žigo, CSc. (Bratislava). Technický_______________________________________________________________ redaktor/Technical editor: Mgr. Vladimír Radik Vydáva/Published by: Jazykovedný -
The British National Corpus Revisited: Developing Parameters for Written BNC2014 Abi Hawtin (Lancaster University, UK)
The British National Corpus Revisited: Developing parameters for Written BNC2014 Abi Hawtin (Lancaster University, UK) 1. The British National Corpus 2014 project The ESRC Centre for Corpus Approaches to Social Science (CASS) at Lancaster University and Cambridge University Press are working together to create a new, publicly accessible corpus of contemporary British English. The corpus will be a successor to the British National Corpus, created in the early 1990s (BNC19941). The British National Corpus 2014 (BNC2014) will be of the same order of magnitude as BNC1994 (100 million words). It is currently projected that the corpus will reach completion in mid-2018. Creation of the spoken sub-section of BNC2014 is in its final stages; this paper focuses on the justification and development of the written sub- section of BNC2014. 2. Justification for BNC2014 There are now many web-crawled corpora of English, widely available to researchers, which contain far more data than will be included in Written BNC2014. For example, the English TenTen corpus (enTenTen) is a web-crawled corpus of Written English which currently contains approximately 19 billion words (Jakubíček et al., 2013). Web-crawling processes mean that this huge amount of data can be collected very quickly; Jakubíček et al. (2013: 125) note that “For a language where there is plenty of material available, we can gather, clean and de-duplicate a billion words a day.” So, with extremely large and quickly-created web-crawled corpora now becoming commonplace in corpus linguistics, the question might well be asked why a new, 100 million word corpus of Written British English is needed. -
Universal Social Protection in Latin America and the Caribbean: Selected Texts 2006-2019
Select pages of ECLAC Universal Social Protection in Latin America and the Caribbean Selected texts 2006-2019 Simone Cecchini (compiler) Thank you for your interest in this ECLAC publication ECLAC Publications Please register if you would like to receive information on our editorial products and activities. When you register, you may specify your particular areas of interest and you will gain access to our products in other formats. www.cepal.org/en/publications ublicaciones www.cepal.org/apps SELECT PAGES OF ECLAC Select Pages of ECLAC is an innovative editorial collection from the Commission, in keeping with the new ways of disseminating and reading publications in the digital era. The titles included in this electronic collection are compilations of texts on the latest topics that make up the Organization’s main areas of work. The full versions of the original articles can be accessed through the links in the publication and in the final section “Documents included in this compilation”. Alicia Bárcena Executive Secretary Mario Cimoli Deputy Executive Secretary Raúl García-Buchaca Deputy Executive Secretary for Management and Programme Analysis Ricardo Pérez Chief, Publications and Web Services Division The selected texts in this volume are taken from institutional documents of the Economic Commission for Latin America and the Caribbean, ECLAC (annual reports and meeting and session documents) and books or documents produced by the following authors and editors: Laís Abramo, Alberto Arenas, Carla Bronzo, Simone Cecchini, Nuria Cunill-Grau, Fernando Filgueira, Carlos Maldonado, Rodrigo Martínez, Beatriz Morales, Fabián Repetto, Nieves Rico, Claudia Robles, Cecilia Rossel, Ana Sojo, Andras Uthoff and Mario Velásquez.