Compilation of a Swiss German Dialect Corpus and Its Application to Pos Tagging
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Under the Influence: Advertisers' Impact on The
Media and Communication (ISSN: 2183–2439) 2017, Volume 5, Issue 2, Pages 31–40 DOI: 10.17645/mac.v5i2.901 Article Under the Influence: Advertisers’ Impact on the Content of Swiss Free Newspapers Colin Porlezza IPMZ—Institute of Mass Communication and Media Research, University of Zurich, 8050 Zurich, Switzerland; E-Mail: [email protected] Submitted: 1 February 2017 | Accepted: 11 March 2017 | Published: 13 April 2017 Abstract The study focuses on whether and to what extent advertisers influence the editorial content of free newspapers in the German part of Switzerland. The contribution analyzes, grounded on an historic approach, the most competitive period in Switzerland, 2008, when not less than five freesheets were competing for advertisers and public attention. By using Alt- meppen’s (2006) organizational theory, the paper offers a theoretical frame able to describe the vanishing co-orientation between the media management and the newsroom, a trend that aggravates commercialization processes in news or- ganizations. In a situation of economic turmoil, so the hypothesis, newsrooms are more inclined to positively adapt the valence of their coverage about their main advertisers in order to keep them in the portfolio. Using a content analysis, the author examined the editorial coverage of six among the most important advertisers of Swiss free newspapers, carrying out an aggregated statistical analysis based on logistic regression. The study revealed that free newspapers with a strong market orientation display a higher chance to publish positive facts and evaluations about advertisers with a high adver- tising expenditure. Keywords advertising; commercialization; free newspapers; journalism; media accountability Issue This article is part of a multidisciplinary issue of Media and Communication, edited by Epp Lauk (University of Jyväskylä, Finland) and Raul Reis (Emerson College, USA). -
Ringier Annual Report 2009
Ringier Annual Report 2009 Prologue, Michael Ringier 119 The Year at Ringier, Christian Unger 121 Ringier at a Glance 122 Ringier in Figures 124 Profit and Loss Account 125 Sales Shares 127 The Year in Figures 128 Ringier Portfolio 137 Ringier Switzerland 158 Ringier Print Switzerland 160 Ringier Germany 165 Ringier Czech Republic 166 Ringier Slovakia 168 Ringier Hungary 170 Ringier Romania 172 Ringier Serbia 174 Ringier Pacific 176 Important Participations Switzerland 180 Important Participations Europe/Pacific 182 Corporate Center 202 Corporate Communications 203 Corporate Social Responsibility 204 Organisation 230 Events 232 Locations 234 Epilogue, Frank A. Meyer 236 About the Artist 239 Imprint 299 Prologue 119 Michael Ringier, Publisher John Baldessari, the man who designed this year’s annual The second constant is the future. It is change, it is the report, looks like a Greek philosopher. But behind the shock will to shape the future in new and innovative ways no of snow-white hair and flowing beard is not just a wise matter how technology may change our business. Bad man, but a radical one. In his legendary Cremation Project times are sometimes good times as well. After all, the pres- in 1970, he burned all the pictures he had painted before sure to rise to new challenges was never greater than it 1966. Was his voluntary act 40 years ago something that is today. As the use of new technologies advances rapidly, publishers will involuntarily have to imitate quite soon? so too does our ingenuity in applying these technologies to our business activities and in adding new activities to In terms of intelligence and visionary thinking, John expand our business. -
Jung, Laut, Hässig!
JUNG, LAUT, HÄSSIG! WIE SICH DIE LUZERNER HIP-HoPPER INS SCHWEIZER MUSIKGEDÄCHTNIS RAPPEN. 3 CHF 8.– www.null41.ch März 2014 O Monatszeitschrift für Luzern und die Zentralschweiz mit Kulturkalender N ANZEIGEN TUNISREISE ZENTRUM PAUL KLEE A FILM BY NOËL DERNESCH AND MORITZ SPRINGER BERN 14/03 22/06/14DIE TILMANN OTTO aka AUDIENCE AWARD Zurich Film Festival MOILLIETMACKEKLEE GENTLEMAN 2013 IN COMPETITION IDFA Amsterdam 2013 DEFA AWARD Max-Ophüls-Preis 2014 ALBERTO D’ASCOLA aka ALBOROSIE Ab 20. März im Kino I—1914, Privatsammlung, (1887–1914)—Markt in Tunis Macke August Bildausschnitt: Wakonigg Rudolf LWL-MKuK, Foto: Hamburg, Kunsthandel, Rotermund Thole courtesy WWW.ZPK.ORG *JAH_InsD_96x126_kml.indd 1 06.02.14 12:26 4. Leibchen 1. Tor schiessen über den 2. losrennen Kopf ziehen 3. auf die Knie fallen Museum für Kommunikation Helvetiastrasse 16 3005 Bern Hodlerstrasse 8 – 12 www.mfk.ch CH-3000 Bern 7 www.kunstmuseumBern.CH di 10H – 21H mi–so 10H – 17H 140218_Ins_Kulturpool_041_Sesam_96X126mm.indd 1 18.02.2014 16:25:57 EDITORIAL MUSIK Luzern wird ja ganz gerne als «Rock City» bezeich- Einblick in die junge Lebensgeschichte des Luzerner net. Und ja, in Sachen Rock und Pop hat die Zent- Rappers Mimiks (Cover), einem aufstrebenden Talent ralschweiz schon viele Namen hervorgebracht: Da unter den jungen Wilden. Er hat in den vergangenen waren etwa Sportsguitar und Stevens Nude Club, da Jahren praktisch nur für den Rap gelebt und sich fast waren Mothers Pride und Neviss. Da sind noch immer darin verloren – bis der Wendepunkt kam. (Seite 8) Dada Ante Portas. Etwas ruhiger, aber doch mit Gi- An einem Wendepunkt steht man auch am anderen tarre ausgestattet, ist aktuell Heidi Happy schweizweit musikalischen Ende Luzerns: Das Lucerne Festival auf Erfolgskurs – gerade bringt sie ein neues Album macht sich an die Herkulesaufgabe, die Nachfolge von heraus (Seite 25). -
Semantic Approaches in Natural Language Processing
Cover:Layout 1 02.08.2010 13:38 Seite 1 10th Conference on Natural Language Processing (KONVENS) g n i s Semantic Approaches s e c o in Natural Language Processing r P e g a u Proceedings of the Conference g n a on Natural Language Processing 2010 L l a r u t a This book contains state-of-the-art contributions to the 10th N n Edited by conference on Natural Language Processing, KONVENS 2010 i (Konferenz zur Verarbeitung natürlicher Sprache), with a focus s e Manfred Pinkal on semantic processing. h c a o Ines Rehbein The KONVENS in general aims at offering a broad perspective r on current research and developments within the interdiscipli - p p nary field of natural language processing. The central theme A Sabine Schulte im Walde c draws specific attention towards addressing linguistic aspects i t of meaning, covering deep as well as shallow approaches to se - n Angelika Storrer mantic processing. The contributions address both knowledge- a m based and data-driven methods for modelling and acquiring e semantic information, and discuss the role of semantic infor - S mation in applications of language technology. The articles demonstrate the importance of semantic proces - sing, and present novel and creative approaches to natural language processing in general. Some contributions put their focus on developing and improving NLP systems for tasks like Named Entity Recognition or Word Sense Disambiguation, or focus on semantic knowledge acquisition and exploitation with respect to collaboratively built ressources, or harvesting se - mantic information in virtual games. Others are set within the context of real-world applications, such as Authoring Aids, Text Summarisation and Information Retrieval. -
Text Corpora and the Challenge of Newly Written Languages
Proceedings of the 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020), pages 111–120 Language Resources and Evaluation Conference (LREC 2020), Marseille, 11–16 May 2020 c European Language Resources Association (ELRA), licensed under CC-BY-NC Text Corpora and the Challenge of Newly Written Languages Alice Millour∗, Karen¨ Fort∗y ∗Sorbonne Universite´ / STIH, yUniversite´ de Lorraine, CNRS, Inria, LORIA 28, rue Serpente 75006 Paris, France, 54000 Nancy, France [email protected], [email protected] Abstract Text corpora represent the foundation on which most natural language processing systems rely. However, for many languages, collecting or building a text corpus of a sufficient size still remains a complex issue, especially for corpora that are accessible and distributed under a clear license allowing modification (such as annotation) and further resharing. In this paper, we review the sources of text corpora usually called upon to fill the gap in low-resource contexts, and how crowdsourcing has been used to build linguistic resources. Then, we present our own experiments with crowdsourcing text corpora and an analysis of the obstacles we encountered. Although the results obtained in terms of participation are still unsatisfactory, we advocate that the effort towards a greater involvement of the speakers should be pursued, especially when the language of interest is newly written. Keywords: text corpora, dialectal variants, spelling, crowdsourcing 1. Introduction increasing number of speakers (see for instance, the stud- Speakers from various linguistic communities are increas- ies on specific languages carried by Rivron (2012) or Soria ingly writing their languages (Outinoff, 2012), and the In- et al. -
Blick Am Abend St.Gallen Mediadaten
ImmoScout24 plus Blick am Abend Mediadaten St.Gallen www.blickamabend.ch Donnerstag, 6. Januar 2011 You are Zürich, Nr. 4 not alone ... Kerstin Cook Erscheinungsweise: und andere HEUTE ABEND Studien- Wöchentlich, abbrecher aus 3° aller Welt. PEOPLE 14/15 MORGEN 8° donnerstags Eiskalt zugeschlagen Dario Blitzeis blockiert Auflage: Cologna gewigewinntnnt heute MittagMittag das 35-Kilometer-Rennen35-Kilometer-Rennen Morgenverkehr in Toblach (I).(I). Tausende Pendler kamen heute im Mittelland nicht vom Fleck. Bus- und Postautobetrieb 22’500 waren lahmgelegt. NEWS 2/3 Ex. (Quelle: Grösste Leserschaft: Ringier) Sorge der 50 Männer ’000 entfällt Leser (Quelle: Endlich fanden WOHNTRÄUME MACH Forscher die Charakteristik: Basic Ursache für 2012-1) Zuvorzeitigen vermieten nach Vereinbarung Zu vermieten nach Vereinbarung Zu vermieten nach Vereinbarung 2-Zimmer-Dachwohnung Basel Basel Gratis-Abendzeitung Ort:Haarausfall. Basel 4.5-Zimmer-Dachwohnung 4.5-Zimmer-Dachwohnung Grösse: 230 m² NEWS 2 an ruhiger Lage an ruhiger Lage Preis: CHF 2’500.-/Mt. (inkl. NK) Preis: CHF 2’500.-/Mt. (inkl. NK) Preis: CHF 2’500.-/Mt. (inkl. NK) Grösse: 230 m² Grösse: 230 m² www.immoscout24.ch Cologna Telefon: 031 744 11 11 Telefon: 031 744 11 11 ImmoScout24-Code: 7568942 ImmoScout24-Code: 7568942 Interessiert? Mit dem ImmoScout24-Code Anzeigenschluss: Superstar Zu vermieten nach Vereinbarung Zu vermieten nach Vereinbarung gelangen Sie ganz einfach Basel Basel zu den detaillierten Der schnellste Lang- 4.5-Zimmer-Dachwohnung 4.5-Zimmer-Dachwohnung Informationen! Jeweils an ruhiger Lage an ruhiger Lage läufer der Welt fährt Preis: CHF 2’500.-/Mt. (inkl. NK) Preis: CHF 2’500.-/Mt. (inkl. NK) Immobilien-Treuhand AGHeute dienstags Grösse: 230 m² Grösse: 230 m² die Konkurrenz in Karin Muster, Tel. -
Kommentare in Deutschschweizer Tageszeitungen: Forschungssituation, Frequenz, Formale Gestalt Und Vernetzung
HARTMUT E. H. LENK Kommentare in Deutschschweizer Tageszeitungen: Forschungssituation, Frequenz, formale Gestalt und Vernetzung 1. Die Zeitungslandschaft in der Deutschschweiz Die Schweiz ist nicht nur das mitteleuropäische Land mit der höchsten Zahl von Zeitungsausgaben pro Kopf der Bevölkerung: Wie die Über- sicht in Abb. 1 des Beitrags von SKOG-SÖDERSVED (in diesem Band) ausweist, liegt sie nach den Daten aus dem Jahre 2013 (für das Jahr 2012) im internationalen Vergleich mit 286 Exemplaren je 1 000 Einwohner auf Platz vier (knapp vor Österreich mit 275 auf Platz fünf und etwas klarer vor der Bundesrepublik Deutschland mit 255 Ex. auf Platz acht). Die Presse in der Schweiz gehört auch zu den von der Medienforschung und von der Medien- und Textsortenlinguistik häufig und vergleichs- weise gründlich untersuchten Massenkommunikationsmitteln. 1.1 Zur Forschungssituation Die germanistische Linguistik an Schweizer Universitäten gehörte zu je- nen, die auf dem Gebiet der medienlinguistischen Forschung Pionierar- beit leisteten. Zu verweisen ist beispielsweise auf die Arbeiten von HA- RALD BURGER, dessen Werk Sprache der Massenmedien (BURGER 1984) zusammen mit den Arbeiten von LÜGER (1983/21995) und BU- CHER (1986) zu den „Meilensteinpublikationen“ (STÖCKL 2012, 15f.) der deutschsprachigen Medienlinguistik gehört und 2014 in Zusammen- arbeit mit MARTIN LUGINBÜHL in 4., neu bearbeiteter und erweiterter Auflage erschien. Auch die Zürcher Publizistikforschung leistete ihren Beitrag zur Beschreibung u. a. sprachlicher Eigenschaften von Print- medien, wie z.B. die Untersuchung von SAXER u.a. (1979) zur Boule- vardzeitung Blick. 262 Hartmut E. H. Lenk In Zürich entstanden in den folgenden Jahrzehnten zahlreiche weitere Arbeiten zur Analyse massenmedialer Kommunikation, die innovative Methoden einsetzten, historische und diachrone Aspekte berücksichtig- ten und in jüngerer Zeit zunehmend auch kontrastive Aspekte einschlie- ßen und über das Medium Zeitung hinaus gehen. -
World Languages Using Latin Script
World languages using Latin script Source: http://www.omniglot.com/writing/langalph.htm https://www.ethnologue.com/browse/names Sort order : Language status, ISO 639-3 Lang, ISO Language name Classification Population status Language map Comment 639-3 (EGIDS) Botswana, Lesotho, South Indo-European, Germanic, West, Low 1. Afrikaans, afr 7,096,810 1 Africa and Saxon-Low Franconian, Low Franconian SwazilandNamibia Azerbaijan,Georgia,Iraq 2. Azeri,Azerbaijani azj Turkic, Southern, Azerbaijani 24,226,940 1 Jordan and Syria Indo-European Balto-Slavic Slavic West 3. Czech Bohemian Cestina ces 10,619,340 1 Czech Republic Czech-Slovak Chamorro,Chamorru Austronesian Malayo-Polynesian Guam and Northern 4. cha 94,700 1 Tjamoro Chamorro Mariana Islands Seychelles Creole,Seselwa Creole, Creole, Ilois, Kreol, 5. Kreol Seselwa, Seselwa, crs Creole, French based 72,700 1 Seychelles Seychelles Creole French, Seychellois Creole Indo-European Germanic North East Denmark Finland Norway 6. DanishDansk Rigsdansk dan Scandinavian Danish-Swedish Danish- 5,520,860 1 and Sweden Riksmal Danish AustriaBelgium Indo-European Germanic West High Luxembourg and 7. German Deutsch Tedesco deu German German Middle German East 69,800,000 1 NetherlandsDenmark Middle German Finland Norway and Sweden 8. Estonianestieesti keel ekk Uralic Finnic 1,132,500 1 Estonia Latvia and Lithuania 9. English eng Indo-European Germanic West English 341,000,000 1 over 140 countries Austronesian Malayo-Polynesian 10. Filipino fil Philippine Greater Central Philippine 45,000,000 1 Filippines L2 users population Central Philippine Tagalog Page 1 of 48 World languages using Latin script Lang, ISO Language name Classification Population status Language map Comment 639-3 (EGIDS) Denmark Finland Norway 11. -
Parsing Approaches for Swiss German
Master’s thesis presented to the Faculty of Arts and Social Sciences of the University of Zurich for the degree of Master of Arts UZH nmod case compound amod Parsing Approaches for Swiss German NOUN NOUN ADP ADJ NOUN Author: Noëmi Aepli Student ID Nr.: 10-719-250 Examiner: Prof. Dr. Martin Volk Supervisor: Dr. Simon Clematide Institute of Computational Linguistics Submission date: 10.01.2018 nmod case compound amod Parsing Approaches for Swiss German Abstract NOUN NOUN ADP ADJ NOUN Abstract This thesis presents work on universal dependency parsing for Swiss German. Natural language pars- ing describes the process of syntactic analysis in Natural Language Processing (NLP) and is a key area as many applications rely on its output. Building a statistical parser requires expensive resources which are only available for a few dozen languages. Hence, for the majority of the world’s languages, other ways have to be found to circumvent the low-resource problem. Triggered by such scenarios, research on different approaches to cross-lingual learning is going on. These methods seem promising for closely related languages and hence especially for dialects and varieties. Swiss German is a dialect continuum of the Alemannic dialect group. It comprises numerous vari- eties used in the German-speaking part of Switzerland. Although mainly oral varieties (Mundarten), they are frequently used in written communication. On the basis of their high acceptance in the Swiss culture and with the introduction of digital communication, Swiss German has undergone a spread over all kinds of communication forms and social media. Considering the lack of standard spelling rules, this leads to a huge linguistic variability because people write the way they speak. -
ADVERTISING PRIMER 2016 You Can Clearly Only Market Your Shop Online
ADVERTISING PRIMER 2016 You can clearly only market your shop online ... ... but in combination with promotional letters, you can boost your sales. More information at: post.ch/wirkung-studien Boost your sales. CONTENTS Switzerland Data on Switzerland 2 Economic data 3 European comparison 4 Media Market Advertising expenditure 5 Media market Switzerland 6 Media in Europe 7 Social media comparison – Intermedia comparison 8 Media Data Magazines 14 Daily-, Sunday-, Financial-, Business-Press 16 Newspaper/Magazines Combinations 18 Television – Digital advertising displays 20 Internet – Online 22 Cinema – AdScreens in public transports 24 Radio 25 Out-of-home – Posters 26 Direct advertising 27 Studies Official Swiss media studies 28 Media terms – Internet/Online Glossary 32 Addtional data Map of WEMF regions 34 Swiss population data 36 All data given according to the sources indicated. No responsibility is accepted for their accuracy. State: December 16, 2015. For additional copies of the Ringier Advertising Primer please see www.go4media.ch (also with download). Contact Ringier AG Publishing Media Dufourstrasse 23 CH-8008 Zurich [email protected] www.go4media.ch www.ringier.com DATA ON SWITZERLAND 2 Geography Surface sq. km % Surface sq. km % Total 41,285 100.0 Unproductive surface 10,459 25.3 Cultivated area 14,817 35.9 Populated area 3,079 7.5 Floor surface 12,931 31.3 Politics Swiss Confederation: founded in 1291, Federal State since 1848 Administrative units (federalistic): 26 cantons (6 of them are half-cantons) Form of government: direct democracy Political levels and parties Parties State: 2015 Seats SVP SP FDP CVP others Federal Council Collegial board 7 2 2 2 1 Parliament Council of States cantonal repres. -
Ročník 68, 2017
2 ROČNÍK 68, 2017 JAZYKOVEDNÝ ČASOPIS ________________________________________________________________VEDEcKÝ ČASOPIS PrE OtáZKY tEórIE JAZYKA JOUrNAL Of LINGUIStIcS ScIENtIfIc JOUrNAL fOr thE thEOrY Of LANGUAGE ________________________________________________________________ hlavná redaktorka/Editor-in-chief: doc. Mgr. Gabriela Múcsková, PhD. Výkonní redaktori/Managing Editors: PhDr. Ingrid Hrubaničová, PhD., Mgr. Miroslav Zumrík, PhD. redakčná rada/Editorial Board: doc. PhDr. Ján Bosák, CSc. (Bratislava), PhDr. Klára Buzássyová, CSc. (Bratislava), prof. PhDr. Juraj Dolník, DrSc. (Bratislava), PhDr. Ingrid Hrubaničová, PhD. (Bra tislava), Doc. Mgr. Martina Ivanová, PhD. (Prešov), Mgr. Nicol Janočková, PhD. (Bratislava), Mgr. Alexandra Jarošová, CSc. (Bratislava), prof. PaedDr. Jana Kesselová, CSc. (Prešov), PhDr. Ľubor Králik, CSc. (Bratislava), PhDr. Viktor Krupa, DrSc. (Bratislava), doc. Mgr. Gabriela Múcsková, PhD. (Bratislava), Univ. Prof. Mag. Dr. Ste- fan Michael Newerkla (Viedeň – Rakúsko), Associate Prof. Mark Richard Lauersdorf, Ph.D. (Kentucky – USA), doc. Mgr. Martin Ološtiak, PhD. (Prešov), prof. PhDr. Slavomír Ondrejovič, DrSc. (Bratislava), prof. PaedDr. Vladimír Patráš, CSc. (Banská Bystrica), prof. PhDr. Ján Sabol, DrSc. (Košice), prof. PhDr. Juraj Vaňko, CSc. (Nitra), Mgr. Miroslav Zumrík, PhD. prof. PhDr. Pavol Žigo, CSc. (Bratislava). technický_______________________________________________________________ redaktor/technical editor: Mgr. Vladimír Radik Vydáva/Published by: Jazykovedný ústav Ľudovíta Štúra Slovenskej -
Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects, Pages 1–15, Valencia, Spain, April 3, 2017
VarDial 2017 Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial’2017) Proceedings of the Workshop April 3, 2017 Valencia, Spain c 2017 The Association for Computational Linguistics Order copies of this and other ACL proceedings from: Association for Computational Linguistics (ACL) 209 N. Eighth Street Stroudsburg, PA 18360 USA Tel: +1-570-476-8006 Fax: +1-570-476-0860 [email protected] ISBN 978-1-945626-43-2 ii Preface VarDial is a well-established series of workshops held annually and co-located with top-tier international NLP conferences. Previous editions of VarDial were VarDial’2014, which was co- located with COLING’2014, LT4VarDial’2015, which was held together with RANLP’2015, and finally VarDial’2016 co-located with COLING’2016. The great interest of the community has made possible the fourth edition of the Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial’2017), co-located with EACL’2017 in Valencia, Spain. The VarDial series has attracted researchers working on a wide range of topics related to linguistic variation such as building and adapting language resources for language varieties and dialects, creating language technology and applications that make use of language closeness, and exploiting existing resources in a related language or a language variety. We believe that this is a very timely series of workshops, as research in language variation is much needed in today’s multi-lingual world, where several closely-related languages, language varieties, and dialects are in daily use, not only as spoken colloquial language but also in written media, e.g., in SMS, chats, and social networks.