Slavicorp 2018 Book of Abstracts
SlaviCorp 2018
24–26 September 2018
Charles University, Prague
Book of Abstracts

Table of Contents

Plenaries

Björn Hansen, Edyta Jurkiewicz-Rohrbacher, Zrinka Kolaković
Detecting constraints on clitic climbing – with the help of corpora and psycholinguistic tests

Alexandr Rosen
The merits of a parallel corpus and how to get the most out of it

Ruprecht von Waldenfels
Variation on many levels: why and how comparing corpora and (Slavic) languages makes sense

Full papers

Magdalena Adamczyk
A contrastive look at discursive uses of English ‚now‘ and Polish ‚teraz‘

Dorota Adamiec, Renata Bronikowska, Włodzimierz Gruszczyński, Emanuel Modrzejewski, Aleksandra Wieczorek
The Electronic Corpus of the 17th and 18th c. Polish Texts (up to 1772). The final result

Anastasiia Baranchikova, Anna Dmitrieva, Mariia Fedorova, Aleksandr Klimov, Olesya Kisselev, Mikhail Kopotev, Svetlana Toldova, Natalia Zevakhina
CAT&kittens: a corpus-based text-analytic tool for Russian academic writing

Vladimír Benko, Radovan Garabík
Ensemble Tagging Slovak Web Data

Neil Bermel, Luděk Knittl
The fate of variant forms in historical corpora: Tracing locative exponents in DIAKON

Martina Berrocal
A corpus-assisted study of the Presidential interviews of Milos Zeman

Katja Brankačkec
Productivity and Meaning of the Prefix nad- in the Word-Formation of Upper Sorbian, Lower Sorbian and Czech in a Diachronic Perspective: Evidence from Corpora

Kat Dziwirek
To taste is to live and love: Verbs of taste in Polish and English

Hanne Martine Eckhoff, Aleksandrs Berdičevskis, Marius Jøhndal
From diachronic treebank to dictionary resource: the Varangian Rus project

Tomaž Erjavec, Nikola Ljubešić, Darja Fišer
Training data and tools for processing user-generated content in Slovene, Croatian and Serbian

Matea Filko, Krešimir Šojat, Marko Tadić
Construction za + infinitive – evidence from the Croatian corpora

Olga Goritskaya, Mikita Suprunchuk
Frequency Dictionary of Belarusian Borrowings in the Belarusian Variety of the Russian Language

Natalia Grabar, Olga Kanishcheva, Thierry Hamon
Multilingual aligned corpus with Ukrainian as the target language

Jane Hacking, Erin Schnur, Fernando Rubio
MuSSeL: Designing and building a corpus of multilingual second language speech

Juho Härme
Last year but not yesterday? Explaining differences in the locations of Finnish and Russian time adverbials using comparable corpora

Stefan Heck, Eugen Kravchenko
Polish być w trakcie + verbal noun – a progressive periphrastic construction?

Milena Hnátková, Tomáš Jelínek, Marie Kopřivová, Vladimír Petkevič, Alexandr Rosen, Hana Skoumalová, Pavel Vondřička
Multiword Expressions in Czech: Typology and Lexicon

Jakob Horsch
A Construction Grammar Account of the Slovak Comparative Correlative Construction

Laura Janda, Francis Tyers
Parts Give More Than Wholes: Paradigms from the Perspective of Corpus Data

Tomáš Jelínek
New error annotation of Czech learner corpora

Tomáš Káňa
Terminology in and around Diminutives

Witold Kieraś, Łukasz Kobyliński, Maciej Ogrodniczuk
Korpusomat – new functionalities and future development

Witold Kieraś, Marcin Woliński
Basic natural language processing toolkit for 19th century Polish

Valeria Kolosova, Ksenia Zaytseva, Kira Kovalenko
PhytoLex – the Database of Russian Phytonyms: from Idea to Implementation

Lucie Kopáčková
Oprahin or Opražin? How to Correctly Form Possessive Adjective from Female First Name or Surname of Foreign Origin in Contemporary Written Czech Language?

Natalia Kotsyba, Bohdan Moskalevskyi
An essential infrastructure of Ukrainian language resources and its possible applications

Anna Kryvenko
A reference corpus for discourse dynamics analysis in Ukrainian?

Miroslav Kubát, Jan Hůla, Radek Čech, David Číž, Kateřina Pelegrinová
Context Specificity of Lemma. Diachronic analysis

Moulay Zaidan Lahjouji
The Corpus of Spoken Rusyn – A user-friendly resource for research on Rusyn dialects

Nikola Ljubešić, Tanja Samardžić, Tomaž Erjavec, Darja Fišer, Maja Miličević Petrović, Simon Krek
“Kad se mnogo malih složi”: Collaborative development of gold resources for Slovene, Croatian and Serbian

David Lukeš, Zuzana Komrsková, Marie Kopřivová, Petra Poukarová
Pronunciation of casual spoken Czech: A quantitative survey

Lucie Lukešová (Chlumská), Dominika Kováříková
Extracting Multi-word Expressions for the Czech Academic Phrase List

Marek Łaziński
Actional Interpretation of Verbal Aspect in Legal Texts - Corpus Analysis

Jiří Milička, Alžběta Růžičková
Slovak Vowel Phonotactics: Slavic Origins vs. Hungarian Influences

Tore Nesset
Cascading S-curves: What corpus linguistics tells us about language change

Jana Nová, Vít Michalec, Zdeňka Opavská, Renáta Neprašová
Frequency (not) sacred: The headword list of a contemporary Czech monolingual dictionary and corpora

Tatiana Perevozchikova
Pronominal expression of possession in noun phrases in Russian, Czech, and Bulgarian

Alexander Piperski
Aspect-Specific Keywords in Russian

Adam Przepiórkowski, Agnieszka Patejuk
An Enhanced Universal Dependencies Treebank of Polish

Anna Řehořková
Czech conditional verb forms in assertive complement clauses

Thomas Samuelsson
The Russian adjectives antirossijskij, antirusskij and antisovetskij in Russian media: a corpus study

Ranka Stanković, Miloš Utvić, Aleksandra Tomašević, Ivan Obradović, Biljana Lazić
Development and application of a domain specific corpus for mining engineering

Ilona Starý Kořánová
Aspectual homonymy and polysemy in Czech

Marcin Szczepański
Recent challenges and advances in the development of Lower Sorbian corpus resources

Magda Ševčíková, Adéla Kalužová, Zdeněk Žabokrtský
A language resource specialized in Czech word-formation: Recent achievements in developing the DeriNet database

Svatava Škodová
Sebrat se a …. a construction between coordination and subordination in contemporary Czech

Petar Vuković
The second future tense in contemporary Croatian: A corpus-driven study in grammatical semantics

Adrian Jan Zasina
Evaluating a corpus-driven approach in L2 classroom on the example of Czech

Adrian Jan Zasina, Michal Škrabal
Morfio.pl – the possibilities for the application of Czech corpus tools to other languages

Jan Patrick Zeller
Syntagmatic corpus analyses of mixed speech: code-shifting in Belarusian trasyanka and Ukrainian suržyk

Plenaries

Björn Hansen
Universität Regensburg

Although clitics in Bosnian, Croatian and Serbian (BCS) have attracted considerable attention in the syntactic literature (cf. Franks & King 2000,