TOUR DE CLARIN CLARIN in a nutshell ANNUAL REPORT Tour de CLARIN is a highly successful Name CLARIN is short for "Common Language Resources CLARIN Tour de CLARIN Common Language Resources and initiative that highlights prominent user Technology Infrastructure VOLUME ONE and Technology Infrastructure". involvement activities of CLARIN national 2018 consortia online. In 2018 outstanding Vision All digital language resources and tools from all over Europe and beyond are accessible through a single resources, tools, events and researchers sign-on online environment for the support from , DLU/Flanders, , of researchers in the humanities and social sciences. , , and were featured. The first printed volume with an Mission Create and maintain an infrastructure to support the overview of 9 national consortia was also sharing, use and sustainability of language data and tools for research in the humanities and social sciences. Edited by Darja Fišer and Jakob Lenardič published in November 2018.

DH COURSE REGISTRY Value proposition The DH Course Registry, developed and maintained as a joint effort of the European research infrastructures CLARIN CLARIN makes digital language and DARIAH, has grown in content and global coverage. resources available to scholars, researchers, The DH Course Registry is a platform that allows users to students, and citizen-scientists from access a database containing over 200 courses and training all disciplines, especially in the humanities events by the end of 2018. and social sciences, through single sign-on access. CLARIN offers long-term solutions and PARTHENOS TRAINING MODULE technology services for deploying, connecting, analysing and sustaining digital language data In 2018 CLARIN contributed to the PARTHENOS Training and tools. CLARIN supports scholars who Suite by developing the module Digital Humanities Research want to engage in cutting edge Questions and Methods showcasing the collections of data-driven research, contributing Parliamentary Records and Computer-Mediated Communication. to a truly multilingual European Research Area. CLARIN PUBLICATIONS Among the output for 2018 one of the most salient publications is a paper by Franciska de Jong, Bente Maegaard, Governance Koenraad De Smedt, Darja Fišer and Dieter Van Uytvanck: “CLARIN: Towards FAIR and Responsible Data Science The General Assembly represents the members of CLARIN ERIC Using Language Resources.” It appeared in the Proceedings and is the highest decision-making body of CLARIN ERIC. of the Eleventh International Conference on Language It is assisted by an international Scientific Advisory Board. Resources and Evaluation (LREC 2018), May 2018, 3259-3264. The day-to-day management is in the hands of the Board of Directors chaired by the executive director Prof. Franciska de Jong and supported by the CLARIN Office. CLARIN IN EU-FUNDED PROJECTS The largest effort towards the further integration of data, tools In 2018 CLARIN ERIC continued its active participation and expertise stems from the activities in the national consortia. in several European and international initiatives. The The National Coordinators’ Forum is responsible for collaboration with Europeana and the activities in the context the coordination of the collaboration across countries. of H2020 project PARTHENOS continued. A new H2020 Various committees and working groups, as well as regular project had its kick-off in January: EOSC-hub. The first result workshops and conferences bring together experts from the CLARIN was the integration of a number of CLARIN services into the community to discuss and solve problems of common interest. EOSC Portal, which was presented at the EOSC launch event In April 2018 an update of the statutes was approved by the on 23 November 2018 in Vienna. European Commission.

STRATEGIC ALLIANCES CLARIN ERIC CLARIN is maintaining strategic relationships with several c/o Utrecht University www.clarin.eu organisations in the research infrastructure ecosystem, Drift 10 [email protected] among which the other SSH research infrastructures, and 3512 BS Utrecht LIBER and Europeana. In 2018 a collaboration agreement was The signed with ELRA. CLARIN ERIC MEMBERS EVENTS KNOWLEDGE SHARING

In 2018 has joined CLARIN ERIC as a member and A total of 192 events, such as The number of certified nodes in the Knowledge Sharing and have joined CLARIN ERIC as observers. At the summer schools, workshops, Infrastructure (K-centres) has gone up from nine to thirteen end of 2018, the total number of participants was 25. Members: tutorials, seminars and in 2018 with more applications in the pipeline. CLARIN AT, BG, CZ, DE, DK, DLU, EE, FI, GR, HR, HU, IT, LT, LV, NL, NO, PL, masterclasses, were offered Mobility Grants were awarded to five researchers for sharing PT, SE, SI. Observers: IS, UK, FR, ZA. Third party: Carnegie Mellon with support from CLARIN of expertise across European countries. University (USA). throughout the network and attracted over 10,000 C EUROPE participants. The highlights STEVEN KRAUWER AWARD include: The 5th edition of the award • CLARIN workshop for newcomers and countries preparing for ceremony took place at the annual membership (Utrecht, the Netherlands) covered a range of ERIC members CLARIN Annual Conference in Pisa, Observers B K topics, from technical and legal aspects to uptake by researchers C C B Countries with participating centres K . The 2018 Steven Krauwer C B Centre Providing Data and user involvement activities. B Awards for CLARIN Achievements C Centre Providing Metadata B K • The 2018 ParlaCLARIN workshop, held in Miyazaki (Japan) K Knowledge Centre K were awarded to Daan Broeder C

B as part of the 11th edition of the Language Resources and (Meertens Institute, KNAW) and C B B C C B C B Evaluation Conference (LREC2018) for researchers interested Pavel Straňák (Charles University), B K C K C B C B K C B B B K in compiling, annotating, structuring, linking and visualising who both made outstanding B B B K parliamentary records. contributions toward CLARIN goals. B C USA B • Hacking the News: from digitised newspapers to the

C C B archived-web, an introductory workshop to text and data C K mining (Helsinki, ) explored topics to consider when undertaking digital analysis of newspaper corpora and analysing FINANCES SOUTH AFRICA web-archives for research. BALANCE 31 December 2018 2017 • The Workshop Translation memories, corpora, term bases: Bridges between translation studies and research infrastructures Assets Fixed assets (Vienna, ), addressed the practical aspects of re-using, equipment €11.292 €9.261 sharing and archiving of language resources in translation ANNUAL CONFERENCE 2018 Current assets 8-10 October 2018, Pisa, Italy studies. receivables €466.124 €565.532 bank accounts €1.333.874 €1.477.357 The Annual Conference was organised by CLARIN ERIC in collaboration with the Institute for Computational Linguistics Total Assets €1.811.290 €2.052.150 (ILC) that is part of the Department of Social Science and Equity and liabilities Humanities, Cultural Heritage (DSU) of the National Research TECHNICAL DEVELOPMENTS capital and reserves €1.529.425 €1.458.438 Council of Italy (CNR). The conference was attended by 240 current liabilities €281.865 €593.712 participants, a rise of 25% in comparison to the previous year. CLARIN CENTRES Total Equity and liabilities €1.811.290 €2.052.150 The participants came from 27 countries and were authors Three B-centres were successfully reassessed. By the end of 2018 of accepted papers, members of national consortia and the number of certified B-centres was 21 and the total number of PROFIT and LOSS 2018 2017 representatives of CLARIN centres, as well as representatives registered centres was 47. from partner organizations. Income FEDERATED IDENTITY membership fees €1.127.005 €1.096.369 The Service Provider Federation (SPF) was extended with Croatia, projects/other €117.997 €80.041 bringing the total number of organisations that can login to 1800. Total Income €1.245.002 €1.176.410 VIRTUAL LANGUAGE OBSERVATORY (VLO) Expenditures Several new versions were launched. The improved functionality personnel costs €693.003 €510.344 brought cleaner facets, visualisation of licenses and a guided tour. travel €61.758 €50.577 other €419.254 €205.302 LANGUAGE RESOURCE SWITCHBOARD Total Expenditures €1.174.015 €766.223 This service to bridge data and tools, transitioned from a beta RESULT €70.987 €410.187 service into a stable production version. Moreover, many tools (e.g. UDPipe) were added and it was integrated into the EUDAT cloud The accounts for the participation by CLARIN ERIC and other storage platform B2DROP. CLARIN nodes in projects funded under EU programmes are not included in this overview.