
DIGICOM project - Inventory – May 2016

Open Data / Linked Open Data

Activities How does it function in your office?

CH
Activities:
- Major player and stakeholder in the Swiss Open Government Data portal (http://opendata.swiss).
- We have our own Linked Data prototype portal (http://data.admin.ch), with a SPARQL endpoint.
- Implementing best practices in terms of metadata and SDMX.
How does it function in your office?
We are still trying to put appropriate workflows in place. The ultimate aim is to use our LOD datastore as the unique data source for all our online activities, and to make it available at large to all interested parties.

DE
Activities:
Destatis acts as a data provider. Database contents are provided for the portals govdata.de (administrative data) and geoportal.de (maps, aerial views and thematic maps from Germany). Furthermore, our database offers an API to download machine-readable data. A tutorial on the web services is available on the website: https://www-genesis.destatis.de/genesis/misc/GENESIS-Webservices_Einfuehrung.
How does it function in your office?
Nightly build runs produce a set of special JSON files containing the metadata of every database table. The harvesting mechanism of the govdata portal analyses this set of files.

EE
Activities:
The statistical database is available as open data (PX files, CSV). We are currently replacing the statistical database platform (OECD .Stat) to offer all data in SDMX and SDMX-JSON formats and to allow SOAP and REST queries for machine-readable open data. Document management system metadata and some non-restricted data are also available as open data. Linked data is the next step as a vision, but no specific steps have been taken. The .Stat platform will enable linked data at the technical level.
How does it function in your office?
It is government policy to make as much as possible available as open data. All data is made publicly available in the statistical database: http://pub.stat.ee/px-web.2001/dialog/statfile1.asp

EL
One option of the upgraded ELSTAT portal is the dissemination of open datasets, in SDMX format, via web services. Furthermore, ELSTAT is a key supplier of open datasets for governmental portals.
FI
Activities:
The terms of use, renewed in 2012, grant a universal, free-of-charge, irrevocable, parallel right of use to the statistical material. In practice the copyrights follow the Creative Commons BY licence: see "Copyrights and Terms of Use" (http://www.stat.fi/org/lainsaadanto/yleiset_kayttoehdot_en.)
Several APIs have been prepared for the open data datasets: see "Statistics Finland, open data and interfaces" (http://www.stat.fi/org/avoindata/index_en.html)
How does it function in your office?
Seminar about the use of open data: see http://www.stat.fi/ajk/tapahtumia/2014-11-19_tiedolla_johtaminen_ja_avoin_data.html (in Finnish only)
Customer course about the use of Statistics Finland's open data APIs: see http://www.stat.fi/tup/koulutus/koulutus_k16tietokannat.html (in Finnish only)

FR
Activities:
Insee.fr offers data files online in several open data formats: csv, rdf... and geojson files for geographical information. Metadata (classifications, geographic codes) and figures from the Census are published as RDF. RDF data are available as downloadable files and through a SPARQL endpoint (http://rdf.insee.fr/sparql).
How does it function in your office?
These activities are managed by the dissemination business unit and IT people. Regarding the Census figures, the RDF data are created in the dissemination phase, from the same sources used for the other dissemination formats. They are published at the same time. RDF metadata are currently converted from other existing publication formats (e.g. Excel), but Insee's central metadata repository is moving to RDF, so RDF will become the main language in the near future and the other formats will be produced from it.

[Row-group labels: Before 2013 · Internal resources]

IT
Activities:
- Istat participated in the working group of the Commission to coordinate the System of Public Connectivity of DigitPA, contributing to the drafting of the "Guidelines for semantic interoperability through Linked Open Data".
- Member of the task force of the Italian Digital Agency (AGID) for defining the "Linee Guida per la valorizzazione del patrimonio informativo pubblico" (guidelines for enhancing public information assets).
- Three editions of the Data Journalism School Istat/Ahref foundation.
- Istat promoted AppsForItaly, the Italian competition on open data (2012).
- Member of a Eurostat task force on "Common ESS conditions for access to and re-use of data" (2012).
- Participation in the proceedings of the national G8 Open Data action plan.
- Organization of International Open Data Day (2013 and 2014).
- Organization of the Italian data contest "#Censimenti Data Challenge" to promote the reuse of Industry and Services Census data (2014).
- Participation in the proceedings of the NSO Study Group on Open Data (World Bank): "Open Data Challenges and Opportunities for National Statistical Offices" (2014).
- International Journalism Festival (Perugia): organization of a hackathon with Census and statistics data.
- Organization of Hack4DigitalGov (2015).
We have two websites dedicated to Linked Open Data: http://datiopen.istat.it/ and http://linkedstat.spaziodati.eu/
How does it function in your office?
Data and analysis from the Italian National Statistical Institute are licensed under a Creative Commons Attribution 3.0 licence. Most of Istat's statistical data available on dati.istat.it (the Istat corporate data warehouse and the main channel that Istat uses for data dissemination) are also disseminated through SEP, a web service that can be queried to get structured data in SDMX format. We converted the whole I.Stat data warehouse to RDF, exposed it as Linked Open Data and made it accessible via SPARQL. Making Istat's data available as Linked Open Data would facilitate data reuse within the statistical domain but, more importantly, it would allow third-party data consumers to combine statistical data with other contextual information more easily.

LV
The CSB dissemination database is legally and technically open (TSV, CSV and other formats for download); all datasets can also be accessed via API. There is no bulk download facility.

NO
We offer some of our most popular data in an open data format, see http://www.ssb.no/en/omssb/tjenester-og-verktoy/api. From early May all of our StatBank will be available as open data, see http://www.ssb.no/en/statistikkbanken. We also offer some of our metadata as open data in the government-run open data hotel, see http://data.norge.no/ (Norwegian only).

UK
Activities:
Several websites are available to assist the dissemination of open data.
• ONS website [https://www.ons.gov.uk/]: generally released XLS data, with some CSV, released via the Open Government License, plus a JSON API
• Data Explorer [http://web.ons.gov.uk/ons/data/dataset-finder]: data available via XLS, CSV and XML. An API is also available for machine service [https://web.ons.gov.uk/ons/apiservice/web/apiservice/home]
• NOMIS [http://www.nomisweb.co.uk/]: data
• Geography portal: providing geographic products to users using RDF and semantic web technologies
• Linked open data pilot to convert population, business and earnings statistics into RDF
• Linked open data interdepartmental collaboration: connecting linked open data formats
How does it function in your office?
Dissemination of statistical outputs to the website is a well-established process, controlled under our official code of practice. Outputs are prepared and submitted for dissemination as part of scheduled release protocols.
• Producing machine-readable open data formats requires some manual data wrangling to deconstruct and structure data outputs accordingly. Small teams exist to carry out this process in line with the agreed release protocols.

AT
Activities:
Statistics Austria developed an open data webpage, http://data.statistik.gv.at/web/, where data published according to open data principles are available. This webpage is synchronized with the general Austrian open data webpage relevant for public administration.
How does it function in your office?
The creation of datasets relevant for publication on the open data webpage is integrated in the production flow for the statistical database STATcube. As a general rule, everything that is available free of charge can be integrated in Open Data. For some special datasets (e.g. cartographic geometries) special procedures have to be launched.

CY
Activities:
CYSTAT adopts the ESS declaration for the PSI directive of the EU Commission, providing its statistics free of charge as a public good irrespective of subsequent use. This provision is included in CYSTAT's dissemination and pricing policy document, which is available on its website. In the framework of the PSI directive, CYSTAT provides data on the Open Data portal of the government (www.data.gov.cy).
How does it function in your office?
CYSTAT publishes all its statistics on the website, mainly in a machine-readable form (XLS), free of charge. All the information on the website is available free of charge.
CZ
Activities:
CZSO is very active in this area within the public administration in the Czech Republic. There is a special site on the web dedicated to open data: http://www.volby.cz/opendata/opendata.htm (only in Czech). We regularly publish election results (CZSO is responsible for election results processing). Basic data from the 2011 Census are accessible in open format, and selected classifications are also available at this time. Tables in the public database can be exported in XML format with a description.
How does it function in your office?
Election results: immediately after data processing, a special output in open format is published on the website. Other files were prepared manually: Census results are not updated, and classifications are updated only rarely. Tables in the public database are a standard output, prepared automatically as one of the export formats.

DK
Activities:
All our public data can be accessed via our free and open API (see: api.statbank.dk).
How does it function in your office?
We do not engage in linked open data yet.

[Row-group labels: 2013 - 2014 · Internal resources]

ES
Activities:
1. Automatic publication of relevant information and datasets in the national open data portal, datos.gob.es. INE has 200 datasets (at the statistical operation level) published in this portal. There is also a special section called Open data on the INE website: http://www.ine.es/ss/Satellite?L=en_GB&c=Page&cid=1259942408928&p=1259942408928&pagename=ProductosYServicios%2FPYSLayout
2. A JSON API, published in 2015, to allow automatic exploitation of our output database (http://www.ine.es/dyngs/DataLab/en/manual.html?cid=45).
3. INE publishes microdata as Public Use Files for more than 40 statistical surveys (especially household surveys), which are much appreciated by users (http://www.ine.es/en/prodyser/microdatos_en.htm).
How does it function in your office?
On a regular basis (every two weeks) we extract an RDF file from our systems with all the metadata needed to publish in the open data portal (datos.gob.es). The data always remain in our systems.

PT
All the data on the web portal are freely available and in formats that do not depend on commercial software (e.g. CSV and PDF). Additionally, there is a cooperation with the dados.gov initiative in order to provide statistical data in a standard and open way to everyone.

SE
Activities:
Our experience is limited. We took part in a Linked Open Data project (initiated and led by another government agency, Vinnova, Sweden's innovation agency) a few years ago, but after this project we have not done any more work in this field.
How does it function in your office?
Apart from making our APIs available (http://www.scb.se/en_/About-us/Open-data-API/) we have not incorporated this in our workflows.

SK
Activities:
There is a central Slovak website of open data (managed by the Ministry of Interior of the Slovak Republic, https://data.gov.sk/en/). Until now, the SO SR has published 586 datasets in total on the website (i.e. much more than any other institution of the Slovak Republic).
How does it function in your office?
As above.

BE
Activities:
We publish structured data in specific open data formats on our Statistics Belgium Open Data Portal.
How does it function in your office?
The production of open data is very much linked to our data warehouse system. Data stored in our data warehouse are gradually transformed into open data files if it is appropriate to do so. This requires training of the statisticians involved.

LU
All the tables of our statistics portal are available in Excel, CSV and XML format. An API does not currently exist. Tables will soon (April 2016) be available on the Open Data Portal of Luxembourg. Tests have already been carried out and were successful.

[Row-group label: Internal resources]

[Row-group label: 2015 or later]

NL
All our statistics are available free of charge through our StatLine open data portal (the database contains 3,700 tables).

PL
A project, "Development of guidelines for publishing statistical data as linked open data", started in January 2016 and is expected to finish in January 2018. No project products are available as yet.

Good practice outside the ESS
- http://www.swirrl.com/
- http://opencube-toolkit.eu/
- ODI: http://theodi.org
- New York Times
- dados.gov (http://www.dados.gov.pt)
- Most important libraries publish their resources as RDF; see for example the Library of Congress: http://id.loc.gov/
- Another good reference is FAO's Agrovoc: http://aims.fao.org/fr/agrovoc
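
Several of the entries above (CH, FR, IT) mention SPARQL endpoints as the access route to their linked data. As a rough illustration only, the following Python sketch shows how a client might build a request URL for such an endpoint using the standard SPARQL 1.1 Protocol GET binding, where the query text travels in the `query` parameter. The endpoint URL is the one quoted in the FR entry; the sample query and the function names are hypothetical, and nothing here is taken from any office's own documentation. The sketch only builds the URL and does not send a request.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Endpoint as quoted in the FR entry; its current availability is not checked here.
INSEE_ENDPOINT = "http://rdf.insee.fr/sparql"

# Generic illustrative query (an assumption): it lists five arbitrary triples
# and relies on no knowledge of the vocabularies actually published.
SAMPLE_QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"


def build_sparql_get_url(endpoint: str, query: str) -> str:
    """Return a GET URL that carries the query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


def query_from_url(url: str) -> str:
    """Recover the SPARQL query text from a URL built by build_sparql_get_url."""
    return parse_qs(urlsplit(url).query)["query"][0]


if __name__ == "__main__":
    print(build_sparql_get_url(INSEE_ENDPOINT, SAMPLE_QUERY))
```

A client would then fetch this URL, typically asking for JSON results with an `Accept: application/sparql-results+json` header; whether a given endpoint honours that format is something to verify against the endpoint itself.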
