Chaudron: Extending Dbpedia with Measurement Julien Subercaze
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Survey of Top-Level Ontologies to Inform the Ontological Choices for a Foundation Data Model
A survey of Top-Level Ontologies To inform the ontological choices for a Foundation Data Model Version 1 Contents 1 Introduction and Purpose 3 F.13 FrameNet 92 2 Approach and contents 4 F.14 GFO – General Formal Ontology 94 2.1 Collect candidate top-level ontologies 4 F.15 gist 95 2.2 Develop assessment framework 4 F.16 HQDM – High Quality Data Models 97 2.3 Assessment of candidate top-level ontologies F.17 IDEAS – International Defence Enterprise against the framework 5 Architecture Specification 99 2.4 Terminological note 5 F.18 IEC 62541 100 3 Assessment framework – development basis 6 F.19 IEC 63088 100 3.1 General ontological requirements 6 F.20 ISO 12006-3 101 3.2 Overarching ontological architecture F.21 ISO 15926-2 102 framework 8 F.22 KKO: KBpedia Knowledge Ontology 103 4 Ontological commitment overview 11 F.23 KR Ontology – Knowledge Representation 4.1 General choices 11 Ontology 105 4.2 Formal structure – horizontal and vertical 14 F.24 MarineTLO: A Top-Level 4.3 Universal commitments 33 Ontology for the Marine Domain 106 5 Assessment Framework Results 37 F. 25 MIMOSA CCOM – (Common Conceptual 5.1 General choices 37 Object Model) 108 5.2 Formal structure: vertical aspects 38 F.26 OWL – Web Ontology Language 110 5.3 Formal structure: horizontal aspects 42 F.27 ProtOn – PROTo ONtology 111 5.4 Universal commitments 44 F.28 Schema.org 112 6 Summary 46 F.29 SENSUS 113 Appendix A F.30 SKOS 113 Pathway requirements for a Foundation Data F.31 SUMO 115 Model 48 F.32 TMRM/TMDM – Topic Map Reference/Data Appendix B Models 116 ISO IEC 21838-1:2019 -
Wikimedia Conferentie Nederland 2012 Conferentieboek
http://www.wikimediaconferentie.nl WCN 2012 Conferentieboek CC-BY-SA 9:30–9:40 Opening 9:45–10:30 Lydia Pintscher: Introduction to Wikidata — the next big thing for Wikipedia and the world Wikipedia in het onderwijs Technische kant van wiki’s Wiki-gemeenschappen 10:45–11:30 Jos Punie Ralf Lämmel Sarah Morassi & Ziko van Dijk Het gebruik van Wikimedia Commons en Wikibooks in Community and ontology support for the Wikimedia Nederland, wat is dat precies? interactieve onderwijsvormen voor het secundaire onderwijs 101wiki 11:45–12:30 Tim Ruijters Sandra Fauconnier Een passie voor leren, de Nederlandse wikiversiteit Projecten van Wikimedia Nederland in 2012 en verder Bliksemsessie …+discussie …+vragensessie Lunch 13:15–14:15 Jimmy Wales 14:30–14:50 Wim Muskee Amit Bronner Lotte Belice Baltussen Wikipedia in Edurep Bridging the Gap of Multilingual Diversity Open Cultuur Data: Een bottom-up initiatief vanuit de erfgoedsector 14:55–15:15 Teun Lucassen Gerard Kuys Finne Boonen Scholieren op Wikipedia Onderwerpen vinden met DBpedia Blijf je of ga je weg? 15:30–15:50 Laura van Broekhoven & Jan Auke Brink Jeroen De Dauw Jan-Bart de Vreede 15:55–16:15 Wetenschappelijke stagiairs vertalen onderzoek naar Structured Data in MediaWiki Wikiwijs in vergelijking tot Wikiversity en Wikibooks Wikipedia–lemma 16:20–17:15 Prijsuitreiking van Wiki Loves Monuments Nederland 17:20–17:30 Afsluiting 17:30–18:30 Borrel Inhoudsopgave Organisatie 2 Voorwoord 3 09:45{10:30: Lydia Pintscher 4 13:15{14:15: Jimmy Wales 4 Wikipedia in het onderwijs 5 11:00{11:45: Jos Punie -
Towards a Korean Dbpedia and an Approach for Complementing the Korean Wikipedia Based on Dbpedia
Towards a Korean DBpedia and an Approach for Complementing the Korean Wikipedia based on DBpedia Eun-kyung Kim1, Matthias Weidl2, Key-Sun Choi1, S¨orenAuer2 1 Semantic Web Research Center, CS Department, KAIST, Korea, 305-701 2 Universit¨at Leipzig, Department of Computer Science, Johannisgasse 26, D-04103 Leipzig, Germany [email protected], [email protected] [email protected], [email protected] Abstract. In the first part of this paper we report about experiences when applying the DBpedia extraction framework to the Korean Wikipedia. We improved the extraction of non-Latin characters and extended the framework with pluggable internationalization components in order to fa- cilitate the extraction of localized information. With these improvements we almost doubled the amount of extracted triples. We also will present the results of the extraction for Korean. In the second part, we present a conceptual study aimed at understanding the impact of international resource synchronization in DBpedia. In the absence of any informa- tion synchronization, each country would construct its own datasets and manage it from its users. Moreover the cooperation across the various countries is adversely affected. Keywords: Synchronization, Wikipedia, DBpedia, Multi-lingual 1 Introduction Wikipedia is the largest encyclopedia of mankind and is written collaboratively by people all around the world. Everybody can access this knowledge as well as add and edit articles. Right now Wikipedia is available in 260 languages and the quality of the articles reached a high level [1]. However, Wikipedia only offers full-text search for this textual information. For that reason, different projects have been started to convert this information into structured knowledge, which can be used by Semantic Web technologies to ask sophisticated queries against Wikipedia. -
Wiki-Metasemantik: a Wikipedia-Derived Query Expansion Approach Based on Network Properties
Wiki-MetaSemantik: A Wikipedia-derived Query Expansion Approach based on Network Properties D. Puspitaningrum1, G. Yulianti2, I.S.W.B. Prasetya3 1,2Department of Computer Science, The University of Bengkulu WR Supratman St., Kandang Limun, Bengkulu 38371, Indonesia 3Department of Information and Computing Sciences, Utrecht University PO Box 80.089, 3508 TB Utrecht, The Netherlands E-mails: [email protected], [email protected], [email protected] Pseudo-Relevance Feedback (PRF) query expansions suffer Abstract- This paper discusses the use of Wikipedia for building from several drawbacks such as query-topic drift [8][10] and semantic ontologies to do Query Expansion (QE) in order to inefficiency [21]. Al-Shboul and Myaeng [1] proposed a improve the search results of search engines. In this technique, technique to alleviate topic drift caused by words ambiguity selecting related Wikipedia concepts becomes important. We and synonymous uses of words by utilizing semantic propose the use of network properties (degree, closeness, and pageRank) to build an ontology graph of user query concept annotations in Wikipedia pages, and enrich queries with which is derived directly from Wikipedia structures. The context disambiguating phrases. Also, in order to avoid resulting expansion system is called Wiki-MetaSemantik. We expansion of mistranslated words, a query expansion method tested this system against other online thesauruses and ontology using link texts of a Wikipedia page has been proposed [9]. based QE in both individual and meta-search engines setups. Furthermore, since not all hyperlinks are helpful for QE task Despite that our system has to build a Wikipedia ontology graph though (e.g. -
Navigating Dbpedia by Topic Tanguy Raynaud, Julien Subercaze, Delphine Boucard, Vincent Battu, Frederique Laforest
Fouilla: Navigating DBpedia by Topic Tanguy Raynaud, Julien Subercaze, Delphine Boucard, Vincent Battu, Frederique Laforest To cite this version: Tanguy Raynaud, Julien Subercaze, Delphine Boucard, Vincent Battu, Frederique Laforest. Fouilla: Navigating DBpedia by Topic. CIKM 2018, Oct 2018, Turin, Italy. hal-01860672 HAL Id: hal-01860672 https://hal.archives-ouvertes.fr/hal-01860672 Submitted on 23 Aug 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Fouilla: Navigating DBpedia by Topic Tanguy Raynaud, Julien Subercaze, Delphine Boucard, Vincent Battu, Frédérique Laforest Univ Lyon, UJM Saint-Etienne, CNRS, Laboratoire Hubert Curien UMR 5516 Saint-Etienne, France [email protected] ABSTRACT only the triples that concern this topic. For example, a user is inter- Navigating large knowledge bases made of billions of triples is very ested in Italy through the prism of Sports while another through the challenging. In this demonstration, we showcase Fouilla, a topical prism of Word War II. For each of these topics, the relevant triples Knowledge Base browser that offers a seamless navigational expe- of the Italy entity differ. In such circumstances, faceted browsing rience of DBpedia. We propose an original approach that leverages offers no solution to retrieve the entities relative to a defined topic both structural and semantic contents of Wikipedia to enable a if the knowledge graph does not explicitly contain an adequate topic-oriented filter on DBpedia entities. -
Wikipedia Editing History in Dbpedia Fabien Gandon, Raphael Boyer, Olivier Corby, Alexandre Monnin
Wikipedia editing history in DBpedia Fabien Gandon, Raphael Boyer, Olivier Corby, Alexandre Monnin To cite this version: Fabien Gandon, Raphael Boyer, Olivier Corby, Alexandre Monnin. Wikipedia editing history in DB- pedia : extracting and publishing the encyclopedia editing activity as linked data. IEEE/WIC/ACM International Joint Conference on Web Intelligence (WI’ 16), Oct 2016, Omaha, United States. hal- 01359575 HAL Id: hal-01359575 https://hal.inria.fr/hal-01359575 Submitted on 2 Sep 2016 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Wikipedia editing history in DBpedia extracting and publishing the encyclopedia editing activity as linked data Fabien Gandon, Raphael Boyer, Olivier Corby, Alexandre Monnin Université Côte d’Azur, Inria, CNRS, I3S, France Wimmics, Sophia Antipolis, France [email protected] Abstract— DBpedia is a huge dataset essentially extracted example, the French editing history dump represents 2TB of from the content and structure of Wikipedia. We present a new uncompressed data. This data extraction is performed by extraction producing a linked data representation of the editing stream in Node.js with a MongoDB instance. It takes 4 days to history of Wikipedia pages. This supports custom querying and extract 55 GB of RDF in turtle on 8 Intel(R) Xeon(R) CPU E5- combining with other data providing new indicators and insights. -
AI/ML Finding a Doing Machine
Digital Readiness: AI/ML Finding a doing machine Gregg Vesonder Stevens Institute of Technology http://vesonder.com Three Talks • Digital Readiness: AI/ML, The thinking system quest. – Artificial Intelligence and Machine Learning (AI/ML) have had a fascinating evolution from 1950 to the present. This talk sketches the main themes of AI and machine learning, tracing the evolution of the field since its beginning in the 1950s and explaining some of its main concepts. These eras are characterized as “from knowledge is power” to “data is king”. • Digital Readiness: AI/ML, Finding a doing machine. – In the last decade Machine Learning had a remarkable success record. We will review reasons for that success, review the technology, examine areas of need and explore what happened to the rest of AI, GOFAI (Good Old Fashion AI). • Digital Readiness: AI/ML, Common Sense prevails? – Will there be another AI Winter? We will explore some clues to where the current AI/ML may reunite with GOFAI (Good Old Fashioned AI) and hopefully expand the utility of both. This will include extrapolating on the necessary melding of AI with engineering, particularly systems engineering. Roadmap • Systems – Watson – CYC – NELL – Alexa, Siri, Google Home • Technologies – Semantic web – GPUs and CUDA – Back office (Hadoop) – ML Bias • Last week’s questions Winter is Coming? • First Summer: Irrational Exuberance (1948 – 1966) • First Winter (1967 – 1977) • Second Summer: Knowledge is Power (1978 – 1987) • Second Winter (1988 – 2011) • Third Summer (2012 – ?) • Why there might not be a third winter! Henry Kautz – Engelmore Lecture SYSTEMS Winter 2 Systems • Knowledge is power theme • Influence of the web, try to represent all knowledge – Creating a general ontology organizing everything in the world into a hierarchy of categories – Successful deep ontologies: Gene Ontology and CML Chemical Markup Language • Indeed extreme knowledge – CYC and Open CYC – IBM’s Watson Upper ontology of the world Russell and Norvig figure 12.1 Properties of a subject area and how they are related Ferrucci, D., et.al. -
Conceptualization and Visual Knowledge Organization: a Survey of Ontology-Based Solutions
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Repository of the Academy's Library CONCEPTUALIZATION AND VISUAL KNOWLEDGE ORGANIZATION: A SURVEY OF ONTOLOGY-BASED SOLUTIONS A. Benedek1, G. Lajos2 1 Hungarian Academy of Sciences (HUNGARY) 2 WikiNizer (UNITED KINGDOM) Abstract Conceptualization and Visual Knowledge Organization are overlapping research areas of Intellect Augmentation with hot topics at their intersection which are enhanced by proliferating issues of ontology integration. Knowledge organization is more closely related to the content of concepts and to the actual knowledge items than taxonomic structures and ‘ontologies’ understood as “explicit specifications of a conceptualization” (Gruber). Yet, the alignment of the structure and relationship of knowledge items of a domain of interest require similar operations. We reconsider ontology-based approaches to collaborative conceptualization from the point of view of user interaction in the context of augmented visual knowledge organization. After considering several formal ontology based approaches, it is argued that co-evolving visualization of concepts are needed both at the object and meta-level to handle the social aspects of learning and knowledge work in the framework of an Exploratory Epistemology. Current systems tend to separate the conceptual level from the content, and the context of logical understanding from the intent and use of concepts. Besides, they usually consider the unification of individual contributions in a ubiquitous global representation. As opposed to this God's eye view we are exploring new alternatives that aim to provide Morphic-like live, tinkerable, in situ approaches to conceptualization and its visualization and to direct manipulation based ontology authoring capabilities. -
Easychair Preprint Digital Curation with Semantic Mediawiki And
EasyChair Preprint № 1853 Digital Curation with Semantic MediaWiki and Wikidata in numismatic research Bernhard Krabina EasyChair preprints are intended for rapid dissemination of research results and are integrated with the rest of EasyChair. November 5, 2019 Digital Curation with Semantic MediaWiki and Wikidata in numismatic research Bernhard Krabina1[0000-0002-6871-3037] 1 KDZ – Centre for Public Administration Research, Guglgasse 13, 1110 Vienna, Austria [email protected] Abstract. The FINA wiki is a Semantic MediaWiki (SMW) supporting numis- matic research showing how digital curation can be facilitated. It focuses on cu- ration of non-published material such as letters or manuscript sources scattered over archives and libraries across the world and previously neglected by scholars. A vast number of knowledge visualisations like maps, timelines, charts, word clouds or flow charts provide researchers with better support for handling large amount of content. With a simple mechanism, SMW can query Wikidata and users adding content can decide to use the suggested Wikidata IDs as point of reference. By leveraging Semantic Web standards, SMW instances can ensure long-term viability of valuable digital content. Keywords: Digital Curation, Semantic MediaWiki, Wikis, Wikidata, Semantic Web, Knowledge Visualisation, Collaboration, Metadata, Controlled Vocabu- laries, Numismatics. 1 Introduction 1.1 Digital curation Librarians and archivists collect valuable information, organize it, keep it in usable condition and provide access to users. Though the forms of content have changed throughout the years from handwritten items and published books now to digital con- tent, the purpose of librarianship and archiving has always been to collect and provide effective access to curated information. -
What You Say Is Who You Are. How Open Government Data Facilitates Profiling Politicians
What you say is who you are. How open government data facilitates profiling politicians Maarten Marx and Arjan Nusselder ISLA, Informatics Institute, University of Amsterdam Science Park 107 1098XG Amsterdam, The Netherlands Abstract. A system is proposed and implemented that creates a lan- guage model for each member of the Dutch parliament, based on the official transcripts of the meetings of the Dutch Parliament. Using ex- pert finding techniques, the system allows users to retrieve a ranked list of politicians, based on queries like news messages. The high quality of the system is due to extensive data cleaning and transformation which could have been avoided when it had been available in an open machine readable format. 1 Introduction The Internet is changing from a web of documents into a web of objects. Open and interoperable (linkable) data are crucial for web applications which are build around objects. Examples of objects featuring prominently is (mashup) websites are traditional named entities like persons, products, organizations [6,4], but also events and unique items like e.g. houses. The success of several mashup sites is simply due to the fact that they provide a different grouping of already (freely) available data. Originally the data could only be grouped by documents; the mashup allows for groupings by objects which are of interest in their specific domain. Here is an example from the political domain. Suppose one wants to know more about Herman van Rompuy, the new EU “president” from Belgium. Being a former member of the Belgium parliament and several governments, an im- portant primary source of information are the parliamentary proceedings. -
End-User Story Telling with a CIDOC CRM- Based Semantic Wiki
End-user storytelling with a CIDOC CRM-based semantic wiki Vincent Ribaud (reviewed by Patrick Le Bœuf) Département d'Informatique - U. B. O. Brest, France [email protected] Abstract: This paper presents the current state of an experiment intended to use the CIDOC CRM as a knowledge representation language. STEM freshers freely constitute groups of 2 to 4 members and choose a theme; groups have to model, structure, write and present a story within a web-hosted semantic wiki. The main part of the CIDOC CRM is used as an ontological core where students are hanging up classes and properties of the domain related to the story. The hypothesis is made that once the entry ticket has been paid, the CRM guides the end-user in a fairly natural manner for reading - and writing - the story. The intermediary assessment of the wikis allowed us to detect confusion between immaterial work and (physical) realisation of the work; and difficulty in having event-centred modelling. Final assessment results are satisfactory but may be improved. Some groups did not acquire modelling abilities - although this is a central issue in a semantic web course. Results also indicate that the scope of the course (semantic web) is somewhat too ambitious. This experience was performed in order to attract students to computer science studies but it did not produce the expected results. It did however succeed in arousing student interest, and it may contribute to the dissemination of ontologies and to making CIDOC CRM widespread. CIDOC 2010 ICOM General Conference Shanghai, China 2010-11-08 – 11-10 Introduction This paper presents the current state of an experiment intended to use the CIDOC CRM as a knowledge representation language inside a semantic wiki. -
Toward the Use of Upper Level Ontologies
Toward the use of upper level ontologies for semantically interoperable systems: an emergency management use case Linda Elmhadhbi, Mohamed Hedi Karray, Bernard Archimède To cite this version: Linda Elmhadhbi, Mohamed Hedi Karray, Bernard Archimède. Toward the use of upper level ontolo- gies for semantically interoperable systems: an emergency management use case. 9th Conference on Interoperability for Enterprise Systems and Applications I-ESA2018 Conference, Mar 2018, Berlin, Germany. pp.1-10. hal-02359706 HAL Id: hal-02359706 https://hal.archives-ouvertes.fr/hal-02359706 Submitted on 12 Nov 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Open Archive Toulouse Archive Ouverte (OATAO ) OATAO is an open access repository that collects the wor of some Toulouse researchers and ma es it freely available over the web where possible. This is an author's version published in: http://oatao.univ-toulouse.fr/22794 Official URL : https://doi.org/10.1007/978-3-030-13693-2 To cite this version: Elmhadhbi, Linda and Karray, Mohamed Hedi and Archimède, Bernard Toward the use of upper level ontologies for semantically interoperable systems: an emergency management use case. ( In Press: 2018) In: 9th Conference on Interoperability for Enterprise Systems and Applications I-ESA2018 Conference, 23 March 2018 - 19 March 2018 (Berlin, Germany).