The World of Knowledge Graphs in Oracle Databases

Collaboration between Semantic Web Company and Oracle

Emma Thomas Sebastian Gabler Principal Solutions Architect Director of Sales Oracle A-team Semantic Web Company

November 19th 2019

2 Confidential – © 2019 Oracle Restricted Safe Harbor Statement

The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, timing, and pricing of any features or functionality described for Oracle’s products may change and remains at the sole discretion of Oracle Corporation.

3 The World of Knowledge Graphs in Oracle Databases

Introduction to Knowledge Graphs

Oracle Database Spatial and Graph - RDF Focus

Oracle RDF4J adapter

Using PoolParty for Search, Similarity, Recommendation and Visualisation

4 Confidential – © 2019 Oracle Restricted Graphs are Everywhere

Heiko Paulheim. Journal of Web Semantics: Special Issue on Knowledge Graph Refinement. “Knowledge Graph Refinement: A Survey of Approaches and Evaluation Methods.” [September, 2016]

5 Confidential – © 2019 Oracle Restricted An Opte Project Visualisation of routing paths through a portion of the Internet

By The Opte Project - Originally from the English Wikipedia; description page is/was here., CC BY 2.5, https://commons.wikimedia.org/w/index.php?cu rid=1538544

6 What is a Knowledge Graph?

Knowledge graphs are large networks of entities, their semantic types, properties, and relationships between entities1.

Key Features: Knowledge “Things, not strings” Database Base Knowledge global unique identifiers Graph Formal structure/semantics machine processable, unambiguous Linked descriptions Graph resources are described by their connections

1. M. Kroetsch and G. Weikum. Journal of Web Semantics: Special Issue on Knowledge Graphs. http://www.websemanticsjournal.org/index.php/ps/an 7 Confidential – © 2019 Oracle Restricted nouncement/view/19 [August, 2016] Data Storage

Name Desc

Product in-a

has-a Category

SKU

Image Relational Hierarchical Graph

8 The Semantic Web and Graph Databases

“I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web – the content, links, and transactions between people and computers. A "Semantic Web", which makes this possible, has yet to emerge, but when it does, the day- to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to machines. The "intelligent agents" people have touted for ages will finally materialize”

Tim Berners-Lee, 1999

9 Confidential – © 2019 Oracle Restricted W3C Standards for Knowledge Graphs

The W3C (World Wide Web Consortium) has defined a suite of standards to support Linked Data and Knowledge Graphs.

Fundamental Concepts are:

• Globally Unique IDs: URI • Links to other resources • Standard Data Model: RDF • Standard Ontology Language: OWL • Standard Query Language: SPARQL

10 Resource Description Framework (RDF)

An RDF graph is a directed, labeled graph with some syntactic restrictions

● Edge labels are URIs ● Source vertex for an edge must be a URI ● Destination vertex is a URI or a scalar value

An edge is the atomic unit of an RDF graph – called an RDF triple

RDF Triple

11 Confidential – © 2019 Oracle Restricted Modeling Vocabularies

With the Semantic Web, vocabularies define the concepts and relationships (also referred to as “terms”) used to describe and represent an area of concern:

RDFS Resource Description Framework Schema allows you to express the relationships between things by standardizing on a flexible, triple-based format and then providing a vocabulary.

OWL Web Ontology Language is an ontology language. The semantic web standard that is used to defined ontologies (metadata sets) so that they can be used and understood in that environment.

Owls purpose is to develop ontologies that are compatible with the world wide web, where an ontology is definition and classification of concepts and entities and the relationship between them

12 Confidential – © 2019 Oracle Restricted Modeling Vocabularies

SKOS stands for Simple Knowledge Organisation System. The name SKOS was chosen to emphasise the goal of providing a simple yet powerful framework for expressing knowledge organisation systems in a machine- understandable way.

SKOS provides a standard way to represent knowledge organization systems using the Resource Description Framework (RDF). Encoding this information in RDF allows it to be passed between computer applications in an interoperable way.

13 Confidential – © 2019 Oracle Restricted SPARQL

SELECT ?name WHERE { ?s a onto:Employee . ?s dcterms:title ?name }

14 Confidential – © 2019 Oracle Restricted Recap: Description of a Knowledge Graph

Knowledge graphs provide holistic knowledge, including ▹ instance data (ground truth), either open, private, or closed ▹ schema data (vocabularies, ontologies) ▹ metadata (e.g. provenance, versioning, licensing) ▹ comprehensive taxonomies to categorize entities ▹ links between internal and external data ▹ mappings to data and documents stored in other systems and databases Source: Sören Auer, Fraunhofer IAI

15 Confidential – © 2019 Oracle Restricted The World of Knowledge Graphs in Oracle Databases

Introduction to Knowledge Graphs

Oracle Database Spatial and Graph - RDF Focus

Oracle RDF4J adapter

Using PoolParty for Search, Similarity, Recommendation and Visualisation

16 Confidential – © 2019 Oracle Restricted Database Version and Options for RDF Knowledge Graph

▪ On-premise ▪ Oracle Database Enterprise Edition with Spatial & Graph and Partitioning options

▪ Oracle Database Cloud Services ▪ DBCS (Bare Metal & VM) ▪ Exadata Cloud Service ▪ Exadata Cloud at Customer ▪ ATP-Dedicated (planned)

17 Copyright © 2019, Oracle and/or its affiliates. All rights reserved. Oracle Spatial and Graph 19c – RDF Knowledge Graph Features

• Fast bulk-load and indexing Load / • RDF view of Relational data RDF Knowledge Graph Storage • Manages over a trillion quads Leverages Oracle capabilities: • SPARQL-Jena/Fuseki RAC & Exadata scalability Query • SPARQL-in-SQL query & update • Federated query Compression & partitioning • GeoSPARQL In-Memory Column Store SQL*Loader direct path load • RDFS, OWL2 RL, EL, SKOS • Ladder-based inference Parallel DDL, DML, and query Reasoning • Incremental, parallel reasoning RMAN Backup and Recovery • User-defined rules High Availability - Data Guard + DR • Plug-in architecture Oracle Label security Analytics • OBIEE Manageability - Enterprise Manager • Oracle Advanced Analytics Logical Standby • PGX

18 Copyright © 2019, Oracle and/or its affiliates. All rights reserved. Big Data Graph Benchmark 1 Trillion Triple RDF Benchmark with Oracle Spatial and Graph

Oracle Database can load, query and inference millions of RDF graph edges per second

World’s fastest data loading performance World’s fastest query performance Worlds fastest inference performance Massive scalability: 1.08 trillion edges

Platform: Oracle Exadata X4-2 Database Machine Source: w3.org/wiki/LargeTripleStores

19 Confidential – © 2019 Oracle Restricted Oracle Spatial and Graph 19c – RDF Knowledge Graph Architecture

SQL Developer Enterprise Manager Protégé Plugin Fuseki Endpoint Cytoscape Plugin RDF Support and Other DB Tools

Support for (Java API)

SQL and PL/SQL API

RDF Bulk Loader Forward-chaining SPARQL-to-SQL SPARQL Update OWL Reasoner Query Translator Processor

Generic Relational Schema for RDF Views of Relational Data Storing RDF Data

20 Confidential – © 2019 Oracle Restricted What makes a store RDF ready?

It implements at least one of the RDF ready libraries/frameworks: • RDF4J • Apache Jena

It offers a SPARQL endpoint to query the database using SPARQL

21 Confidential – © 2019 Oracle Restricted Oracle Spatial and Graph - Apache Jena

Oracle Database Release 18c offers support for: • Apache Jena 3.1 • Apache Jena Fuseki 2.4 • Protege 5.2 Available adapters:

Source: Oracle Spatial and Graph downloads

22 Confidential – © 2019 Oracle Restricted The World of Knowledge Graphs in Oracle Databases

Introduction to Knowledge Graphs

Oracle Database Spatial and Graph - RDF Focus

Oracle RDF4J adapter

Using PoolParty for Search, Similarity, Recommendation and Visualisation

23 Confidential – © 2019 Oracle Restricted Oracle RDF4J Adapter

Collaboration work between Semantic Web Company and the Oracle A-Team to create a new adapter for the Java based RDF4J library.

RDF4J capabilities brings: • Community opportunity - RDF4J GitHub repo • Easy access to the Java community • Easy access and tested with PoolParty Semantic Suite

24 Confidential – © 2019 Oracle Restricted Oracle RDF4J Adapter

25 Confidential – © 2019 Oracle Restricted The World of Knowledge Graphs in Oracle Databases

Introduction to Oracle Spatial and Graph

Core Graph Database Features

Oracle RDF4J

Using PoolParty for Search, Similarity, Recommendation and Visualisation

26 Confidential – © 2019 Oracle Internal/Restricted/Highly Restricted Introducing Semantic Web Company

Founded in 2004 Developer & Vendor of SWC named to KMWorld’s Based in Vienna PoolParty Semantic Suite ‘100 Companies That Matter in Privately held Participating in projects with Knowledge €2.5 million Management’ in 2016, until 2019 50+ FTE funding for R&D Engineers & ISO 27001:2013 Consultants for NLP, ~35% certified Semantics and Machine revenue growth/year learning 9,000 followers

27 PoolParty Semantic Suite

28 Architecture/ Flow

Oracle RDF4J Demo

1. Oracle Database Cloud Service 2. PoolParty Thesaurus Server 3. Semantic Graph Search 4. Graph Visualisation

30 Demo - Oracle Database Cloud Service

31 Demo - PoolParty Thesaurus Editor

32 Confidential – Oracle Internal/Restricted/Highly Restricted Demo - Semantic Search

33 Demo - Semantic GraphViews

34 Confidential – Oracle Restricted 1

Oracle Database RDF Semantic Graph Database Summary

• Parallel load, inference, query • Compression & partitioning • Triple-level Label security • W3C standards compliance • Semantic Indexing of text • Enterprise Manager, Data Guard Support, High Availability • Support for Open Source (Development framework, ontology editing, visualization)

• On-premise, Oracle Database Cloud Service (DBCS), Exadata Cloud Service, Exadata Cloud at Customer Slide 35

1 [email protected] not sure about open source - the vizualisation we show here is not open source-.... Sebastian Gabler, 11/11/2019 Where does the community meet?

36 Interested in project experience, best practices, networking?

37 Questions?

38 Confidential – Oracle Restricted Thank you

Emma Thomas Sebastian Gabler Principal Solutions Architect Director of Sales Oracle A-team Semantic Web Company

39 Confidential – Oracle Restricted 40 Confidential – Oracle Internal/Restricted/Highly Restricted