FOUNDATIONS OF TECHNOLOGIES

Linked Data and Applications

Sebastian Rudolph Agenda

1 Linked (Open) Data

2 Semantic Web and HTML RDFa Google Knowledge Graph

3 OWL Applications OWL DL Application EDF Energy OWL Profile Application BBC World Cup Semantic Technologies in the Pharmaceutical Industry

4 Summary

TU Dresden Foundations of Semantic Web Technologies slide 2 of 51 Agenda

1 Linked (Open) Data

2 Semantic Web and HTML RDFa Microformats Google Knowledge Graph

3 OWL Applications OWL DL Application EDF Energy OWL Profile Application BBC World Cup Semantic Technologies in the Pharmaceutical Industry

4 Summary

TU Dresden Foundations of Semantic Web Technologies slide 3 of 51 Data in the Web

• more and more data is available in the Web for programmatic access • often specified using Semantic Web Standards, e.g., the following Linking Open Data (LOD) Initiative http://www.w3.org/wiki/SweoIG/TaskForces/ CommunityProjects/LinkingOpenData • using APIs, e.g. via JSON/REST

• Semantic Web technologies simplify the integration of data from different sources • combination of data leads to deeper insights

TU Dresden Foundations of Semantic Web Technologies slide 4 of 51 in the Web 01.05.2007

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

TU Dresden Foundations of Semantic Web Technologies slide 5 of 51 Linked Data in the Web 31.03.2008

ECS Sem- South- Web- ampton Central updated Music- Doap- Audio- brainz space Flickr Scrobbler QDOS exporter SIOC profiles

BBC BBC Magna- SW Later + John Onto- tune Conference Jamendo Peel world TOTP FOAF Corpus profiles Open- Guides Geo- names Revyu

DBpedia US RDF Book Census Mashup World Data NEW! Fact- DBLP book lingvoj riese Berlin NEW! RKB Euro- Explorer stat flickr Gov- Wiki- Open wrappr Track company Cyc DBLP Hannover W3C Project WordNet Guten- berg

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

TU Dresden Foundations of Semantic Web Technologies slide 6 of 51 Linked Data in the Web 14.07.2009

ECS South- Sem- Wiki- Surge ampton BBC LIBRIS Web- company Playcount Radio Central RDF Data ohloh Resex Doap- Buda- Music- Semantic ReSIST space pest Eurécom brainz Audio- Flickr Web.org Project BME MySpace Scrobbler QDOS exporter SW Wiki Conference Wrapper IRIT Corpus Toulouse

RAE National BBC BBC Crunch 2001 Science FOAF SIOC ACM BBC Music Later + John Base Revyu Foundation Jamendo Peel profiles Sites TOTP Open- Guides DBLP RKB Project flickr Geo- Pub Euro- Guten- wrappr Explorer Guide names stat Virtuoso Pisa CORDIS berg Sponger eprints BBC Programmes Open Calais RKB World Linked riese ECS Fact- New- Magna- MDB South- IEEE book tune ampton castle RDF Book Mashup Linked DBpedia GeoData lingvoj Freebase US CiteSeer LAAS- CNRS Census W3C DBLP IBM Data WordNet Hannover UniRef GEO Species DBLP Gov- UMBEL Berlin Track Reactome LinkedCT UniParc Open Taxonomy Drug Cyc PROSITE Daily Bank Med Pub GeneID Chem Homolo KEGG UniProt Gene Pfam ProDom CAS Disea- Gene some ChEBI Ontology Symbol OMIM Inter Pro UniSTS PDB HGNC MGI PubMed As of July 2009

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

TU Dresden Foundations of Semantic Web Technologies slide 7 of 51 Linked Data in the Web 22.09.2010

Sussex St. Reading Andrews NDL Audio- Resource Lists subjects t4gm MySpace scrobbler Lists Moseley (DBTune) (DBTune) RAMEAU Folk NTU SH lobid Resource GTAA Plymouth Organi- Reading Lists Lists sations Music ECS Magna- The Open Brainz Music Library LCSH South- DB tune (Data LIBRIS Brainz lobid ampton Ulm Tropes Incubator) (zitgist) Man- Resources EPrints chester Surge biz. Music Reading RISKS Radio The Open ECS data. Brainz Lists John Discogs Library PSH Gem. South- gov.uk (DBTune) UB Peel FanHubz (Data In- (Talis) Norm- Mann- ampton (DB cubator) datei heim RESEX Tune) Jamendo Poké- DEPLOY Popula- Last.fm tion (En- pédia Artists Last.FM Linked RDF AKTing) research EUTC (DBTune) (rdfize) LCCN VIAF Book Wiki data.gov Eurécom Produc- P20 Mashup semantic Pisa NHS .uk classical tions Pokedex web.org (EnAKTing) (DB Mortality Tune) PBAC ECS (En- MARC (RKB AKTing) BBC Budapest Codes Explorer) OpenEI Program BBC Lotico Revyu Energy education List Semantic OAI (En- CO2 data.gov mes Music Crunch SW (En- Chronic- Linked Dog AKTing) .uk NSZL Base AKTing) ling Event- MDB RDF Food Catalog IRIT America Media ohloh BBC DBLP Good- ACM IBM Ord- Wildlife BibBase Recht- win (RKB nance Openly Finder Explorer) Local spraak. Family DBLP legislation Survey Tele- New VIVO UF nl graphis flickr (L3S) New- .gov.uk York VIVO Times wrappr OpenCal castle URI Indiana RAE2001 UK Post- Burner ais DBLP codes statistics (FU VIVO CiteSeer Roma data.gov LOIUS Taxon Berlin) IEEE iServe Cornell .uk World Concept Geo data Fact- ESD Names OS dcs stan- book dotAC reference Linked Data Freebase Project dards data.gov NASA (FUB) for Intervals Guten- .uk (Data GESIS Course- transport DBpedia berg STW ePrints CORDIS Incu- ware data.gov (FUB) bator) Fishes ERA .uk UN/ of Texas Geo Uberblic LOCODE Euro- Species The stat TCM SIDER Pub KISTI (FUB) lite Gene STITCH JISC London Geo Chem KEGG DIT LAAS Gazette TWC LOGD Linked Daily Drug Eurostat OBO Data UMBEL lingvoj Med (es) Disea- YAGO Medi some Care NSF ChEBI KEGG KEGG Linked Linked Drug Cpd KEGG GovTrack rdfabout CT Pathway Glycan Sensor Data Open Bank US SEC (Kno.e.sis) riese Uni Reactome Cyc Lexvo Path- totl.net way Pfam PDB Semantic HGNC XBRL WordNet KEGG KEGG Linked Taxo- CAS Reaction Twarql (VUA) UniProt Enzyme rdfabout EUNIS Open nomy US Census Numbers PRO- ProDom SITE Chem2 UniRef Bio2RDF Climbing WordNet SGD Homolo (W3C) Affy- Gene Linked Cornetto GeoData metrix PubMed UniParc Gene Ontology GeneID Airports Product DB UniSTS MGI Gen Bank OMIM InterPro As of September 2010 Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

TU Dresden Foundations of Semantic Web Technologies slide 8 of 51 Linked Data in the Web 19.09.2011

Linked Audio LOV User Slideshare tags2con Feedback 2RDF delicious Moseley Scrobbler Bricklink Sussex GTAA Folk (DBTune) Reading St. Magna- Klapp- Lists Andrews tune stuhl- Resource NTU DB club Lists Resource Lotico Semantic yovisto Tropes John Music Man- Lists Music Tweet chester Hellenic Peel Brainz NDL Brainz Reading FBD (DBTune) (Data subjects EUTC (zitgist) Lists Open Incubator) Linked t4gm Hellenic Produc- Library Open Surge Crunch- info PD tions RDF Discogs base Library Radio ohloh Ontos Source Code Crime (Data Plymouth (Talis) News Ecosystem Reading RAMEAU LEM Reports business Incubator) Portal Lists Crime data.gov. Music Jamendo Linked Data SH UK (En- uk Brainz (DBtune) LinkedL Ox AKTing) ntnusc FanHubz gnoss SSW CCN Points (DBTune) Last.FM Poké- Thesaur Thesau- Popula- artists pédia Didactal us rus W LIBRIS tion (En- patents (DBTune) Last.FM ia theses. LCSH Rådata reegle AKTing) research MARC data.gov. data.go (rdfize) my fr Codes nå! NHS Good- Ren. uk v.uk Experi- List (En- Classical win flickr Energy (DB Pokedex ment Norwe- AKTing) Mortality Family wrappr Sudoc PSH Genera- BBC Tune) gian tors (En- Program AKTing) MeSH mes semantic IdRef GND CO2 educatio OpenEI BBC web.org Energy SW Sudoc ndlna Emission n.data.g Music (En- Chronic- Linked Dog VIAF EEA (En- ov.uk UB AKTing) ling Event MDB Portu- Food AKTing) guese Mann- Europeana BBC America Media DBpedia Calames heim Wildlife Deutsche Open Ord- Recht- Revyu DDC nance Openly spraak. Finder Bio- lobid Election RDF graphie legislation Survey Local nl data Ulm Resources NSZL Swedish EU Data Tele- New data.gov.uk Book Catalog Open Insti- Project graphis York bnf.fr Open Mashup Cultural tutions Times URI Greek P20 UK Post- Heritage Burner DBpedia Calais codes statistics ECS Wiki lobid data.gov. Taxon South- GovWILD LOIUS iServe BNB Organi- Brazilian uk Concept ECS ampton sations Geo World OS BibBase STW GESIS Poli- ESD South- ECS Names Fact- (RKB ticians stan- reference ampton data.gov.uk book Freebase Explorer) Budapest dards data.gov. EPrints intervals NASA Lichfield uk (Data Project OAI transport DBpedia data Pisa Incu- Guten- Spen- data.gov. dcs RESEX Scholaro- bator) Fishes berg DBLP ISTAT ding uk DBLP meter Scotland Geo Immi- of Texas (FU (L3S) Pupils & Uberblic DBLP gration Species Berlin) IRIT Exams Euro- dbpedia data- TCM (RKB London stat open- ACM lite Gene Explorer) IBM NVD Traffic Gazette (FUB) Geo ac-uk Eurostat DIT Scotland TWC LOGD Linked Daily UN/ Data UMBEL ERA Data Med LOCODE DEPLOY Gov.ie YAGO CORDIS lingvoj Disea- New- (RKB some SIDER RAE2001 castle LOCAH CORDIS Explorer) Linked Eurécom Eurostat Drug CiteSeer Roma (FUB) Sensor Data GovTrack (Ontology (Kno.e.sis) Open Bank Pfam Course- Central) riese Enipedia Cyc Lexvo LinkedCT ware Linked UniProt PDB VIVO EDGAR EURES US SEC Indiana ePrints dotAC (Ontology IEEE (rdfabout) totl.net Central) WordNet RISKS UniProt US Census (VUA) Taxono EUNIS Twarql (Bio2RDF) HGNC Semantic (rdfabout) Cornetto my VIVO PRO- FTS XBRL ProDom STITCH Cornell LAAS SITE Scotland KISTI NSF LODE Geo- GeoWord graphy Net WordNet WordNet JISC (W3C) Climbing Linked (RKB Affy- KEGG SMC Explorer) SISVU Pub Drug VIVO UF Piedmont GeoData metrix ECCO- Finnish Journals PubMed Gene SGD Chem Accomo- TCP Munici- El AGROV Ontology dations Alpine bible palities Viajero OC Tourism Ski KEGG ontology Austria Ocean Enzyme PBAC GEMET ChEMBL Italian Drilling Metoffice KEGG Open OMIM public Codices AEMET Weather Linked MGI Pathway Forecasts Data InterPro GeneID schools Thesau- Open KEGG Turismo EARTh rus Colors Reaction de Zaragoza Product Smart KEGG Weather DB Link Medi Glycan Janus Stations Care Product UniParc UniRef KEGG AMP Types Italian UniSTS Com- Yahoo! Homolo Airports Ontology Museums Google pound Geo Gene Art National Chem2 Planet wrapper Radio- Bio2RDF activity UniPath JP Sears Open Linked OGOLOD way Corpo- Amster- Reactome medu- Open rates dam Museum cator Numbers As of September 2011 Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

TU Dresden Foundations of Semantic Web Technologies slide 9 of 51 Linked Data in the Web 30.08.2014

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/ TU Dresden Foundations of Semantic Web Technologies slide 10 of 51 Linked Data Principles*

Linked Data consists mainly of a number of principles for publishing data in the Web: 1 Use URIs as names for things – documents, people, locations, concepts, etc. 2 Use HTTP URIs so that people can look up those names 3 When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4 Include links to other URIs, so that they can discover more things.

*http://www.w3.org/DesignIssues/LinkedData.html

TU Dresden Foundations of Semantic Web Technologies slide 11 of 51 5 Star Linked (Open) Data

8 Available on the web (whatever format) but with an open licence, to be Open Data 88 Available as machine-readable structured data (e.g. excel instead of image scan of a table) 888 as (2) plus non-proprietary format (e.g. CSV instead of excel) 8888 All the above plus, use open standards from W3C (RDF and SPARQL) to identify things, so that people can point at your stuff 88888 All the above, plus: Link your data to other people’s data to provide context http://www.w3.org/DesignIssues/LinkedData.html

TU Dresden Foundations of Semantic Web Technologies slide 12 of 51 De-Referencing of an IRI

• These IRIs can then be used also in other documents • For example in the document : <#denise> fam:child <#edwin>, . • One can then extract the URL from and find information about #carol from there

TU Dresden Foundations of Semantic Web Technologies slide 13 of 51 Connection between the IRI of a Thing and IRI of a Source

User Agent http://www.polleres.net/foaf.rdf#me

HTTP RDF GET

Web Server http://www.polleres.net/foaf.rdf

TU Dresden Foundations of Semantic Web Technologies slide 14 of 51 Connection between the IRI of a Thing and IRI of a Source

User Agent http://dbpedia.org/resource/Gordon_Brown

HTTP HTTP 303* RDF GET GET

Web Server http://dbpedia.org/data/Gordon_Brown

http://dbpedia.org/page/Gordon_Brown

*HTTP Response Code 303: See Other TU Dresden Foundations of Semantic Web Technologies slide 15 of 51 Representations

• Information resources can have different representations • A representation is a stream of bytes in a certain format such as HTML, RDF/XML or JPEG • Example: an invoice is an information resource that might be represented as printable PDF or as RDF document • A single resource can have many different representations, e.g., in different formats, resolutions or languages

TU Dresden Foundations of Semantic Web Technologies slide 16 of 51 HTTP Content Negotiation

• Content Negotiation (CN, conneg) is the process of selecting the best representation for a query if several representations are available

$ curl -I -H "Accept: application/rdf+" http://dbpedia.org/resource/Gordon_Brown

$ curl -I -H "Accept: text/html" http://dbpedia.org/resource/Gordon_Brown curl – Tool to send requests to a server or receive responses -H Custom header to pass to server -I Show document info only

TU Dresden Foundations of Semantic Web Technologies slide 17 of 51 HTTP Content Negotiation

$ curl -I -H "Accept: text/html" http://dbpedia.org/resource/Gordon_Brown HTTP/1.1 303 See Other Date: Mon, 04 Feb 2013 10:37:10 GMT Content-Type: text/html; charset=UTF-8 Content-Length: 0 Connection: keep-alive Server: Virtuoso/06.04.3132 (Linux) [...] Accept-Ranges: bytes Location: http://dbpedia.org/page/Gordon_Brown

TU Dresden Foundations of Semantic Web Technologies slide 18 of 51 HTTP Content Negotiation

$ curl -I -H "Accept: application/rdf+xml" http://dbpedia.org/resource/Gordon_Brown HTTP/1.1 303 See Other Date: Mon, 04 Feb 2013 10:36:59 GMT Content-Type: application/rdf+xml; qs=0.95 Content-Length: 0 Connection: keep-alive Server: Virtuoso/06.04.3132 (Linux) [...] Accept-Ranges: bytes TCN: choice Vary: negotiate,accept Content-Location: /data/Gordon_Brown.xml Link:

Linked Audio LOV User Slideshare tags2con Feedback 2RDF delicious Moseley Scrobbler Bricklink Sussex GTAA Folk (DBTune) Reading St. Magna- Klapp- Lists Andrews tune stuhl- Resource NTU DB club Lists Resource Lotico Semantic yovisto Tropes John Music Man- Lists Music Tweet chester Hellenic Peel Brainz NDL Brainz Reading FBD (DBTune) (Data subjects EUTC (zitgist) Lists Open Incubator) Linked t4gm Hellenic Produc- Library Open Surge Query Crunch- info PD tions RDF Discogs base Library Radio ohloh Ontos Source Code Crime (Data Plymouth (Talis) News Ecosystem Reading RAMEAU LEM Reports business Incubator) Portal Lists Crime data.gov. Music Jamendo Linked Data SH UK (En- uk Brainz (DBtune) LinkedL Ox AKTing) FanHubz gnoss ntnusc (DBTune) SSW CCN

Points Response Last.FM Poké- Thesaur Thesau- Popula- artists pédia Didactal us rus W LIBRIS tion (En- patents (DBTune) Last.FM ia theses. LCSH Rådata reegle AKTing) research MARC data.gov. data.go (rdfize) my fr Codes nå! NHS Good- Ren. uk v.uk Experi- List (En- Classical win flickr Energy (DB Pokedex ment Norwe- AKTing) Mortality Family wrappr Sudoc PSH Genera- BBC Tune) gian tors (En- Program AKTing) MeSH mes semantic IdRef GND CO2 educatio OpenEI BBC web.org Energy SW Sudoc ndlna Emission n.data.g Music (En- Chronic- Linked Dog VIAF EEA (En- ov.uk UB AKTing) ling Event MDB Portu- Food AKTing) guese Mann- Europeana BBC America Media DBpedia Calames heim Wildlife Deutsche Open Ord- Recht- Revyu DDC nance Openly spraak. Finder Bio- lobid Election RDF graphie legislation Survey Local nl data Ulm Resources NSZL Swedish EU Data Tele- New data.gov.uk Book Catalog Open Insti- Project graphis York bnf.fr Open Mashup Cultural tutions Times URI Greek P20 UK Post- Heritage Burner DBpedia Calais codes statistics ECS Wiki lobid data.gov. Taxon South- GovWILD LOIUS iServe BNB Organi- Brazilian uk Concept ECS ampton sations Geo World OS BibBase STW GESIS Poli- ESD South- ECS Names Fact- (RKB ticians stan- reference ampton data.gov.uk book Freebase Explorer) Budapest dards data.gov. EPrints intervals NASA Lichfield uk (Data Project OAI transport DBpedia data Pisa Incu- Guten- Spen- data.gov. dcs RESEX Scholaro- bator) Fishes berg DBLP ISTAT ding uk DBLP meter Scotland Geo Immi- of Texas (FU (L3S) Pupils & Uberblic DBLP gration Species Berlin) IRIT Exams Euro- dbpedia data- TCM (RKB London stat open- ACM lite Gene Explorer) IBM NVD Traffic Gazette (FUB) Geo ac-uk Eurostat DIT Scotland TWC LOGD Linked Daily UN/ Data UMBEL ERA Data Med LOCODE DEPLOY Gov.ie YAGO CORDIS lingvoj Disea- New- (RKB some SIDER RAE2001 castle LOCAH CORDIS Explorer) Linked Eurécom Eurostat Drug CiteSeer Roma (FUB) Sensor Data GovTrack (Ontology (Kno.e.sis) Open Bank Pfam Course- Central) riese Enipedia Cyc Lexvo LinkedCT ware Linked UniProt PDB VIVO EDGAR EURES US SEC Indiana ePrints dotAC (Ontology IEEE (rdfabout) totl.net Central) WordNet RISKS UniProt US Census (VUA) Taxono EUNIS Twarql (Bio2RDF) HGNC Semantic (rdfabout) Cornetto my VIVO PRO- FTS XBRL ProDom STITCH Cornell LAAS SITE Scotland KISTI NSF LODE Geo- GeoWord graphy Net WordNet WordNet JISC (W3C) Climbing Linked (RKB Affy- KEGG SMC Explorer) SISVU Pub Drug VIVO UF Piedmont GeoData metrix ECCO- Finnish Journals PubMed Gene SGD Chem Accomo- TCP Munici- El AGROV Ontology dations Alpine bible palities Viajero OC Tourism Ski KEGG ontology Austria Ocean Enzyme PBAC GEMET ChEMBL Italian Drilling Metoffice KEGG Open OMIM public Codices AEMET Weather Linked MGI Pathway Forecasts Data InterPro GeneID schools Thesau- Open KEGG Turismo EARTh rus Colors Reaction de Zaragoza Product Smart KEGG Weather DB Link Medi Glycan Janus Stations Care Product UniParc UniRef KEGG AMP Types Italian UniSTS Com- Yahoo! Homolo Airports Ontology Museums Google pound Geo Gene Art National Chem2 Planet wrapper Radio- Bio2RDF activity UniPath JP Sears Open Linked OGOLOD way Corpo- Amster- Reactome medu- Open rates dam Museum cator Numbers As of September 2011

TU Dresden Foundations of Semantic Web Technologies slide 20 of 51 Linked Data Summary

Semantic technologies simplify the access to data: • Facts regarding Berlin? – http://de.dbpedia.org/resource/Berlin • Information about Queen – BBC Music: http://www.bbc.co.uk/music/artists/0383dadf-2a4e- 4d10-a46a-e9e041da8eb3 – MusicBrainz: http://musicbrainz.org/artist/0383dadf-2a4e-4d10- a46a-e9e041da8eb3.html • Data integration gives additional benefits

TU Dresden Foundations of Semantic Web Technologies slide 21 of 51 Linked Data Tools

• Tabulator Browser PlugIn/Ajax Scripts: http://www.w3.org/2005/ajar/tab • Semantic Web Client Library (Querying the complete Semantic Web with SPARQL): http://wifo5-03.informatik.uni-mannheim.de/ bizer/ng4j/semwebclient/ • D2R Server: Accessing with SPARQL and as Linked Data: http://d2rq.org/d2r-server • Data cleaning & linking to Freebase: https://github.com/OpenRefine (was Google Refine) • RDF Export for Google Refine: http: //lab.linkeddata.deri.ie/2010/grefine-rdf-extension/

TU Dresden Foundations of Semantic Web Technologies slide 22 of 51 Agenda

1 Linked (Open) Data

2 Semantic Web and HTML RDFa Microformats Google Knowledge Graph

3 OWL Applications OWL DL Application EDF Energy OWL Profile Application BBC World Cup Semantic Technologies in the Pharmaceutical Industry

4 Summary

TU Dresden Foundations of Semantic Web Technologies slide 23 of 51 RDFa Example

• Integration of RDF in (X)HTML documents All content on this site is licensed under a Creative Commons License. versus All content on this site is licensed under a Creative Commons License. license

TU Dresden Foundations of Semantic Web Technologies slide 24 of 51 RDFa Example 2

The trouble with Bob

Alice

...
versus

The trouble with Bob

Alice

...

TU Dresden Foundations of Semantic Web Technologies slide 25 of 51 RDFa Example 3

Alice Birpemswick

Email: [email protected]

Phone: +1 617.555.7332

versus

Alice Birpemswick

Email: [email protected]

Phone: +1-617-555-7332

TU Dresden Foundations of Semantic Web Technologies slide 26 of 51 Applications of RDFa

• Google filters, for example, RDFa terms and uses them to improve the presentation of search results

TU Dresden Foundations of Semantic Web Technologies slide 27 of 51 Applications of RDFa

• Google filters, for example, RDFa terms and uses them to improve the presentation of search results

TU Dresden Foundations of Semantic Web Technologies slide 28 of 51 Microformats

Microformats are simple and open data formats based on existing standards (XHTML) • Uses POSH (Plain Old Semantic HTML), i.e., HTML Tags that do not specify the presentation (bold, i), but that have semantics (abbr, acronym, title,...) • Use of semantic CSS class names – not: ... – but: ... • Special vocabularies for the markup in some domains • Considers the roles and semantics of the elements

TU Dresden Foundations of Semantic Web Technologies slide 29 of 51 hRecipe – for Recipes

• For the semantic annotation of web pages for recipes • Allows for – searching for recipes with certain ingredients – automatic grouping of recipes – finding quick recipes (short preparation time) • Mapping into RDFa exists (hrecipe-rdf)

Example

French Fries

French Fries ...

Contributed by Tom and the Cooky Gang .

...

TU Dresden Foundations of Semantic Web Technologies slide 30 of 51 hRecipe – Microformat for Recipes

Example ...

Published 14. Oct 2008

Ingredients

  • 500 gr potatoes.
  • ...
...

TU Dresden Foundations of Semantic Web Technologies slide 31 of 51 Example in the Web

• The Recipe Depository http://www.therecipedepository.com/ • SAPO Sabores http://sabores.sapo.pt/ • ITV Food http://www.itv.com/food/ • Epicurious.com http://www.epicurious.com/ • http://foodnetwork.com/ • Plan to Eat http://www.plantoeat.com/recipe_book • essen & trinken http://www.essen-und-trinken.de/ hRecipe-conform meta data in RDF • ...

TU Dresden Foundations of Semantic Web Technologies slide 32 of 51 TU Dresden Foundations of Semantic Web Technologies slide 33 of 51 Micro Data and schema.org

• Joint project of Google, Microsoft and Yahoo! • Compromise between the extensibility of RDFa and the simplicity of microformats • Goal: better understanding of the contents of web pages and, as a result, a better presentation of search results

TU Dresden Foundations of Semantic Web Technologies slide 34 of 51 Google Knowledge Graph

• The US version of Google now also uses structured data (from Freebase) • For the disambiguation of search terms and direct presentation of relevant information • Considers search terms no longer as simple strings (but as designator for things)

TU Dresden Foundations of Semantic Web Technologies slide 35 of 51 TU Dresden Foundations of Semantic Web Technologies slide 36 of 51 TU Dresden Foundations of Semantic Web Technologies slide 37 of 51 TU Dresden Foundations of Semantic Web Technologies slide 38 of 51 Agenda

1 Linked (Open) Data

2 Semantic Web and HTML RDFa Microformats Google Knowledge Graph

3 OWL Applications OWL DL Application EDF Energy OWL Profile Application BBC World Cup Semantic Technologies in the Pharmaceutical Industry

4 Summary

TU Dresden Foundations of Semantic Web Technologies slide 39 of 51 Application Areas of OWL

• OWL DL mainly is used outside the Semantic Web • Many applications in medicine and life sciences • Terminologies are traditionally popular there • Keyword indexing of documents • Semantic annotations of research data (e.g., gene sequences) • Classification used in health records and for statistics

TU Dresden Foundations of Semantic Web Technologies slide 40 of 51 Example Ontologies in OWL

• OBO Foundry: The Open Biological and Biomedical Ontologies • BioPortal ontologies – Terms for the electronic patient record – Annotation of gene sequences – Research into new drugs • GO Gene Ontology • ICD International Classification of Diseases • FMA Formal Model of Anatomy • ...

TU Dresden Foundations of Semantic Web Technologies slide 41 of 51 Use of OWL in the EDF Energy Management Advisor

Weather

Consumption

EMA Building Tip Tip ... Tip 50 parameters

TU Dresden Foundations of Semantic Web Technologies slide 42 of 51 EMA Ontology

• Ontology models the domain and situation of customers • First modelling was strictly following existing binary decision diagrams (simplified) • Reasoner “recognises” the situation of a customer • Certain situations correspond to tips • Original ontology used nominals and role chains • Was difficult to comprehend and reasoner performance was not optimal

TU Dresden Foundations of Semantic Web Technologies slide 43 of 51 EMA Ontology Improvements

• Customers are directly modelled in an ABox • Per customer only simple ABox facts are loaded • Constructors that are problematic for reasoning are avoided: nominals, role chains • TBox modelling was simplified • Ontology now allows incremental reasoning • Customers can be classified independent of each other in different reasoner instances • Used for about 30.000 customers in France

TU Dresden Foundations of Semantic Web Technologies slide 44 of 51 BBC Website for the Football World Cup 2010

• Ontology describes how facts about the world cup relate to each other • Such meta data are saved as RDF triples • For example, “Frank Lampard” is part of “England Squad” or “England Squad” competed in “Group C” of the “FIFA World Cup 2010”

TU Dresden Foundations of Semantic Web Technologies slide 45 of 51 BBC Website for the Football World Cup 2010

“The underlying publishing framework does not author content directly; rather it publishes data about the content - . The published metadata describes the world cup content at a fairly low-level of granularity, providing rich content relationships and semantic navigation. By querying this published metadata we are able to create dynamic page aggregations for teams, groups and players."

Jem Rayfield, Senior Technical Architect, BBC News and Knowledge http://www.bbc.co.uk/blogs/bbcinternet/2010/07/bbc_ world_cup_2010_dynamic_sem.html

TU Dresden Foundations of Semantic Web Technologies slide 46 of 51 BBC Website for the Football World Cup 2010

• OWL inference used to enrich the data (forward chaining) and SPARQL used for queries • Ontology contains texts contributed by journalists: stories, blogs, profiles, pictures, videos and statistics • Journalistic contributions are automatically classified (NLP techniques) and manually tagged • Statistics and game results from other sources are imported from XML and mapped to ontological concepts • Web pages are automatically assembled and contain relevant links • Approach also used for Olympia 2012

TU Dresden Foundations of Semantic Web Technologies slide 47 of 51 TU Dresden Foundations of Semantic Web Technologies slide 48 of 51 Semantic Web Technologies Meet Pharmaceutical Data

Phil Ashworth presents at the 2. European Semantic Technology Conference: http: //videolectures.net/estc08_ashworth_swtpdi/

TU Dresden Foundations of Semantic Web Technologies slide 49 of 51 Agenda

1 Linked (Open) Data

2 Semantic Web and HTML RDFa Microformats Google Knowledge Graph

3 OWL Applications OWL DL Application EDF Energy OWL Profile Application BBC World Cup Semantic Technologies in the Pharmaceutical Industry

4 Summary

TU Dresden Foundations of Semantic Web Technologies slide 50 of 51 Summary

• The amount of available machine processable data grows continuously • Semantics is needed to integrate data from different sources • Query and visualisation of data provides added value

• The processing and querying data from different sources increases the transparency and facilitate research (tests of hypotheses becomes easier)

TU Dresden Foundations of Semantic Web Technologies slide 51 of 51