Multi-agent Semantic Web Systems: RDFa
Michael Rovatsos
School of Informatics
21 February 2011

1 Intro to Microformats
2 RDFa
Michael Rovatsos (School of Informatics) Multi-agent Semantic Web Systems:RDFa 21 February 2011 1 / 22

Review

RDF(S) implements part of the vision of the SemWeb.
Obtaining network effect for SemWeb requires overcoming the ‘semantic markup bottleneck’.
Promising approaches:
- Using NLP to convert unstructured information into RDF(S)
- Microformats / RDFa for embedding RDF into web documents
- Exporting from RDBs to RDF stores

Semantic Markup Embedded in HTML, 1

Existing document web is firmly established: can we use this base to support semantic applications?
General idea: embed semantic markup into existing (X)HTML tags.
Example: hCard [markup not recovered; visible text: 0131 650-1234]
Example: RDFa [markup not recovered; visible text: 0131 650-1234]
Perspective 1: This is a regular web page, but hey, we can get semantics out of it.
Perspective 2: This is machine-readable data, but hey, humans can browse it just like the Good Old Document Web.
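
To make Perspective 1 concrete, here is a minimal Python sketch that pulls the phone number out of both kinds of markup using only the standard library. The two snippets are assumptions, since the original example markup did not survive: the `vcard` and `tel` class names follow the hCard convention, while the `v:` prefix, its namespace URL and the `v:tel` property are placeholders. `FieldExtractor` and `extract` are hypothetical helper names.

```python
from html.parser import HTMLParser

# Assumed markup; only the phone number itself comes from the slides.
HCARD = '<div class="vcard"><span class="tel">0131 650-1234</span></div>'
RDFA = ('<div xmlns:v="http://example.org/vocab#">'
        '<span property="v:tel">0131 650-1234</span></div>')


class FieldExtractor(HTMLParser):
    """Collect the text of elements whose `attr` has `value` as one of its tokens."""

    def __init__(self, attr, value):
        super().__init__()
        self.attr, self.value = attr, value
        self.depth = 0    # > 0 while we are inside a matching element
        self.found = []

    def handle_starttag(self, tag, attrs):
        if self.depth:
            self.depth += 1
        elif self.value in (dict(attrs).get(self.attr) or "").split():
            self.depth = 1
            self.found.append("")

    def handle_endtag(self, tag):
        if self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            self.found[-1] += data


def extract(html, attr, value):
    parser = FieldExtractor(attr, value)
    parser.feed(html)
    parser.close()
    return [text.strip() for text in parser.found]


print(extract(HCARD, "class", "tel"))      # microformat: semantics carried by class
print(extract(RDFA, "property", "v:tel"))  # RDFa: semantics carried by property
```

In both cases the data is the text a human reader already sees on the page; only the attribute that labels it differs.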
Semantic Markup Embedded in HTML, 2

Approach is best suited to information which fits into familiar and frequently used categories, e.g., contact information, calendar events, geo-location information, product reviews, social network information.
Why do we need more than existing HTML tags?
- In fact, this has been explored: cf. “screen scraping”, “wrapper induction”.
- But traditional HTML tags are primarily oriented towards visual rendering.

Semantic Markup Embedded in HTML, 3

What about HTML’s META tag?
META in Informatics Web Page: [markup not recovered]
But:
- META information is invisible to humans.
- Tends to fall out of synchronization with visible content.
- Has been abused by people concerned with SEO (and therefore can actually harm your Google rank).
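
The synchronization problem can be illustrated, at least crudely, in code. The sketch below lists META entries whose content never appears in the text a browser would render. The page, the `MetaAudit` class and the `stale_meta` function are all hypothetical, and plain substring matching is an assumed heuristic rather than an established test.

```python
from html.parser import HTMLParser

# Hypothetical page: one META value is echoed in the body, the other is not.
PAGE = """<html><head>
<meta name="author" content="A. Webmaster">
<meta name="description" content="Opening hours">
</head><body><h1>Contact</h1><p>Page maintained by A. Webmaster.</p></body></html>"""


class MetaAudit(HTMLParser):
    """Separate META name/content pairs from the text nodes of the page."""

    def __init__(self):
        super().__init__()
        self.meta = {}      # name -> content, taken from <meta> tags
        self.visible = []   # text nodes outside <script> and <style>
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and attrs.get("name"):
            self.meta[attrs["name"]] = attrs.get("content") or ""
        elif tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.visible.append(data)


def stale_meta(html):
    """Return META entries whose content does not occur in the visible text."""
    parser = MetaAudit()
    parser.feed(html)
    parser.close()
    text = " ".join(" ".join(parser.visible).split()).lower()
    return {k: v for k, v in parser.meta.items() if v.lower() not in text}


print(stale_meta(PAGE))
```

Nothing on the rendered page would alert a reader if the `description` entry were wrong, which is the weakness the bullets above point at.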
Microformats, 3

Microformats = Semantic Web for the Impatient?
But so far, efforts to infuse the Web with meaning have gained little traction. These initiatives have been bogged down by complexity and over-ambitious goals, or have simply been too much trouble to implement at a large scale.
Proponents of microformats subscribe to the general idea of a “web of data”, but are unconvinced by RDF, OWL, etc.
Microformats are intended to lower the “barrier to entry”: no need to learn a new language.
Try to use visible data rather than invisible metadata.
Try to re-use existing standards / vocabularies for data modelling; e.g., vCard, iCalendar ⇒ hCard, hCalendar.

RDFa

RDFa can be seen as a new way of serializing RDF.
RDFa = RDF_in_attributes
Integrates into XHTML, but simpler than RDF/XML.
Motivation is similar to microformats, but
- more generic;
- can benefit from many existing tools for processing RDF.
Two basic cases, corresponding to whether object is literal or resource.

                     subject   predicate   object
literal as object    about     property    PCDATA or content
URI as object        about     rel         href
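
The table of the two basic cases can be read as a small mapping rule. A minimal sketch, assuming one element's attributes arrive as a plain dictionary; `rdfa_triple` is a hypothetical helper, and real RDFa processing has further rules (subject inheritance, prefix expansion) that are deliberately left out.

```python
def rdfa_triple(attrs, text=""):
    """Map one element's attributes to (subject, predicate, object).

    Follows only the two cases in the table; returns None when the
    element carries neither `property` nor a `rel`/`href` pair.
    """
    subject = attrs.get("about", "")
    if "property" in attrs:
        # Literal as object: the content attribute wins over the element text.
        return (subject, attrs["property"], ("literal", attrs.get("content", text)))
    if "rel" in attrs and "href" in attrs:
        # URI as object: the object is the resource named by href.
        return (subject, attrs["rel"], ("uri", attrs["href"]))
    return None


print(rdfa_triple({"about": "isbn:51649-9", "property": "dc:title"},
                  "NLP with Python"))
print(rdfa_triple({"about": "isbn:51649-9", "rel": "dc:creator",
                   "href": "orm:ewan_klein"}, "Ewan Klein"))
```

Note that in the second call the element text ("Ewan Klein") plays no part in the triple: with `rel`, the object is the URI, not the visible string.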
RDFa = RDF + XHTML, Case 1

RDF triple: isbn:51649-9 --dc:title--> "NLP with Python"
XML node tree:
  element: div
    attribute: class = title
    text: NLP with Python

RDFa = RDF + XHTML, Case 1

XML node tree + RDFa:
  element: div
    attribute: class = title
    attribute: about = isbn:51649-9 (subject)
    attribute: property = dc:title (predicate)
    text: NLP with Python (object, literal)
RDFa, Case 1

RDF triple:
  isbn:51649-9 dc:title "NLP with Python" .
XHTML: [markup not recovered]

RDFa = RDF + XHTML, Case 2

RDF triple: isbn:51649-9 --dc:creator--> orm:ewan_klein
XML node tree:
  element: div
    attribute: class
    text: Ewan Klein
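
Both triples can be recovered mechanically from annotated markup. The sketch below walks a small XHTML fragment with Python's standard `xml.etree.ElementTree` and prints one statement per annotated element. The fragment is an assumption: it reuses the values from the node trees, but the original XHTML was not recovered, and placing `about` and `href` on the author `div` is inferred from the table of cases. `triples` is a hypothetical helper that only looks at elements carrying their own `about`.

```python
import xml.etree.ElementTree as ET

# Assumed XHTML fragment built from the values in the node trees.
XHTML = """<div>
  <div class="title" about="isbn:51649-9" property="dc:title">NLP with Python</div>
  <div class="author" about="isbn:51649-9" rel="dc:creator" href="orm:ewan_klein">Ewan Klein</div>
</div>"""


def triples(xhtml):
    """Yield one Turtle-like statement per element that has RDFa attributes."""
    for elem in ET.fromstring(xhtml).iter():
        subject = elem.get("about")
        if subject is None:
            continue  # simplification: no subject inheritance from ancestors
        if elem.get("property"):
            # Case 1: literal object, taken from content or the element text.
            literal = elem.get("content") or "".join(elem.itertext()).strip()
            yield f'{subject} {elem.get("property")} "{literal}" .'
        elif elem.get("rel") and elem.get("href"):
            # Case 2: URI object, taken from href.
            yield f'{subject} {elem.get("rel")} {elem.get("href")} .'


for statement in triples(XHTML):
    print(statement)
```

The `class` attributes are untouched by the extraction, so existing styling keeps working while the same elements also carry the RDF.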
RDFa = RDF + XHTML, Case 2

element: div
  attribute: class = author
  attribute: rel = dc:creator
  text: Ewan Klein

RDFa, Case 2

RDF triple:
  isbn:51649-9 dc:creator orm:ewan_klein .
XHTML: [markup not recovered]
XHTML with RDFa: [markup not recovered]

Cafe Web Page [slide content not recovered]

Cafe Web Page, 1

File Metadata, 1: [markup not recovered]
Invisible in this case, so vulnerable to previous criticism.
Empty value of about refers to current document by default.

Cafe Web Page, 2

File Metadata, 2: [markup not recovered]

Cafe Web Page, 3

Image Metadata: [markup not recovered]
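
The remark that an empty `about` refers to the current document matches ordinary relative-URI resolution, which can be tried out with Python's `urllib.parse.urljoin`. This is a sketch under assumptions: `PAGE_URL` and the sample `about` values are hypothetical, and `resolve_subject` is an illustrative helper that ignores prefixed (CURIE) subjects.

```python
from urllib.parse import urljoin

PAGE_URL = "http://example.org/cafe.html"  # hypothetical location of the page


def resolve_subject(about, document_url):
    """Resolve an about value against the URL of the page that contains it."""
    return urljoin(document_url, about)


print(resolve_subject("", PAGE_URL))          # the document itself
print(resolve_subject("#review", PAGE_URL))   # a fragment within the document
print(resolve_subject("logo.png", PAGE_URL))  # a sibling file, e.g. an image
```

The same mechanism is what lets metadata about the file and metadata about an image on the page live side by side: each statement names its subject through `about`.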
Cafe Web Page, 4

Table Row, 1: [markup not recovered]

Cafe Web Page, 4

Table Row, 2: [markup not recovered]
Cafe Web Page, 5

Table Row, 2: [markup not recovered]

Cafe Web Page, 6

- the review
- the business
We use the content attribute to normalize the rating literal.
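
Why normalizing the rating matters can be seen in a short sketch. The markup below is an assumption, since the table-row markup was not recovered: the `v:rating` property name and both rating values are placeholders, and `literal_object` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

# Assumed markup: human-friendly text, machine-friendly content attribute.
WITH_CONTENT = '<span property="v:rating" content="4.5">four and a half stars</span>'
WITHOUT_CONTENT = '<span property="v:rating">four and a half stars</span>'


def literal_object(elem):
    """Return the literal for a property element: content if present, else its text."""
    content = elem.get("content")
    if content is not None:
        return content
    return "".join(elem.itertext()).strip()


normalized = literal_object(ET.fromstring(WITH_CONTENT))
raw = literal_object(ET.fromstring(WITHOUT_CONTENT))

print(float(normalized))  # usable as a number by a consuming application
print(raw)                # only meaningful to a human reader
```

The visible text stays free to be written for people, while the `content` value gives consumers a literal they can compare or average.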