Social Semantic Web

© Copyright 2010 Dieter Fensel and Katharina Siorpaes www.sti-innsbruck.at 1 Where are we?

# Title 1 Introduction 2 Semantic Web Architecture 3 Resource Description Framework (RDF) 4 Web of data 5 Generating Semantic Annotations 6 Storage and Querying 7 (OWL) 8 (RIF) 9 Reasoning on the Web 10 Ontologies 11 Social Semantic Web 12 Semantic Web Services 13 Tools 14 Applications

www.sti-innsbruck.at 2 Agenda

1. Motivation 2. From Web to Web 2.0: Technical solution and illustrations 3. Social Semantic Web : Technical solution and illustrations 4. Summary 5. References

www.sti-innsbruck.at 3 3 MOTIVATION

www.sti-innsbruck.at 4 4 Motivation

www.sti-innsbruck.at 5 5 Motivation (cont‘d)

• Information Sharing: • Social Websites and – Image sharing: Flickr Communication: – Video sharing: YouTube – Online encyclopedia:  Facebook – Blogs: eblogger  LastFM – Open Source Community: Linux  Skype • File Management  StudiVZ – Tagging: Delicious  LinkedIn, Xing

• Open Systems: APIs, partly open source allow extensions by users

www.sti-innsbruck.at 6 6 Motivation (cont‘d)

Internet platform for creation of social networks

• More than 400 million active users • 50% of our active users log on to Facebook in any given day • More than 35 million users update their status each day • More than 60 million status updates posted each day • More than 3 billion photos uploaded to the site each month • More than 5 billion pieces of content (web links, news stories, blog posts, notes, photo albums, etc.) shared each week • More than 3.5 million events created each month • More than 3 million active Pages on Facebook • More than 1.5 million local businesses have active Pages on Facebook http://www.facebook.com/press/info.php? statistics, 17.2.2010

www.sti-innsbruck.at 7 7 Wikipedia Motivation (cont‘d)

Free Online Encyclopedia • 3,197,507 Articles (english Wikipedia)‏ • 11,693,499 registered contributors • Clever mechanisms combined with human intelligence • High quality articles • Self-organized control • Semi-openess

http://en.wikipedia.org/wiki/ Wikipedia:Statistics, 17.2.2010 www.sti-innsbruck.at 8 8 FROM WEB TO WEB 2.0: TECHNICAL SOLUTION AND ILLUSTRATIONS

www.sti-innsbruck.at 9 9 Web 1.0 to Web 2.0

• Evolution

• Definition

• Applications and success stories

• Statistics to Web 2.0

www.sti-innsbruck.at 10 10 Web 2.0

“Web 2.0 is a notion for a row of interactive and collaborative systems of the

http://widgets-gadgets.com/2006_10_01_archive.html

www.sti-innsbruck.at 11 11 What is the web 2.0? „Definition“ by O‘Reilly

Web 1.0 Web 2.0 improvement

DoubleClick Google AdSense personalized Ofoto Flickr tagging, community Britannica Online Wikipedia community, free content Webseiten blogging dialogue publishing participation CMS flexibility, freedom directories tagging community taxonomy

Consumers  Prosumers

www.sti-innsbruck.at 12 12 What is the Web 2.0? - Examples

• Gmail • Google Documents (Collaborative Notepad in the Web) • Wikis • Wikipedia – Worlds biggest encyclopedia, Top 30 web site, 100 langueges • Del.icio.us (Social Tagging for Bookmarks) • Flickr (Photo Sharing and Tagging) • Blogs, RSS, Blogger.com • Programmableweb.com (APIs)

www.sti-innsbruck.at 13 13 Blogs

• Easy usable user interfaces to update contents

• Easy organization of contents

• Easy usage of contents

• Easy publishing of comments

• Social: collaborative (single users but strongly connected)‏

www.sti-innsbruck.at 14 14 Wikis

 invented by Ward Cunningham • Collection of HTML sites: read and edit • Most famous and biggest Wiki: Wikipedia (MediaWiki) – But: Also often used in Intranets (i. e. our group) • Problems solved socially instead of technically • Flexible structure • Background algorithms + human intelligence • No new technologies • social: collaborative (nobody owns contents)‏

www.sti-innsbruck.at 15 15 Wikis: Design Principles

• Open Should a page be found to be incomplete or poorly organized, any reader can edit it as they see fit. • Incremental Pages can cite other pages, including pages that have not been written yet. • Organic The structure and text content of the site are open to editing and evolution. • Mundane A small number of (irregular) text conventions will provide access to the most useful page markup. • Universal The mechanisms of editing and organizing are the same as those of writing so that any writer is automatically an editor and organizer. • Overt The formatted (and printed) output will suggest the input required to reproduce it. Source: http://c2.com/cgi/wiki?WikiDesignPrinciples www.sti-innsbruck.at 16 16 Tagging

• Idea: Enrich contents by user chosen keywords

• Replace folder based structure by a organisation using tags

• New: Simple user interfaces for tagging and tag based search

• First steps to Semantic Web?

• Technically: user interfaces

• Social: collaborative (own contents, shared tags)

www.sti-innsbruck.at 17 17 Tagging: Flickr.com

www.sti-innsbruck.at 18 18 Collaborative Tagging

www.sti-innsbruck.at 19 19 Collaborative Tagging: Delicious

• Browser plug-ins available from http://del.icio.us • Allows the tagging of bookmarks • Community aspect: – Suggestion of tags that were used by other users – Availability of tag clouds for bookmarks of the whole community – Possibility to browse related bookmarks based on tags

www.sti-innsbruck.at 20 20

Data created by tagging, knowledge structures

User Tag Resource

Tag Resource User Tag Resource

Tag Resource User Tag Resource

Mary tags www.wikipedia.org with wiki wikipedia encyclopedia Bob tags www.wikipedia.org with wiki web2.0 encyclopedia knowledge

www.sti-innsbruck.at 21 21 Folksonomies: Taxonomie Marlow et al. (2006)

• Rights for Tagging – Self-tagging: Contents only tagged by owner (Technorati) – Free-for-all tagging: Tagging by all users (Yahoo!)

• Support of Tagging – Blind Tagging: Existing Tags are not displayed (Flickr) – Viewable Tagging: Existing Tags are displayed (Del.icio.us) – Suggestive Tagging: Suggestions for Tags (MyWeb 2.0)

• Aggregation of Tags – Bag-model: Multiple entries (Del.icio.us) – Set-model: Only single entries (YouTube)

www.sti-innsbruck.at 22 22 Tag Clouds

Size of Tags: count of usage

Browsing replaces Searching

Different meaning for different users

Orientation in Information Set

www.sti-innsbruck.at 23 23 What is the Web 2.0? Trends for Web Applications

• Technical Evolution – Web User Interfaces become faster (AJAX) – Desktop shifts to Web (GMail, Google Notebooks, AJAX)

• Social Evolution – Collective creates additional value (Wiki, Tagging) – Free contents become popular (Licenses) – Attention is getting monetarized (Text-Ads) – Websites with additional value by recombination (Mash-Ups, RSS)

www.sti-innsbruck.at 24 24 Web 2.0

People, Services, Technologies

www.sti-innsbruck.at 25 Web 2.0

• Web 2.0 is a vaguely defined phrase referring to various topics such as social networking sites, wikis, communication tools, and folksonomies. • Tim O'Reilly provided a definition of Web 2.0 in 2006: "Web 2.0 is the business revolution in the computer industry caused by the move to the internet as platform, and an attempt to understand the rules for success on that new platform. Chief among those rules is this: Build applications that harness network effects to get better the more people use them." • Tim BL is right that all these ideas are already underlying his original web ideas, however, …

www.sti-innsbruck.at 26 Web 2.0

The four major breakthroughs of Web 2.0 are: 1. Blurring the distinction between content consumers and content providers.

2. Moving from media for individuals towards media for communities.

3. Blurring the distinction between service consumers and service providers.

4. Integrating human and machine computing in a new way.

www.sti-innsbruck.at 27 Blurring the distinction between content consumers and providers

Interactive Web applications through asynchronous JavaScript and XML (AJAX)

www.sti-innsbruck.at 28 Blurring the distinction between content consumers and providers

Interactive Web applications through asynchronous JavaScript and XML (AJAX)

www.sti-innsbruck.at 29 Blurring the distinction between content consumers and providers:

Weblogs or Blogs, Wikis

www.sti-innsbruck.at 30 Blurring the distinction between content consumers and providers:

Flickr, YouTube

www.sti-innsbruck.at 31 Blurring the distinction between content consumers and providers

Tagging – del.icio.us, shazam.com

www.sti-innsbruck.at 32 Blurring the distinction between content consumers and providers

RDFA, micro formats

www.sti-innsbruck.at 33 Moving from a media for individuals towards a media for communities

Folksomonies, FOAF

www.sti-innsbruck.at 34 Moving from a media for individuals towards a media for communities

Community pages (friend-of-a-friend, flickr, LinkedIn, myspace, …)

www.sti-innsbruck.at 35 Moving from a media for individuals towards a media for communities

Second Life

www.sti-innsbruck.at 36 Moving from a media for individuals towards a media for communities

Wikipedia

www.sti-innsbruck.at 37 Moving from a media for individuals towards a media for communities

Wikipedia

www.sti-innsbruck.at 38 Blurring the distinction between service consumers and service providers

RSS feeds

www.sti-innsbruck.at 39 Blurring the distinction between service consumers and service providers

Yahoo pipes allow people to connect internet data sources, process them, and redirect the output.

www.sti-innsbruck.at 40 Blurring the distinction between service consumers and service providers

Widgets, gadgets, and mashups.

www.sti-innsbruck.at 41 Integrating human and machine computing in a new way

Amazon Mechanical turk

www.sti-innsbruck.at 42 Integrating human and machine computing in a new way

Human computing (captchas)

www.sti-innsbruck.at 43 SOCIAL SEMANTIC WEB: TECHNICAL SOLUTION AND ILLUSTRATIONS

www.sti-innsbruck.at 44 44 Semantic Web + Web 2.0 = Web 3.0?

Web 2.0 Web 3.0

Tagging  Annotation with Tags  Annotation with concepts

 Singular/Plural Problem  Inference (Tag „Dog“ -->

 Synonyms Tag „Pet“)

 No Intelligence Recombination of • Mash-Ups developed earlier • Spontaneous by End User data from different by programmer sources

Search • Keyword Search or Tag • Structured Search Search finds documents combines Data und creates documents

Period • 2004 - 2007 • 2007 – 2010

Based on Völkl, Vrandecic and colleagues. www.sti-innsbruck.at 45 45 Web 3.0 Approaches

• Automatic Extraction of knowledge based on large (and free) sets of data, generated by Web 2.0 • Integration and Reuse of knowledge (Yahoo Pipes) • Motivate users for generating semantic contents by using Web 2.0 paradigms • Creation of Semantic as side effect of working processes (semantic wikis)

www.sti-innsbruck.at 46 46 Gartner Hype Curve

Gartner, July 2007

www.sti-innsbruck.at 47 47 Web 2.0 and the Semantic Web

Web 2.0 and Semantic Web are complementary approaches

1. Semantic Blogging 2. Semantic Wikis • Semantic MediaWiki 3. Web 2.0 ontology building • myOntology 4. Games for semantic content creation • OntoGame

www.sti-innsbruck.at 48 48 Semantic Blogging

• Creating blog entries in a structured fashion • Based on ontologies • This allows: – Acquiring complementary information from the Web – Finding blog entries better

www.sti-innsbruck.at 49 49 Semantic Wikis

• A is a wiki that has an underlying model of the knowledge described in its pages. • Regular, or syntactic, wikis have structured text and untyped hyperlinks. • Semantic wikis, on the other hand, allow the ability to capture or identify information about the data within pages, and the relationships between pages, in ways that can be queried or exported like data. • Wikis: – Platypus wiki – IkeWiki – Kiwi – WikiFactory – Semantic MediaWiki

www.sti-innsbruck.at 50 50 Semantic Wikipedia: Advantages

Structured Knowledge can be exported (in RDF)

• New Web 2.0 Applications are possible

• Reusing of knowledge beyond languege borders

• Aggregated Search over more than one site

• Quality: Finding of mistakes and conflicts – Has every country a capital city? – Is every person born before dying? – Does the population density match to population and area?

Based on Völkl, Vrandecic and colleagues. www.sti-innsbruck.at 51 51 Semantic Wiki = Wiki + Semantic Web

• Semantic MediaWiki: Extension of the MediaWiki Software

• Syntax extension allows typed links • Page Karlsruhe – Up to now: … in the south of [[]] … – Now: … in the south of [[is in::Germany]] …

• Syntax extension allows annotaion of values • Page Karlsruhe – Up to now: … has a population of 280,000 people. … – Now: … has a population of [[population:=280000]] people.

Based on Völkl, Vrandecic and colleagues.

www.sti-innsbruck.at 52 52 Semantic MediaWiki

Based on Völkl, Vrandecic and colleagues. www.sti-innsbruck.at 53 53 What is located in California?

Based on Völkl, Vrandecic and colleagues.

www.sti-innsbruck.at 54 54 Web 2.0 ontology building

• Make use of various Web 2.0 paradigms to capture knowledge required for ontologies • Lower entrance barriers for users • Usually emphasis on collaboration • Ontologies as community contracts • Methods for consensus finding • Visualization of ontologies • Examples: – Ontology editor based on SMW – Myontology – Soboleo (image annotation)

www.sti-innsbruck.at 55 55 Ontology editor for Semantic MediaWiki

• http://smw-active.sti-innsbruck.at

www.sti-innsbruck.at 56 56 Semantic Media Wiki – Ontology Editor

• Ontology Element Management – Management of vocabularies, categories, properties, and elements – Enhanced user interface: tagcloud, forms, tree-view – Enriched content through external media (e.g. Flickr)‏ • Knowledge Repair

– Statistical analysis of knowledge within the wiki – Detection and correction of inconsistencies – Discovery of redundancies • Knowledge Import

– Folksonomy import through mapping to the SKOS ontology

www.sti-innsbruck.at 57 57 The create functionalities-category

• Primary navigation • Create functionalities • Import functionalities • Knowledge repair • Media Wiki links

• Introduction

• Important links

• Content overview

• Tag Cloud

www.sti-innsbruck.at 58 The create functionalities-category

• Input form • Name • Description

• Auto completion

• Entity creation

• Automatic annotations

• Overview page

www.sti-innsbruck.at 59 The create functionalities-category

• Introduction • Input fields • Name • Vocabulary • SeeAlso • Synonym • Example • Description

• Properties • New ones • Existing ones

• Supercategories • Name analysis • Help fields

www.sti-innsbruck.at 60 Knowledge repair-cycles, links, names

• Comprehensive overview

• Table with explanations

• Table with min, avg, max values

• Table with results for each category

• Colored cells and symbols to indicate probable issues

www.sti-innsbruck.at 61 Knowledge repair-cycles, links, names

• Tab ´repair´

• Repair of a specific category

• Explanations

• Tables with numerical values

• Paragraphs and lists

www.sti-innsbruck.at 62 Knowledge repair-cycles, links, names

www.sti-innsbruck.at 63 myOntology

• Collaborative creation of ontologies

• A tool which helps specialists and ontology experts to collaborate easily

• MyOntology uses the Web 2.0 paradigm

www.myontology.org

www.sti-innsbruck.at 64 64 myOntology Philosophy

• Collaboration of specialists and ontology experts • In first phase (until lightweight ontologies) • High usability • Integration and Reusing of web knowledge (Web 2.0: Folksonomies, Flickr, YouTube, Wikipedia, etc.)‏ • „Background intelligence“ supports development team

www.sti-innsbruck.at 65 65 Games for semantic content creation

A prerequisite for the Semantic Web to become a reality is the availability of annotated data.

Building the Semantic Web is not a one-time task, but a continuous effort.

www.sti-innsbruck.at 66 66 Observation

There are tasks that are easy for humans but difficult for computers

Cf. Von Ahn Not all the tasks on the Semantic Web can be automated. Some at least partly require human intelligence.

www.sti-innsbruck.at 67 67 Web 2.0 is Hot, Semantic Web is Not. Why?

• Web 2.0 applications enjoy great popularity

• The incentive structures are clear (Marlow et al.,2006; Kuznetsov, 2004)

• Incentives for ontology building, ontology

alignment, and semantic annotation have not been investigated so far

www.sti-innsbruck.at 68 68 The OntoGame Idea and Principles

Make people weave the Semantic Web by playing cool multi-player online games.

1. Fun and intellectual challenge 2. Consensus 3. Massive content generation

www.sti-innsbruck.at 69 69 10 Challenges

1. Identifying suitable tasks in semantic content creation 2. Designing games 3. Designing a usable, attractive interface 4. Identifying suitable knowledge corpora 5. Preventing cheating 6. Defusing typical pitfalls of conceptual modeling 7. Distribution of labor 8. Fostering user participation 9. Deriving formal representations 10. Scalability and performance

www.sti-innsbruck.at 70 70 TheOntoGame Series

www.sti-innsbruck.at 71 71 Features

• Players paired randomly and anonymously • Best strategy to get points: truthful answers • Live mode, single player mode, chess mode (remote) • Skip • Limited amount of time • Cheating: – Anonymity – Pre-recorded challenges • Generic gaming platform • Derive formal representations of the data

www.sti-innsbruck.at 72 72 OntoPronto: Creating a Huge Domain Ontology

www.sti-innsbruck.at 73 73 OntoTube: Annotating YouTube videos

www.sti-innsbruck.at 74 74 SUMMARY

www.sti-innsbruck.at 75 75 Summary

• The Web has undergone change from „Web 1.0“ to „Web 2.0“. • Web 2.0 stands for more user interaction, the change from consumers to prosumers, the programmable Web, collaboration, easy interfaces, etc. • This movement has triggered enormous user participation. Many tools provide strong incentives to their users. • Social Semantic Web approaches aim at combining parts of Web 2.0 with semantics. • Examples including Semantic Wikis, Semantic blogs, games for Semantic content creation, etc.

www.sti-innsbruck.at 76 76 REFERENCES

www.sti-innsbruck.at 77 77 References

• Mandatory reading: – http://oreilly.com/web2/archive/what-is-web-20.html

www.sti-innsbruck.at 78 78 References

• Further reading: – http://social.semantic-web.at/index.php/Main_Page – S. Braun, A. Schmidt, A.Walter, G. Nagypal, and V. Zacharias. Ontology maturing: A collaborative web 2.0 approach to ontology engineering, May 8 2007. – S. E. Campanini, P. Castagna, and R. Tazzoli. Platypus wiki: a semantic wiki wiki web. pages 16, December 10 2004. – Peter Mika. Ontologies are us: A unied model of social networks and semantics. volume LNCS 3729. Springer, 2005. – S. Schaffert. Ikewiki: A semantic wiki for collaborative . June 2006. – K. Siorpaes, M. Hepp, A. Klotz, and M. Hackl. myontology: Tapping the wisdom of crowds for building ontologies. Technical report, STI Innsbruck Technical Report, 2008. – K. Siorpaes and M. Hepp. Games with a purpose for the semantic web. IEEE Intelligent Systems, 23(3):5060, 2008. – M. Völkel, M. Krötzsch, Denny Vrandecic, and H. Haller. Semantic wikipedia. May 23-26, 2006.

www.sti-innsbruck.at 79 79 References

• Wikipedia links: – http://en.wikipedia.org/wiki/Social_Semantic_Web – http://en.wikipedia.org/wiki/Web_2.0

www.sti-innsbruck.at 80 80 Next Lecture

# Title 1 Introduction 2 Semantic Web Architecture 3 Resource Description Framework (RDF) 4 Web of data 5 Generating Semantic Annotations 6 Storage and Querying 7 Web Ontology Language (OWL) 8 Rule Interchange Format (RIF) 9 Reasoning on the Web 10 Ontologies 11 Social Semantic Web 12 Semantic Web Services 13 Tools 14 Applications

www.sti-innsbruck.at 81 Questions?

www.sti-innsbruck.at 82 82