<<

Lecture Overview

Web 2.0, Tagging, Multimedia, • Introduction to Web 2.0 , Lecture, • Overview of Tagging Systems – Overview of tagging Important, Must Attend, … – Design and attributes Martin Halvey – Case Studies • Other Web 2.0 Technologies • Conclusions

Web 2.0 Definition Web 2.0 Definition

“Web 2.0 is the business revolution in “An idea in people's heads rather than a the computer industry caused by the reality. It’s actually an idea that the move to the as platform, and an reciprocity between the user and the attempt to understand the rules for provider is what's emphasised. In other success on that new platform” words, genuine interactivity if you like, simply because people can upload as Tim O'Reilly (2006-12-10). Web 2.0 well as download” Compact Definition: Trying Again Stephen Fry: Web 2.0

1 Web 2.0 Definition Web 2.0

“piece of jargon” “Participatory Web” “nobody really knows what it means” “if Web 2.0 for you is and , Bart Decrem (2006-06-13). Introducing then that is people to people. But that Flock Beta 1. Flock official was what the Web was supposed to be all along” -Tim Berners-Lee

Web 2.0 Web 2.0 Feature/Techniques

• Allow users to do more than retrieve • Rich Internet Application techniques, AJAX information • Semantically valid XHTML and HTML • "Network as platform" computing, allowing • REST and/or XML- and/or JSON-based APIs users to run software applications entirely through a browser • Cascading Style Sheets • In contrast to systems which categorise users • RSS or feeds into roles with varying degrees of functionality • Mash Ups • Re-use of existing technologies, some new • Weblog publishing • or Forum • Social Networking

2 Tagging Folksonomies

• A is a (relevant) keyword or term • A is a user generated associated with or assigned to a piece used to categorize and retrieve web content of information (e.g. a picture, article, or such as Web pages, photographs and Web clip), thus describing the item and links, using open-ended labels called tags enabling keyword-based classification • Intended to make a body of information increasingly easy to search, discover, and of information. navigate over time • Usually chosen informally and • Normally used online but they can arise in a personally by item author/creator or by number of other contexts its consumer/viewers/community

Folksonomies and Semantic Folksonomies Web • Well-developed folksonomy is ideally • The is an evolving extension accessible as a shared vocabulary that of the WWW in which content is expressed is both created by, and familiar to, its not only in a format that can be read and used by automated tools, as well as natural primary users language • Folksonomy tools are not part of the • Folksonomies can be used in conjunction with WWW protocols semantic web technologies to provide rich • Arise where special provisions are descriptions, but not quite yet. made • However from folksonomies is not • Particularly useful when no other text is consistent or reliable available

3 Bridging the Semantic Gap Example Tagging Systems

• The difference between low-level data • Del.ioco.us (http://del.ioco.us) representation of multimedia and the • CiteULike (http://www.citelike.org) higher level concepts users associate • (http://www.flickr.com) with the same multimedia • YouTube (http://www.youtube.com) • Providing can alleviate this problem • Last.fm (http://www.last.fm) • Need to improve annotations to • Technorati (http://www.technorati.com) overcome this problem • ESP Game (http://www.espgame.org)

Types of Tags System Design and Attributes

• Identifying What (or Who) it is About • Tagging Rights • Identifying What it Is • Tagging Support • Identifying Who Owns It • Aggregation • Refining Categories • Type of Object • Identifying Qualities or Characteristics • Source of Material • Self Reference • Resource Connectivity • Task Organising • Social Connectivity

4 User Incentives Tagging/Folksonomies Pros

• Future Retrieval • Easy to use • Contribution and Sharing • Intuitive • Attract Attention • Cheap way of getting annotations • Play and Competition • Gives new users quick and simple • Self Presentation impression of content • Opinion Expression • Can aid browsing and search

Tagging/Folksonomies Cons Tagging Conclusions

• Inaccurate and irrelevant tags • Provide keywords to describe objects • Lack of stemming • Collaborative tagging provides a • Freely chosen tags can result in community view of object – • Number of pros and cons to using – Homonyms tagging – Polysemy • Number of design approaches can be taken • Number of incentives for users to tag

5 Tagging Case Studies Tagging Case Studies

• A number of large studies of tagging YouTube Flickr Del.ico.us have taken place • Focus on different aspects of tagging Rights Self Tagging Self Tagging Free for all • We will focus on three analyses of Support Blind tagging Blind tagging Blind tagging different systems, the systems analysed Type of Photographs Links are: Object – Del.icio.us Source Participants Participants Web resource – Flickr – YouTube

Del.icio.us – Golder and Del.icio.us – Golder and Huberman Huberman • Provide an analysis of collaborative • Analyse two del.icio.us datasets tagging systems • Dataset 1 • They study the del.icio.us system for – 212 URL’s organising bookmarks – 19,422 bookmarks • Analyse the structure of collaborative • Dataset 2 tagging systems as well as their – 229 users dynamical aspects – 68,668 bookmarks

6 Del.icio.us – Golder and Del.icio.us – Golder and Huberman Huberman • Users have variety in use of tags, some • Appears that most tags are added for have many, some have few personal use • Tags vary in frequency of use • Never the less they are still useful to the • However, stable patterns emerge in public tags • They believe that consensus choices • Adding numbers of infrequently used that emerge may be used on a large tags/opinions, does not disrupt the scale to describe and organise how web general consensus documents interact with one another, and also can be used to make recommendations

Flickr – Marlow et al Flickr – Marlow et al

• Present a study of the photo sharing • Present a model for tagging systems and tagging system Flickr • Compare tagging on Flickr with that of • Looked at the dynamics of the Flickr del.icio.us and hope to expose interesting trends • Dynamics of interaction and and topics in the Flickr participation are different to Del.icio.us • Set of 25,000 users for individual • To be expected as they are different analysis models of tagging, and different user • Set of 2,500 users for network analysis incentives

7 YouTube – Halvey and Keane YouTube – Halvey and Keane

• More interested in user goals when • Views of pages match distributions using and searching in a tag based found in web search system • Views from navigation match • Examine why some videos are viewed distributions found in web navigation more often than others • Found that in general the more tags that • Investigation of user interactions to see users provide the more likely that a if they vary in Web 2.0 video will be watched, up to a certain • Analysed104,465 video pages from point 57,639 users

YouTube – Halvey and Keane Tagging Case Studies

• In general users are consumers rather than • Briefly seen a number of analyses of creators of resources for the service tagging systems • Users of YouTube do not use the social tools • A number of different approaches were unless they gain a benefit investigated, from a number of different • Videos are recommended because they are aspects popular, not popular because they are recommended • Although tagging and tag based • Videos receive the majority of their views in systems seem random, there are a the first couple of days number of regularities

8 Tag Clouds Tag Clouds

• Visual depiction of user generated tags • There are a number of uses for tag • Normally there is a weighting clouds, these include associated with the tags – Browsing • Can be alphabetised – Search • Importance of a tag can be represented – Impression Formation/Gisting by font size or colour – Recognition/Matching • Tags are usually hyperlinks that lead to further information

Tag Clouds – Rivadeneira et Tag Clouds – Rivadeneira et al al • Investigated the use of tag clouds for • The use of different font sizes had an impression formation effect • Experiment 1 • Layout has an influence on the – Influences of attributes on low- effectiveness of the users in performing level cognitive processes tasks • Experiment 2 • Some of the results can be attributed to – Effect of font size and word layout on westernised reading impression formation and memory

9 Tag Clouds – Halvey and Tag Clouds – Halvey and Keane Keane • Investigated browsing and search • Alphabetisation can aid users to find • Participants carried out search tasks on information more easily and quickly tags presented 6 different formats • Font size is very important for how • Font size, alphabetisation and layout quickly and easily users find information were varied for each of the layouts • Position of tags is also very important where appropriate. • It appears that users scan lists and • The effect of each scenario on task clouds rather than read them completion time was investigated

Other Web 2.0 Technologies Rich Internet Applications

• Rich Internet Applications • Web applications that have the features and • Really Simple Syndication (RSS) functionality of desktop applications • typically transfer the processing necessary for • Cascading Style Sheets (CSS) the user interface to the web client • Rest, XML, JSON • Keep the bulk of the data back on the • Mash Up application server • Wiki • Run in a web browser, and/or do not require software installation • Weblogs • Run locally in a secure environment called a • Social Networking sandbox

10 Really Simple Syndication Rich Internet Applications (RSS) • Can provide additional support for • formats used to publish frequently multimedia, which allow more updated content, e.g. Blogs, etc. interaction • RSS formats are specified using XML, extend – Java Applications the basic XML schema – User Interface Languages, e.g. SMIL • RSS first launched in 1999 – ActiveX • Several BitTorrent-based peer-to-peer applications also support RSS – Google’s GWT Framework • Media RSS from Yahoo provides an RSS for multimedia

Cascading Style Sheets REST, XML or JSON API’s (CSS) • Stylesheet language used to describe the • Application Programming Interface (API) is a presentation of a document written in a source code interface that an operating markup language system or library provides to support requests • Separates document content from document for services presentation • Representational State Transfer (REST) is a • Improves accessibility, provides more collection of network architecture principles flexibility and controls the specification of that outline how resources are defined and presentation characteristics addressed • CSS specifications are maintained by the • JavaScript Object Notation (JSON) is a W3C, CSS2 supports different media types lightweight computer data interchange format

11 Mash Ups Wikis

• Combines data from more than one source • Computer software that allows users to easily into a single integrated tool create, edit and link web pages • Content is typically sourced from a third party • Can provide collaborative websites, power via a public interface or API community websites, and effective intranets • Yahoo, Google and Microsoft provide editors for use in • Three types of mash up • Wikipedia is one of the largest, it has approximately 9.1 million articles in 252 – Consumer mash up languages, comprising a combined total of – Data mash up over 1.41 billion words for all Wikipedias as of – Business mash up November 2007

Weblogs (Blogs) Social Networking Sites

• Website where entries are written in • Social networks for communities of chronological order people, online • Combine text, images, and links to other blogs, web pages, and other media related to its topic • Provide a collection of various ways for • Primarily textual, although some focus on art users to interact, including the use of (artlog), photographs (), sketchblog, multimedia videos (), music (MP3 blog), audio • Contain directories of some categories (podcasting) • Technorati is main blog search engine as was • Contain means to connect with friends tracking more than 106 million blogs as of • Use recommender systems linked to September 2007 trust

12 Conclusions

• Tags provide cheap, easy metadata to describe objects • Collaborative tagging (folksonomies) provide a community based description • Tagging systems can vary in a number of ways • There are a number of technologies that have emerged and are used as part of Web 2.0

13