<<

metadata

is: structured that describes, explains, locates, or otherwise makes it easier to retrieve, use, or manage an information resource (NISO)

it allows systems to collocate related information, and helps users find relevant information

lecture 18: tagging and

of#49# ece#627,#winter#‘13# 2# of#49#

metadata metadata ways of creation third (new) way

generally in two ways: user-created metadata professional creation (professionals working with complex, detailed rule sets and vocabularies) users of the documents and media create metadata for their own individual use that is also shared author creation (authors of documents provide throughout a community metadata along with their creations)

are ontologies the result of that???

ece#627,#winter#‘13# 3# of#49# ece#627,#winter#‘13# 4# of#49#

folksonomy folksonomy what is it? what is it?

it is a people's the set of terms (called tags) that a group of users is composed of terms in a flat namespace tagged content with, they are not a predetermined there is no hierarchy, no parent-child or sibling set of classification terms or labels relationships between these terms

ece#627,#winter#‘13# 5# of#49# ece#627,#winter#‘13# 6# of#49# folksonomy what is tagging? … introduction

a is a non-hierarchical keyword or term the cumulative force of all the individual tags can assigned to a piece of information - such as an produce a bottom-up, self-organized system for bookmark, , or classifying items on the web ()

tagging – to mark with a tag; to label, identify, or recognize with or as if with a tag - a unique and powerful way of organizing information

ece#627,#winter#‘13# 7# of#49# ece#627,#winter#‘13# 8# of#49#

what is tagging? tagging system tagging system users

three components the people who employ a tagging system - users (sometimes also called taggers) – they create the tags, and sometimes they add resources - resources

- tags have a variety of different interests, needs, goals, and motivations – but they are trying to achieve some larger goal – such as sharing a photo or labeling a document so they can find it later

ece#627,#winter#‘13# 9# of#49# ece#627,#winter#‘13# 10# of#49#

tagging system tagging system resources tags

are items that users tag the keywords added by users are tags

a resource can be just about anything – a , a can be just about any kind of term, they can be Web page, a video, or even a location descriptions of the resources subject matter, its location, its intended user, a reminder, or something within each tagging system, resources often share else entirely – can be individual words or phrases some common properties – they are , or photos, or … tags are essentially metadata about the resource

ece#627,#winter#‘13# 11# of#49# ece#627,#winter#‘13# 12# of#49# tagging system tagging system tags

tags are more than just metadata in an application – is a method of presenting tags where the more they are a tool people use to track, share, and find frequently used tags are emphesized information

ece#627,#winter#‘13# 13# of#49# ece#627,#winter#‘13# 14# of#49#

tagging system tagging system tag cloud – example …

all tagging happens in the context of a system, and the system defines what kind of tagging can take place

for example, the system may allow users to add their own resources or not, may allow to tag any resource or not, may forbid certain kinds of tags

www.wordle.net ece#627,#winter#‘13# 15# of#49# ece#627,#winter#‘13# 16# of#49#

tagging system tagging system perspectives …

Information Social Architecture the structural design of shared information environments tagging and

tagging sits the art and science of organizing and labeling web at the intersection sites, intranet, online communities, and software to support usability and of three established Personal fields Information Management information architects focus on using controlled vocabularies, search-and-browse systems

ece#627,#winter#‘13# 17# of#49# ece#627,#winter#‘13# 18# of#49# tagging system tagging system social software personal information management

refers to the practice and study of the activities applications that people use to communicate, people perform in order to acquire, organize, collaborate, and share online maintain, retrieve, and use information items such as documents, web pages, e-mail messages …

(Wikipedia) people who design social software are interested in facilitating group interaction within the system they are programs for managing information and methods for keeping yourself on track – help you file, track, and find your information when you need it

ece#627,#winter#‘13# 19# of#49# ece#627,#winter#‘13# 20# of#49#

tagging system tagging system tensions … tensions …

personal <-> social idiosyncratic<-> standard

do people tag primary for their own benefit? should tags be unique? or are they motivated to share information with a or should be standardized so they can be used for group …? browsing and searching?

ece#627,#winter#‘13# 21# of#49# ece#627,#winter#‘13# 22# of#49#

tagging system tagging system tensions … tensions …

freedom<-> control amateur<-> expert

does the system give users complete freedom? how qualified are the people who do tagging? or does it influence or control their tags? should tags contributed by amateurs count as much as tags created by experts?

ece#627,#winter#‘13# 23# of#49# ece#627,#winter#‘13# 24# of#49# tagging tagging why matters motivation

ease of use it is popular - tags are simple it is multifaceted just typing few words it is flexible - tags are flexible it is also made for the stream – the constant flow of tags can be whatever you need them to be information we experience online - tags are extensible you can always add new tags - tags can be aggregated

… can be messy and may not conform to any recognizable pattern ece#627,#winter#‘13# 25# of#49# ece#627,#winter#‘13# 26# of#49#

tagging tagging motivation motivation

managing personal information collaborating and sharing - do not need to consider the whole categorization scheme, you - you can explore topics using the tags of other users just add tags - other users may be experts - you can add any tags, instead of finding the one category that is - you may use tags to connect with other users who share the best fit interests - re-categorization is easy if we make a mistake having fun

expressing yourself

ece#627,#winter#‘13# 27# of#49# ece#627,#winter#‘13# 28# of#49#

tagging system tags as metadata architecture kinds of metadata

metadata: requires to set up rules about your users (who they - helps you (or others) find you want are and how they join the system), your resources - helps you manage your data (how they are added to the system), and tags (who - lets you relate your data to other data you own, as can tag which resources) well as other data out there in the world

how users interact with each other

ece#627,#winter#‘13# 29# of#49# ece#627,#winter#‘13# 30# of#49# tags as metadata tags as metadata kinds of metadata kinds of metadata

tag type example descriptive – provide details about the resource descriptive webdesign, drama, sushi administrative – used to manage a collection of gardening, music resources (for example, date a resource was resource blog, book, video, photo acquired, the person who owns the rights to the ownership/source nytimes, genesmith (author) resource) opinion cool, funny, lame structural – used to associate the resource with other resources (for example, volume of books, self-reference mystuff, mine maps of how individual files relate to each other) task organizing todo, work play/performance helo3, poetry

ece#627,#winter#‘13# 31# of#49# ece#627,#winter#‘13# 32# of#49#

tags … tags … and controlled vocabularies controlled vocabularies

two kinds of classification systems – define a system for managing the meaning of words – it relationships between terms removes ambiguity of language

help us understand and navigate concepts by synonym rings – give two or more words an making language less ambiguous, by connecting equivalent meaning concepts, and by capturing the relationships authority files – as above but one of the words is between objects observed in the real world identified as a preferred term

ece#627,#winter#‘13# 33# of#49# ece#627,#winter#‘13# 34# of#49#

tags … tags … taxonomies enriching taxonomy with tags

bubble-up approach establishes parent-child relationships between tags are attached to a resource, for example, a song terms, are typically hierarchical those tags are bubble-up from several songs to describe their parent item, album album tags are then bubbled up again to describe the artist

relationships between resources are preserved while capturing the descriptive terms of users

ece#627,#winter#‘13# 35# of#49# ece#627,#winter#‘13# 36# of#49# folksonomy folksonomy introduction introduction

it is a term used to describe the bottom-up the relationships between tags are inferred based classification systems that emerge from social on their usage patterns tagging no formal relationships parent-child like in taxonomy no equivalences between terms as in a

ece#627,#winter#‘13# 37# of#49# ece#627,#winter#‘13 # 38# of#49#

folksonomy folksonomy introduction - independence

users are free to choose their tags css ajax HIV

ccr5 some systems offer suggestions – a tool aimed to help users add tags more easily and efficiently cxcr4 webdesign

ece#627,#winter#‘13# 39# of#49# ece#627,#winter#‘13# 40# of#49#

folksonomy folksonomy - aggregation - inference

pulling all the tags together in an automated way – this creates folksonomy relationships between tags are inferred from their use manual sampling of tags, few users – not a folksonomy they are based on the language and usage patterns of real users – based on users activities and interests

ece#627,#winter#‘13# 41# of#49# ece#627,#winter#‘13# 42# of#49# folksonomy folksonomy - methods to infer semantic relationships when to use

- counting tags to see which is most popular - nomenclature is uncertain or evolving - co-occurrence counts which tags are used together - dynamic information space (loose approximation of the associative - semantic relationships are not critical relationships) - multiple viewpoints are desirable - clustering of tags that have a high probability of co- occurence - you can tap in an active base of users

ece#627,#winter#‘13# 43# of#49# ece#627,#winter#‘13# 44# of#49#

from folksonomy to ontology from folksonomy to ontology super-class relationships super-class relationships

for example tags that co-occur with other tags often are thought "music" co-occurs with both "piano" and "guitar", to be more general than more specific-tags that co- and as such can be suspected being a super-class of occur with other tags less often both on the other hand, "piano" probably does not co- occur with more possible tags than "music" and usually co-occurs with "music" and so it likely is a subclass

ece#627,#winter#‘13# 45# of#49# ece#627,#winter#‘13# 46# of#49#

from folksonomy to ontology from folksonomy to ontology synonym relationships structured relationships

detecting synonyms is actually counter-intuitive, since I believe that the same user will not tag a URI tags that co-occur often might have a facet, or both "computer" and "PC," but will probably only structured relationship pick one of those these may be pairs or trids

however, groups of users will use different synonyms, and over time most of the convergence will come from synonyms being merged.

ece#627,#winter#‘13# 47# of#49# ece#627,#winter#‘13# 48# of#49# from folksonomy to ontology structured relationships

for example "book" and "author" and "Mark Twain" is a triadic ("triple" on the ) relationship, and if these co-occur quite often they are probably a relationship in fact, one would suspect that most co-occurences are pairs, like "author" and "Zadie Smith," or "book" and "Mark Twain," and making these work with the Semantic Web would be slightly more difficult

ece#627,#winter#‘13# 49# of#49#