Multimedia Information Systems VO/KU (707.021/707.022)

Denis Helic

KMI, TU Graz

Oct 10, 2011

Denis Helic (KMI, TU Graz) Multimedia Information Systems Oct 10, 2011 1 / 53 Outline

1 Definition

2 The Internet

3 History

4 The Web

5 Social Web

6 and the Web

Denis Helic (KMI, TU Graz) Multimedia Information Systems Oct 10, 2011 2 / 53 Information systems, Internet, Web Multimedia Information Systems VO/KU (707.021/707.022)

Denis Helic

KMI, TU Graz

Oct 10, 2011

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 3 / 53 Definition Information systems

Systems for managing information in different forms Data vs. Information Information is data in a context E.g. data: 23 Context: Age of Hermann Maurer is 23

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 4 / 53 Definition Information systems

Management Create, Retrieve (Search, Browse, etc.), Update, Delete, etc. CRUD operations Persistent storage With a special purpose

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 5 / 53 Definition Information systems

Various kinds of information systems Depending on the kind of information that is managed: Structured information → clear separation of structure and content → Databases Unstructured, or semi-structured information → e.g. text documents → Document Management Systems If semi-structured information comes in the form of different media (text, video, images, etc.) → Multimedia Information Systems

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 6 / 53 Definition Information systems

Taking into account the networking environment Standalone information systems → no network Distributed information system → network is needed

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 7 / 53 Definition Information systems

In this course we discuss Distributed Multimedia Information Systems The largest network infrastructure: the Internet → Internet-based Multimedia Information Systems The largest application infrastructure on the Internet: → the Web

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 8 / 53 The Internet The Internet and the Web

Internet =6 Web

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 9 / 53 The Internet Technical Background

Internet is the net of the nets TCP/IP is the communication protocol for the Internet A computer communication protocol is a description of the rules computers must follow to communicate with each other TCP/IP stands for Transmission Control Protocol / Internet Protocol

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 10 / 53 The Internet Technical Background

Inside the TCP/IP standard there are a number of protocols for handling data communication TCP (Transmission Control Protocol) communication between applications UDP (User Datagram Protocol) simple communication between applications IP (Internet Protocol) communication between computers ICMP (Internet Control Message Protocol) for errors and statistics DHCP (Dynamic Host Configuration Protocol) for dynamic addressing

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 11 / 53 The Internet Technical Background

TCP is for communication between applications When an application wants to communicate with another application via TCP, it sends a communication request The request must be sent to an exact address The connection between two computers is open until it is closed by one of the two applications TCP is reliable (if a part of request is lost, TCP ensures that it is sent again) Connection-oriented

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 12 / 53 The Internet Technical Background

UDP is for communication between applications UDP is simpler and less reliable than TCP Connection-less Does not ensure that the missed parts are sent again Faster than TCP - good e.g. for streaming media applications

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 13 / 53 The Internet Technical Background

IP is for communication between computers Connection-less IP breaks messages into small independent packets IP is responsible for routing each packet to its destination An IP packet first arrives at an IP router The IP router routes the packet to its destination, directly or via another router Router works with IP addresses

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 14 / 53 The Internet Technical Background

IP addressing: IPv4: 32 bit long (4 bytes) / dot notation e.g. 2166031126 = 10000001000110110000001100010110 → 10000001.00011011.00000011.00010110 → 129.27.3.22 IPv6: 128 bit (2128 different addresses, 7 ∗ 1023 pro m2)

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 15 / 53 The Internet Technical Background

hierarchical naming service (e.g. news.tu-graz.ac.at → 129.27.3.22)

Generic Countries

int com edu gov mil org net jp us nl . . .

sun yale acm ieee ac co oce ¡ vu

eng cs eng jack jill keio nec cs

ai linda cs csl flits fluit

robot pc24

Figure: Space

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 16 / 53 The Internet Technical Background

TCP/IP and UDP/IP combinations TCP/IP is TCP and IP working together TCP takes care of the communication between your application software (i.e. your browser) and your network software IP takes care of the communication with other computers TCP is responsible for breaking data down into IP packets before they are sent, and for assembling the packets when they arrive

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 17 / 53 The Internet Technical Background

Higher-layer applications are referred to by a port identifier in TCP/UDP messages The port identifier and IP address together form a socket The end-to-end communication between two hosts is uniquely identified on the Internet by the four-tuple (source port, source address, destination port, destination address) TCP/UDP and specific ports are the basis for the Internet services

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 18 / 53 The Internet Services in the Internet

Telnet Email Usenet (news) Gopher Ftp WWW

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 19 / 53 The Internet Protocols on top of TCP/IP

Specific for a service RFCs (http://www.ietf.org/rfc.html) HTTP1.0: RFC 1945 FTP: RFC 959

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 20 / 53 The Internet Protocols on top of TCP/IP

Figure: Protocols

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 21 / 53 The Internet Internet Growth

Anzahl der Computer im Internet (Source: Internet Software Consortium (http://www.isc.org/)) 180

160

140

120

100

80 in Millionen 60

40

20

0 01/1993 01/1994 01/1995 01/1996 01/1997 01/1998 01/1999 01/2000 01/2001 01/2002 01/2003

Figure: Internet Growth Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 22 / 53 In 2008: 100.000G transferred over the Internet in a second

The Internet Internet Growth

Not only computers, but other devices, e.g. phones connected to the Internet About billion devices connected today In 1993: 100.000G transferred over the Intern per year

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 23 / 53 The Internet Internet Growth

Not only computers, but other devices, e.g. phones connected to the Internet About billion devices connected today In 1993: 100.000G transferred over the Intern per year In 2008: 100.000G transferred over the Internet in a second

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 23 / 53 The Internet Internet Growth

Figure: Internet Map http://www.opte.org/

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 24 / 53 History Information systems - Historical

1945 Vannevar Bush “As We May Think” http://www.theatlantic.com/magazine/archive/1945/07/ as-we-may-think/3881/ Device called memex

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 25 / 53 History Information systems - Historical

Figure: Drawing of Bush’s theoretical Memex machine (Life Magazine, November 19, 1945)

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 26 / 53 History Information systems - Historical

Digital library - knowledge management system Extends human brain Index device for later retrieval Recording information with microphone, camera Trails: today links

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 27 / 53 History Information systems - Historical

1967 formed terms “” and “” Book “Literary Machines” Project Xanadu (http://www.udanax.com) Worldwide, distributed library

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 28 / 53 History Information systems - Historical

Different types of (bookmarks, annotations, footnotes, hypertext-jumps, etc.) Transclusion Versioning Still unfinished :-)

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 29 / 53 History Information systems - Historical

1967 van Dam: Hypertext Editing System 1968 Douglas Engelbart: NLS (oN-Line System) Invented text-processing, window-technique, email, hypertext, mouse http://sloan.stanford.edu/MouseSite/1968Demo.html I don’t know why we call it a mouse. It started that way and we never changed it.

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 30 / 53 History Information systems - Historical

1975 Akscyn: KMS (Knowledge Management System) 1982 Hermann Maurer: BTX (Bildschrim TeXt), Mupid Developed at IICM BTX: > 20.000 pages, encyclopedias, games, discussion forums, ecommerce (http: //much.iicm.edu/much/projects/videotex_2/index.htm/) Mupid: the first network computer with software from the net

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 31 / 53 History Information systems - Historical

1985 Norman Meyrowitz: Intermedia, IRIS 1987 Bill Atkinson: Hypercard 1987 Notecards 1989 WAIS (Wide Area Information System) Beginning of 1990s Hyper-G (now Hyperwave) 1991 Gopher 1994

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 32 / 53 History Information systems - Historical

Reflections on NoteCards: seven issues for the next generation of hypermedia systems Paper by Frank Halasz at ACM Hypertext 1987 http://dl.acm.org/citation.cfm?id=317426.317451

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 33 / 53 History Information systems - Historical

Search and query in a hypermedia network Composites - augmenting the basic node and link model Virtual structures for dealing with changing information (e.g. query links) Computation in (over) hypermedia networks (dynamic content)

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 34 / 53 History Information systems - Historical

Versioning Support for collaborative work Extensibility and tailorability (e.g. personalization)

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 35 / 53 History Information systems - Historical

In 1994 the Web did not support any of these features The Web and the hypertext community split The Web did not even support many of the features of the first generation hypertext systems “Your future is my past” Ted Nelson at closing a video panel between Hypertext and WWW conference in 1997

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 36 / 53 History Information systems - Historical

In 1994 Hyper-G had typed links Integrity of links Metadata Composites such as collections, menus, etc

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 37 / 53 History Information systems - Historical

What are the reasons for the success of the Web? Network Addressability across the network: Simplicity Architecture that scales

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 38 / 53 The Web The Web

Quotes by Tim-Berners Lee “Big is beautiful: Network is everything” “Scruffy works: Let the links fail to make them scale” Search engines brought information retrieval

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 39 / 53 The Web The Web

The fastest growth of any technology in the human history Time to reach 50 million people Telephone 75 years Radio 35 years TV 13 years The Web 4 years

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 40 / 53 The Web The Web

Figure: Jakob Nielsen, 100 Million Web Sites, http://www.useit.com/alertbox/web-growth.html

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 41 / 53 The Web The Web

The size of the Web Visible Web and The Deep Web (behind passwords) The size of the Web ≈ 1000 billions http://googleblog. blogspot.com/2008/07/we-knew-web-was-big.html Indexed pages ≈ 50 billions http://www.worldwidewebsize.com/

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 42 / 53 The Web The Web

Figure: http://techcrunch.com/2009/05/08/ is-the-growth-of-the-web-slowing-down-or-just-taking-a-breather/

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 43 / 53 The Web The Web

Figure: The Web is dead, http://www.wired.com/magazine/2010/08/ff_webrip/all/1

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 44 / 53 The Web The Web

The Web Ain’t Dead Yet (And It’s Getting Easier to Create) http://www.wired.com/epicenter/2011/08/ web-aint-dead-easier-to-make/ Apps and big platforms: easy to use but hard to program

Figure: HTML5

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 45 / 53 Social Web Web 2.0

Web 2.0 is not a new technology It can be understand as a set of principles and practices in developing Web applications Particular Web 2.0 applications will demonstrate some or all of these principles/practices Social aspects of the Web Connect the people not machines: use network effect

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 46 / 53 Social Web Web 2.0

Technical infrastructure of the Web has a social impact Hyperlinking Users add new content and new sites It is discovered by the others who link to it The web of connections grows through a social process

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 47 / 53 Social Web Web 2.0

Figure: TBL, The Web, Locking back, looking forward

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 48 / 53 Social Web Web 2.0

Hyperlinking (success stories) Yahoo is a directory of links Google’s PageRank uses the link structure to rank the search results Make use of hypelinking in your Web apps The links need to be easy to use especially for humans: clean URLs

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 49 / 53 Social Web Web 2.0

Harness collective activity Encourage users activity, i.e. make an activity infrastructure eBay enables a context in which activity can happen Amazon supports reviews, invitations, discussion Similar interests, similar books, suggestions all based on user activity Support community

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 50 / 53 Social Web Web 2.0

Further extension of the activity principle: engage users in creating content Wikipedia, trust your users to create content Flickr, delicious, collaborative structuring of sites Sourceforge.net collaborative source creation

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 51 / 53 Social Web Web 2.0

Blogging: personal diaries Big breakthrough: RSS You can not only link to a page but subscribe to it Push content to users instead of users pulling it Use the network effect from user contributions

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 52 / 53 Hypertext and the Web Reflections on Seven Issues

Reflections on Seven Issues: Hypertext in the Era of the Web 2001 Paper by Frank Halasz http://dl.acm.org/citation.cfm?id=507317.507328 The Web got there!

Denis Helic (KMI, TU Graz) Information systems, Internet, Web Oct 10, 2011 53 / 53