Seventh Framework Programme Fp7-Ict-2009-6

Total Page:16

File Type:pdf, Size:1020Kb

Seventh Framework Programme Fp7-Ict-2009-6 SEVENTH FRAMEWORK PROGRAMME FP7-ICT-2009-6 BlogForever Grant agreement no.: 269963 BlogForever: D2.1 Survey Implementation Report Editor: Silvia Arango-Docio Revision: Draft V1.0 Dissemination Level: WP2 Author(s): S. Arango-Docio, P. Sleeman, H. Kalb Due date of deliverable: 31/08/2011 Actual submission date: Start date of project: 01 March 2011 Duration: 30 months Lead Beneficiary name: University of London Abstract: This report outlines a principal study aiming to inform the development of preservation and dissemination solutions for blogs. To achieve this, the study encompassed: [i] a user survey exploring the aspects of blog preservation and blogging practices in general; [ii] an investigation into the use of tools and technologies within the Blogosphere; and finally [iii] an inquiry into the recent theoretical and technological advances for analysing blogs and their networks. The results of the study, as summarised in this report, enable addressing the objectives pursued as part of the D2.1 deliverable of the BlogForever project. More specifically, this report comments on: [a] common weblog authoring practices; [b] important aspects and types of blog data that should be preserved; [c] the patterns in weblogs structure and data; [d] the technology adopted by current blogs; and finally [e] the developments and prospects for analysing blog networks and weblog dynamics. As an account for the conducted work this report includes implementation details and adopted ethical guidelines. The results of the inquiry are discussed within the context of BlogForever – offering directions for further development of the project. Project co-funded by the European Commission within the Seventh Framework Programme (2007-2013) BlogForever_D2_120110808 31 August 2011 The BlogForever Consortium consists of: Aristotle University of Thessaloniki (AUTH) Greece European Organisation for Nuclear Research (CERN) Switzerland University of Glasgow (UG) UK The University of Warwick (UW) UK University of London (UL) UK Technische Universitat Berlin (TUB) Germany Cyberwatcher Norway SRDC Yazilim Arastrirma ve Gelistrirme ve Danismanlik Ticaret Limited Sirketi (SRDC) Turkey Tero Ltd (Tero) Greece Mokono GMBH Germany Phaistos SA (Phaistos) Greece Altec Software Development S.A. (Altec) Greece BlogForever Consortium Page 2 of 142 BlogForever_D2_120110808 31 August 2011 History Version Date Modification reason Modified by 0.1 18 June 2011 First Draft S.Arango-Docio 0.2 24 July 2011 Updated First Draft S. Arango-Docio, P. Sleeman 0.3 27 July 2011 Table of Contents S. Arango-Docio 0.4 27 July 2011 Comments received S. Arango-Docio 0.5 30 July 2011 Second Draft S. Arango-Docio, H. Kalb 0.5.1 03 August 2011 Adaptations to the format, Addition of literature H. Kalb references, Extension of the SmartPLS chapter 0.5.2 03 August 2011 Comments on second draft P. Sleeman 0.5.3 08 August 2011 Merged versions 0.5.1 & 0.5.2 S. Arango-Docio 0.5.4 18 August 2011 Updates to 0.5.3 S. Arango-Docio 0.5.5 24 August 2011 Inclusion of the chapter 4.1.1 and 4.3.3 H. Kalb 0.5.6 24 August 2011 Integration of the Technical Survey K. Stepanyan 0.5.7 25 August 2011 Expansion, review, copy-editing and commenting M. Joy, K Stepanyan 1.0 27 August 2011 Pre-final formatting H. Kalb, K. Stepanyan 1.1 30 August 2011 Review and editing M. Joy BlogForever Consortium Page 3 of 142 BlogForever_D2_120110808 31 August 2011 Table of Contents 1 EXECUTIVE SUMMARY ......................................................................................................................... 8 2 INTRODUCTION ................................................................................................................................. 10 3 BLOG PRESERVATION AND STUDIES REVIEW ..................................................................................... 11 4 BLOGFOREVER SURVEY ...................................................................................................................... 13 4.1 QUESTIONNAIRE DEVELOPMENT ............................................................................................................. 13 4.1.1 Influence Factors for Author and Reader Behaviour .................................................................. 14 4.1.1.1 Blog author intention to contribute to a blog archive ............................................................................. 14 4.1.1.2 Influence factors for the search strategy in the archive .......................................................................... 17 4.1.2 Survey Pilot Testing .................................................................................................................... 18 4.1.3 Survey Translations .................................................................................................................... 20 4.2 SURVEY IMPLEMENTATION ..................................................................................................................... 21 4.2.1 Survey Population and Sampling................................................................................................ 21 4.2.1.1 Sampling strategy ..................................................................................................................................... 21 4.2.1.2 Internet sampling ...................................................................................................................................... 22 4.2.1.3 Sample size ............................................................................................................................................... 25 4.2.2 Survey software.......................................................................................................................... 26 4.2.3 Survey Implementation and Promotion ..................................................................................... 31 4.3 DATA ANALYSES AND RESULTS ................................................................................................................ 33 4.3.1 From IProbe to SPSS and SmartPLS ............................................................................................ 33 4.3.2 SPSS and Excel Analyses ............................................................................................................. 34 4.3.2.1 Authors survey respondents summary ...................................................................................................... 34 4.3.2.2 Common blog authoring practices ............................................................................................................ 37 4.3.2.3 Patterns in blog structure and data ............................................................................................................ 39 4.3.2.4 Network-based metrics ............................................................................................................................. 41 4.3.2.5 Blog lifecycle examination ....................................................................................................................... 43 4.3.2.4 Aspects of blogs for preservation ............................................................................................................. 45 4.3.3 Blog Author Intentions for Contributing to Blog Archives .......................................................... 47 4.3.3.1 Measurement model ................................................................................................................................. 48 4.3.3.2 Structural model ....................................................................................................................................... 49 4.3.4 Influence Factors for Search Strategies within Archives ............................................................ 50 5 TECHNOLOGY USED BY CURRENT BLOGS ........................................................................................... 53 5.1 OBJECTIVES, DATA COLLECTION METHODS AND DATASETS .......................................................................... 53 5.1.1 Data Collection Methods and Datasets ...................................................................................... 54 5.1.2 Evaluation Method ..................................................................................................................... 55 5.2 EVALUATION RESULTS ........................................................................................................................... 56 5.2.1 Platforms and Software Used .................................................................................................... 56 5.2.2 Document Character Sets ........................................................................................................... 58 5.2.3 Use of CSS, Images, HTML5 and Flash ....................................................................................... 60 5.2.4 Semantic Markup: Microformats, Microdata and Metadata .................................................... 62 5.2.5 RSS and Atom Feeds ................................................................................................................... 65 5.2.6 APIs and Libraries ....................................................................................................................... 65 5.2.7 Social Media ............................................................................................................................... 67 5.2.8 Media Types and Common File Formats ...................................................................................
Recommended publications
  • CHAPTER 12 Making Your Web Site Mashable
    CHAPTER 12 Making Your Web Site Mashable This chapter is a guide to content producers who want to make their web sites friendly to mashups. That is, this chapter answers the question, how would you as a content producer make your digital content most effectively remixable and mashable to users and developers? Most of this book is addressed to creators of mashups who are therefore consumers of data and services. Why then should I shift in this chapter to addressing producers of data and services? Well, you have already seen aspects of APIs and web content that make it either easier or harder to remix, and you’ve seen what makes APIs easy and enjoyable to use. Showing content and data producers what would make life easier for consumers of their content provides useful guidance to service providers who might not be fully aware of what it’s like for consumers. The main audience for the book—as consumers (as opposed to producers) of services—should still find this chapter a helpful distillation of best practices for creating mashups. In some ways, this chapter is a review of Chapters 1–11 and a preview of Chapters 13–19. Chapters 1–11 prepared you for how to create mashups in general. I presented a lot of the technologies and showed how to build a reasonably sophisticated mashup with PHP and JavaScript as well as using mashup tools. Some of the discussion in this chapter will be amplified by in-depth discussions in Chapters 13–19. For example, I’ll refer to topics such as geoRSS, iCalendar, and microformats that I discuss in greater detail in those later chapters.
    [Show full text]
  • N8ek Kf N?8Kêj L
    :FM<IJKFIP Trackbacks in Drupal :fe]`^li`e^KiXZbYXZbj`e;ilgXc C<8M@E> 8o\cK\`Z_dXee#=fkfc`X KI8:BJ Trackbacks offer a simple means for bloggers to connect and share information. BY JAMES STANGER trackback is a way for a blogger With trackbacks, two seemingly unre- Several content management systems to automatically notify different lated conversations become more (CMSs) include trackback options. In N8EKKFBEFN 8blogs that he or she has either strongly associated. Each time an update Drupal [1], if you’ve enabled trackbacks, begun or extended a conversation with occurs in the conversation, the context a blogger on your system just has to another blogger. A trackback is one of becomes stronger and richer. Search en- enter the URL of a remote blogger who three main types of linkbacks (see the gines often rank pages higher if they are supports trackbacks, and the blogger N?8KÊJLGE<OK6 “Trackbacks and Linkbacks” box) that linked from other sites. Trackbacks thus will be notified. In this article, I describe bloggers use to keep track of each oth- promote higher ratings and perhaps how to set up trackbacks in Drupal with er’s postings and ensure that their read- more exposure for a project or product. examples based on the implementation ers can link to related content. Once a website has trackbacks enabled, one blogger can reach out to another on a separate site by sending a “ping” to that user. The ping simply says, “Here’s a topic that is related to what you’ve JL9J:I@9<KFC@ELO posted, check it out.” If a blogger on a separate site wants to D8>8Q@E<GI<M@<N# respond, the conversation between the two bloggers becomes stronger.
    [Show full text]
  • Life Sciences and the Web: a New Era for Collaboration
    Life Sciences and the web: a new era for collaboration The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Sagotsky, Jonathan A., Le Zhang, Zhihui Wang, Sean Martin, and Thomas S. Deisboeck. 2008. Life Sciences and the web: a new era for collaboration. Molecular Systems Biology 4: 201. Published Version doi:10.1038/msb.2008.39 Citable link http://nrs.harvard.edu/urn-3:HUL.InstRepos:4874800 Terms of Use This article was downloaded from Harvard University’s DASH repository, and is made available under the terms and conditions applicable to Other Posted Material, as set forth at http:// nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of- use#LAA Molecular Systems Biology 4; Article number 201; doi:10.1038/msb.2008.39 Citation: Molecular Systems Biology 4:201 & 2008 EMBO and Nature Publishing Group All rights reserved 1744-4292/08 www.molecularsystemsbiology.com PERSPECTIVE Life Sciences and the web: a new era for collaboration Jonathan A Sagotsky1, Le Zhang1, Zhihui Wang1, Sean Martin2 inaccuracies per article compared to Encyclopedia Britannica’s and Thomas S Deisboeck1,* 2.92 errors. Although measures have been taken to improve the editorial process, accuracy and completeness remain valid 1 Complex Biosystems Modeling Laboratory, Harvard-MIT (HST) Athinoula concerns. Perhaps the issue is one in which it has become A Martinos Center for Biomedical Imaging, Massachusetts General Hospital, difficult to establish exactly what has actually been peer Charlestown, MA, USA and reviewed and what has not, given that the low cost of digital 2 Cambridge Semantics Inc., Cambridge, MA, USA publishing on the web has led to an explosive amount * Corresponding author.
    [Show full text]
  • Quality Spine Care
    Quality Spine Care Healthcare Systems, Quality Reporting, and Risk Adjustment John Ratliff Todd J. Albert Joseph Cheng Jack Knightly Editors 123 Quality Spine Care John Ratliff • Todd J. Albert Joseph Cheng • Jack Knightly Editors Quality Spine Care Healthcare Systems, Quality Reporting, and Risk Adjustment Editors John Ratliff Todd J. Albert Department of Neurosurgery Hospital for Special Surgery Stanford University New York, NY Stanford, CA USA USA Jack Knightly Joseph Cheng Atlantic Neurosurgical Specialists University of Cincinnati Morristown, NJ Cincinnati, OH USA USA ISBN 978-3-319-97989-2 ISBN 978-3-319-97990-8 (eBook) https://doi.org/10.1007/978-3-319-97990-8 Library of Congress Control Number: 2018957706 © Springer Nature Switzerland AG 2019 This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors, and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, express or implied, with respect to the material contained herein or for any errors or omissions that may have been made.
    [Show full text]
  • Integrating with Blogs
    CHAPTER 5 Integrating with Blogs Blogs (also known as weblogs) have become lightweight, general-purpose platforms for publication, self-expression, and collaboration. Bloggers push the limits of new-media production, especially in the area of integration, because they want ultimately to discuss anything they can see or think or hear—without any effort, of course. Because you can directly tie blogs in with other systems—often without any programming on your own part—you’ll now study how to combine blogs with other applications and data sources. In this chapter, I cover end-user functionality that lets you publish content to a blog from a web site or a desktop application. In Chapter 7, you’ll study how you can program the relevant web APIs to read and publish blog content. I close this chapter by applying lessons from blog integration to wikis, which I believe are ripe for a similar type of remixing. In this chapter, you will do the following: * You’ll learn how to configure your WordPress or Blogger blog to receive pictures from Flickr through Flickr’s Blog This button. * You’ll study the mechanisms behind blog integration by studying how it’s done with Flickr. * You’ll learn how to use a desktop blogging client to take advantage of a richer writing environment for blogging. * You’ll see how the combination of syndication feeds and blogging can be recursive (that is, how content from blogs can be refashioned into new blog entries). * You’ll experience the forward-looking social browser integration of Flock, which combines a Web browser, Flickr photos, and blogging all in one user interface.
    [Show full text]
  • Trackback Spam: Abuse and Prevention
    TrackBack Spam: Abuse and Prevention Elie Bursztein∗ Peifung E. Lam* John C. Mitchell* Stanford University Stanford University Stanford University [email protected] pfl[email protected] [email protected] ABSTRACT The TrackBack mechanism [3] is used to automatically insert Contemporary blogs receive comments and TrackBacks, which cross-references between blogs. A new blog post citing an result in cross-references between blogs. We conducted a lon- older one on a different blog can use the TrackBack interface gitudinal study of TrackBack spam, collecting and analyzing to insert a link in the older post automatically. TrackBacks almost 10 million samples from a massive spam campaign are an intrinsic part of the blogosphere, and a key ingredient over a one-year period. Unlike common delivery of email used in blog ranking (Sec.2). Because TrackBacks are auto- spam, the spammers did not use bots, but took advantage of mated, CAPTCHA and registration requirements cannot be an official Chinese site as a relay. Based on our analysis of used to protect the TrackBack mechanism. Over the last few TrackBack misuse found in the wild, we propose an authenti- years, abuse of the TrackBack mechanism has emerged as a cated TrackBack mechanism that defends against TrackBack key problem, with some attacks causing sites to disable this spam even if attackers use a very large number of different feature [5]. So far, however, very little research has been con- source addresses and generate unique URLs for each Track- ducted on how TrackBack spam is carried out in the wild. To Back blog. better understand how attackers currently abuse the Track- Back mechanism and to help design better defenses in the future, we instrumented an operating blog site and collected 1.
    [Show full text]
  • Elie Bursztein, Baptiste Gourdin, John Mitchell Stanford University & LSV-ENS Cachan
    Talkback: Reclaiming the Blogsphere Elie Bursztein, Baptiste Gourdin, John Mitchell Stanford University & LSV-ENS Cachan 1 What is a blog ? • A Blog ("Web log") is a site, usually maintained by an individual with • Regular entries • Commentary • LinkBack • Entries displayed in reverse-chronological order. http://elie.im/blog Elie Bursztein, Baptiste Gourdin, John Mitchell TalkBack: reclaiming the blogosphere from spammer http://ly.tl/p21 Key Statistics • 184 Millions blogs • 73% of users read blogs • 50% post comments universalmccann Elie Bursztein, Baptiste Gourdin, John Mitchell TalkBack: reclaiming the blogosphere from spammer http://ly.tl/p21 Anatomy of a blog post Elie Bursztein, Baptiste Gourdin, John Mitchell TalkBack: reclaiming the blogosphere from spammer http://ly.tl/p21 Why blogs are special ? User Elie Bursztein, Baptiste Gourdin, John Mitchell TalkBack: reclaiming the blogosphere from spammer http://ly.tl/p21 Why blogs are special ? User Elie Bursztein, Baptiste Gourdin, John Mitchell TalkBack: reclaiming the blogosphere from spammer http://ly.tl/p21 What is a TrackBack ? Elie Bursztein, Baptiste Gourdin, John Mitchell TalkBack: reclaiming the blogosphere from spammer http://ly.tl/p21 Trackback Illustrated Little Timmy said to me... "What's Trackback, Daddy?" "Wow! Jimmy Lightning has written the best 1. post ever! It's so funny! And it's true! That's "Best Post Ever" why it's so good. I need to tell the world!" "Check it out world! I've "Jimmy written all about Jimmy 2. Lightning is Lightning's post on my Elie Bursztein, Baptiste Gourdin, John Mitchell swell"TalkBack: reclaiming the blogosphere from spammerweblog. My weblog's http://ly.tl/p21 called 'The Unbloggable Blogness of Blogging'.
    [Show full text]
  • Information Retrieval and Mining in Distributed Environments Studies in Computational Intelligence,Volume 324 Editor-In-Chief Prof
    Alessandro Soro, Eloisa Vargiu, Giuliano Armano, and Gavino Paddeu (Eds.) Information Retrieval and Mining in Distributed Environments Studies in Computational Intelligence,Volume 324 Editor-in-Chief Prof. Janusz Kacprzyk Systems Research Institute Polish Academy of Sciences ul. Newelska 6 01-447 Warsaw Poland E-mail: [email protected] Further volumes of this series can be found on our homepage: springer.com Vol. 313. Imre J. Rudas, J´anos Fodor, and Janusz Kacprzyk (Eds.) Vol. 301. Giuliano Armano, Marco de Gemmis, Computational Intelligence in Engineering, 2010 Giovanni Semeraro, and Eloisa Vargiu (Eds.) ISBN 978-3-642-15219-1 Intelligent Information Access, 2010 Vol. 314. Lorenzo Magnani,Walter Carnielli, and ISBN 978-3-642-13999-4 Claudio Pizzi (Eds.) Vol. 302. Bijaya Ketan Panigrahi,Ajith Abraham, Model-Based Reasoning in Science and Technology, 2010 and Swagatam Das (Eds.) ISBN 978-3-642-15222-1 Computational Intelligence in Power Engineering, 2010 Vol. 315. Mohammad Essaaidi, Michele Malgeri, and ISBN 978-3-642-14012-9 Costin Badica (Eds.) Vol. 303. Joachim Diederich, Cengiz Gunay, and Intelligent Distributed Computing IV, 2010 James M. Hogan ISBN 978-3-642-15210-8 Recruitment Learning, 2010 Vol. 316. Philipp Wolfrum ISBN 978-3-642-14027-3 Information Routing, Correspondence Finding, and Object Vol. 304.Anthony Finn and Lakhmi C. Jain (Eds.) Recognition in the Brain, 2010 Innovations in Defence Support Systems, 2010 ISBN 978-3-642-15253-5 ISBN 978-3-642-14083-9 Vol. 317. Roger Lee (Ed.) Vol. 305. Stefania Montani and Lakhmi C. Jain (Eds.) Computer and Information Science 2010 Successful Case-Based Reasoning Applications-1, 2010 ISBN 978-3-642-15404-1 ISBN 978-3-642-14077-8 Vol.
    [Show full text]
  • In the Spirit of Georgeʼs Request for Feedback from The
    In the spirit of Georgeʼs request for feedback from the employees I tossed together most of the things that buzz around my head as things we could do to take advantage of the fact we online media not print media. These are all technology related ideas and suggestions that apply both to efficiency issues internally and to B2B and consumers. Syndication Syndication on the web has formalized around RSS feeds. Aggregators like Google News use this, and users use this via RSS client software or aggregators like Google Reader. RSS clients exist for almost every platform from the PC to the iPhone. Optimally every page on the site that lists content should be available as an RSS feed including search results. For instance, I should be able to subscribe to an RSS feed for the “Europe” regional portal page, or the Terrorism Weekly, or the search results of the term “Putin”. Aggregation The flip side of pervasive syndication via RSS on the Web is the ability for users to specify feeds from other sites they would like to look at in what place or for sites like ours to pull content or links to content via RSS that is in context to our content. For instance, titles to recent NYTIMES articles on Europe on our Europe regional portal page. Links and References Weʼve been very good internally at taking advantage of our existing content and linking to previous articles in context within a new article. What we donʼt do so much is link to outside sources of information within our articles or provide links to relevent information as a footnote to articles.
    [Show full text]
  • Set-Up E-Jurnal Menggunakan Open Journal Systems (Ojs)
    SET-UP E-JURNAL MENGGUNAKAN OPEN JOURNAL SYSTEMS (OJS) Busro Pengelola Wawasan Jurnal Ilmiah Agama & Sosial Budaya Fakultas Ushuluddin UIN Sunan Gunung Djati Bandung http://journal.uinsgd.ac.id/index.php/jw/index http://bit.ly/setuprji ELEKTRONIK JOURNAL ≠ ONLINE JOURNAL Submission • online e-Journal Editor Assignment • online Reviewing • online Editing • online Publishing Persepsi Situs Jurnal Issue Paper MENGAPA HARUS JURNAL ONLINE DAN OJS Jurnal online berguna untuk meningkatkan kualitas dan kuatitas publikasi ilmiah Jurnal online dalam segi manajemen pengelolaan lebih efisien, hemat, murah, dan bisa dibaca oleh semua orang dari penjuru dunia Jurnal online mampu mencegah adanya plagiat pada artikel OJS membantu untuk setiap tahap proses penerbitan journal, dari pengajuan sampai publikasi online dan pengindeksan. Melalui sistem manajemen, pengindeksan yang berbau penelitian, dan konteks yang lengkap untuk penelitian, OJS berusaha untuk meningkatkan kualitas ilmiah dan penelitian dimaksud yang dikelola oleh OJS. OJS adalah perangkat lunak open source tersedia secara bebas untuk jurnal di seluruh dunia untuk tujuan membuat akses terbuka penerbitan pilihan yang layak untuk lebih jurnal, dengan akses terbuka dapat meningkatkan pembaca jurnal serta kontribusinya terhadap kepentingan publik pada skala global. ADAPUN FITUR-FITUR DALAM SISTEM OJS OJS diinstal secara lokal dan dikendalikan secara lokal Oleh Pemilik atau penginstal OJS. Terdapat Editor untuk mengkonfigurasi persyaratan, bagian, proses review journal, dll Terdapat Fitur Pendaftaran online dan pengelolaan semua konten journal yang telah kita buat. Terdapat Modul Tambahan dengan Level akses terbuka. Pengindeksan konten yang Komprehensif pada sistem global. Terdapat Fitur Reading Tools untuk konten, berdasarkan bidang dan pilihan editor. Terdapat Email pemberitahuan dan komentar untuk pembaca. Konteks-sensitif Lengkap dengan dukungan Bantuan online.
    [Show full text]
  • Web Crawling, Analysis and Archiving
    Web Crawling, Analysis and Archiving Vangelis Banos Aristotle University of Thessaloniki Faculty of Sciences School of Informatics Doctoral dissertation under the supervision of Professor Yannis Manolopoulos October 2015 Ανάκτηση, Ανάλυση και Αρχειοθέτηση του Παγκόσμιου Ιστού Ευάγγελος Μπάνος Αριστοτέλειο Πανεπιστήμιο Θεσσαλονίκης Σχολή Θετικών Επιστημών Τμήμα Πληροφορικής Διδακτορική Διατριβή υπό την επίβλεψη του Καθηγητή Ιωάννη Μανωλόπουλου Οκτώβριος 2015 i Web Crawling, Analysis and Archiving PhD Dissertation ©Copyright by Vangelis Banos, 2015. All rights reserved. The Doctoral Dissertation was submitted to the the School of Informatics, Faculty of Sci- ences, Aristotle University of Thessaloniki. Defence Date: 30/10/2015. Examination Committee Yannis Manolopoulos, Professor, Department of Informatics, Aristotle University of Thes- saloniki, Greece. Supervisor Apostolos Papadopoulos, Assistant Professor, Department of Informatics, Aristotle Univer- sity of Thessaloniki, Greece. Advisory Committee Member Dimitrios Katsaros, Assistant Professor, Department of Electrical & Computer Engineering, University of Thessaly, Volos, Greece. Advisory Committee Member Athena Vakali, Professor, Department of Informatics, Aristotle University of Thessaloniki, Greece. Anastasios Gounaris, Assistant Professor, Department of Informatics, Aristotle University of Thessaloniki, Greece. Georgios Evangelidis, Professor, Department of Applied Informatics, University of Mace- donia, Greece. Sarantos Kapidakis, Professor, Department of Archives, Library Science and Museology, Ionian University, Greece. Abstract The Web is increasingly important for all aspects of our society, culture and economy. Web archiving is the process of gathering digital materials from the Web, ingesting it, ensuring that these materials are preserved in an archive, and making the collected materials available for future use and research. Web archiving is a difficult problem due to organizational and technical reasons. We focus on the technical aspects of Web archiving.
    [Show full text]
  • Critical Point of View: a Wikipedia Reader
    w ikipedia pedai p edia p Wiki CRITICAL POINT OF VIEW A Wikipedia Reader 2 CRITICAL POINT OF VIEW A Wikipedia Reader CRITICAL POINT OF VIEW 3 Critical Point of View: A Wikipedia Reader Editors: Geert Lovink and Nathaniel Tkacz Editorial Assistance: Ivy Roberts, Morgan Currie Copy-Editing: Cielo Lutino CRITICAL Design: Katja van Stiphout Cover Image: Ayumi Higuchi POINT OF VIEW Printer: Ten Klei Groep, Amsterdam Publisher: Institute of Network Cultures, Amsterdam 2011 A Wikipedia ISBN: 978-90-78146-13-1 Reader EDITED BY Contact GEERT LOVINK AND Institute of Network Cultures NATHANIEL TKACZ phone: +3120 5951866 INC READER #7 fax: +3120 5951840 email: [email protected] web: http://www.networkcultures.org Order a copy of this book by sending an email to: [email protected] A pdf of this publication can be downloaded freely at: http://www.networkcultures.org/publications Join the Critical Point of View mailing list at: http://www.listcultures.org Supported by: The School for Communication and Design at the Amsterdam University of Applied Sciences (Hogeschool van Amsterdam DMCI), the Centre for Internet and Society (CIS) in Bangalore and the Kusuma Trust. Thanks to Johanna Niesyto (University of Siegen), Nishant Shah and Sunil Abraham (CIS Bangalore) Sabine Niederer and Margreet Riphagen (INC Amsterdam) for their valuable input and editorial support. Thanks to Foundation Democracy and Media, Mondriaan Foundation and the Public Library Amsterdam (Openbare Bibliotheek Amsterdam) for supporting the CPOV events in Bangalore, Amsterdam and Leipzig. (http://networkcultures.org/wpmu/cpov/) Special thanks to all the authors for their contributions and to Cielo Lutino, Morgan Currie and Ivy Roberts for their careful copy-editing.
    [Show full text]