Studying and Documenting Web Archives Provenance Emily Maemura, Phd Candidate Faculty of Information, University of Toronto Netlab Forum February 27, 2018 the Team

Total Page:16

File Type:pdf, Size:1020Kb

Studying and Documenting Web Archives Provenance Emily Maemura, Phd Candidate Faculty of Information, University of Toronto Netlab Forum February 27, 2018 the Team If These Crawls Could Talk: Studying and Documenting Web Archives Provenance Emily Maemura, PhD Candidate Faculty of Information, University of Toronto NetLab Forum February 27, 2018 The Team Nich Worby Christoph Becker Ian Milligan Librarian, Assistant Professor Associate Professor, Web Archivist Director, Digital Curation Institute Digital Historian Emily Maemura Doctoral Candidate brought together through the Digital Curation Institute McLuhan Centenary Fellowship in Digital Sustainability 2016-17 and supported by SSHRC Number of Pages per Domain, Canadian Political Parties Collection, 2005-2015 http://lintool.github.io/warcbase/vis/crawl-sites/ Pages from “policyalternatives.ca” Sept. 2007 - Nov. 2009 Dec. 2009 - May 2015 over 200.000 less than 50.000 pages per crawl pages per crawl How are web archives made and used? How can we document or communicate this? Creating web archives Using web archives As a web archivist... As a researcher… What do I need to document What do I need to ask about for researchers using this data to have confidence in web archives material? the analysis and findings? Researcher Perspective: “Web Archives Research Objects” Maemura, Milligan, Becker, 2016. Understanding computational web archives research methods using research objects, Computational Archival Science Workshop at IEEE Big Data 2016 http://hdl.handle.net/1807/74866 How are web archives used by researchers? Can shared conceptual frameworks of research methods used with Web Archives collections help to systematize practices, advance the field, and make it easier to introduce new researchers to the area? “Research Objects” in Computational Science Bechhofer et al., 2013, Why Linked Data is Not Enough for Scientists, Future Generation Computer Systems 29(2), February 2013, Pages 599-611 www.researchobject.org Our Approach Adopting the Research Object (RO) Framework for Research with Web Archives ● to structure the complex aggregation of computational processes, services, forms of data, contexts, and approaches ● balancing systematic approaches from computational sciences with humanistic issues of provenance and trust Developed here to characterize three cases of web archives research studies, examples completed by co-author Ian Milligan “Research Objects” as a Conceptual Framework ORGANIZATIONAL CONTEXT Ethical and governance approvals, investigators, etc. Acknowledgements QUESTIONS STUDY DESIGN ANSWERS state a problem scope, rationale for choices publications and and/or hypothesis in tools, sources, methodology presentations DATA METHODS RESULTS materials studied, the tools, workflows, scripts, materials produced those taken as processes, settings, configurations, derived datasets, inputs to processes used to perform the analysis visualizations, etc. Case: GeoCities Community Milligan, I. (2017). Welcome to the Web: The Online Community of GeoCities and the Early Years of the World Wide Web. In R. Schroeder & N. Brügger (Eds.), The Web as History. ● sole-authored research by a historian ORGANIZATIONAL CONTEXT ● no ethics approval since data is publicly QUESTIONS ANSWERS available via Wayback Machine STUDY DESIGN ● research agreement signed with Internet DATA, METHODS, & RESULTS Archive limits sharing and publication of specific derivative datasets Case: GeoCities Community Research Question: Did GeoCities users have a sense of community? ● also understanding what tools and ORGANIZATIONAL CONTEXT approaches historians need to study the web ● iterative approach, exploratory investigation QUESTIONS ANSWERS STUDY DESIGN of archived GeoCities.com data DATA, METHODS, & RESULTS ● development of Warcbase analytics platform Case: GeoCities Community DATA METHODS RESULTS Analysis Phase 1 (2013): data from Archive Team data prep with bash scripts, analysis with Mathematica torrent (wget geocities.com) html2text.py scripts, bash (on sample) ~1TB ~4TB Analysis Phase 2 (2016): analysis of raw link structures data from Internet Archive data prep and selection popular images ordered by end-of-life crawl done within warcbase MD5 hash An Initial Profile for Web Archives Research Objects Disciplinary perspectives of Interpretation of results is researchers, roles in large ORGANIZATIONAL CONTEXT an important part of teams and partnerships humanities scholarship, described in publication of QUESTIONS ANSWERS Legal agreements and findings (not simply the contracts that impact data STUDY DESIGN results of workflows) sharing and use DATA, METHODS, & RESULTS Designs vary widely by Motivations for the study discipline and conceptual and research contributions perspectives - for whom is the work Include rationale for selecting relevant? sources, methods, scope How was data sourced? Which scripts, services, Are results FAIR, published, Collection timeframe(s) software packages used - citable? How were they formats and interoperability was code published? validated? Template and Workshop at RESAW 2017, London ORGANIZATIONAL CONTEXT Discipline(s) of Funding Approvals Partnerships Agreements & Acknowledgements Research Team Contracts QUESTIONS STUDY DESIGN ANSWERS ? Scope Rationale Research questions and limits of time period, for choices of tools, Publications and Presentations motivations for the study geographic area, etc. sources, methods of findings DATA, METHODS & RESULTS Data Sources Data Preparation Data Analysis Results Generated metadata, method of collection workflows and derived datasets methods, config settings, logs published figures, data, code Template and Workshop at RESAW 2017, London ORGANIZATIONAL CONTEXT Discipline(s) of Funding Approvals Partnerships Agreements & Acknowledgements Research Team Contracts QUESTIONS STUDY DESIGN ANSWERS ? Scope Rationale Research questions and limits of time period, for choices of tools, Publications and Presentations motivations for the study geographic area, etc. sources, methods of findings DATA, METHODS & RESULTS Data Sources Data Preparation Data Analysis Results Generated metadata, method of collection workflows and derived datasets methods, config settings, logs published figures, data, code Where do these come from, how are they made? Web Archivist Perspective: “Elements of Provenance” Maemura, Worby, Milligan, Becker, 2017. Origin Stories: documentation for web archives provenance, Web Archiving Week / IIPC Web Archiving Conference How are web archives made? Which individual decisions are made in web archives practices, and how can we understand and communicate the impacts on resulting collections? Our Approach Starting with Web Archiving Life Cycle Model (Bragg et al. 2013) as a framework: ● What decisions are at each life cycle phase? (e.g. Appraisal and selection; Scoping; Data Capture; Storage and Organization; Quality Assurance and Analysis) ● Studying the process of using Archive-It to create three web archives collections by University of Toronto Libraries (UTL) http://ait.blog.archive.org/files/2014/04/archiveit_life_cycle_model.pdf Overview of the Collections Studied Canadian Political Parties Pan Am Games Global Summitry Collection October 2005 to February 2015 to June 2016 to Timeframe present (ongoing) December 2016 present (ongoing) Crawl frequency Quarterly (every 3 months) Combination of Daily, TBD, based on timing of Weekly, Monthly and summit events One-time crawls Crawl duration 3 days Varies widely by crawl (from 5 days (currently test hours to days) crawls only) # of Active Seeds 62 434 167 Total data >900 GB >100 GB >400 GB archived* >29,000,000 documents >3,500,000 documents >5,000,000 documents Crawl limits and Ignore robots.txt Ignore robots.txt Ignore robots.txt rules specified Block twitter.com URLs for “lang=?” General Workflow for UTL crawls Importance of iterations, different types of crawls: test crawls, production crawls, patch crawls Managing organizational data budget across multiple collections test crawls may reveal size of material a collection set to run 4 crawls a on web is bigger than expected, requires year may have many more due to re-working scope or renegotiating data patch crawls budget allocated in proposal Three Key Findings 1) Scoping: decisions are made throughout 2) Process: Unforeseen issues arise during a crawl - the actions taken to resolve these issues need to be documented 3) Context: individual decisions interact and are influenced by changes in organizational context and wider environment, impacting the collection over time Interdependencies and interactions between factors Is a site with robots.txt exclusions captured? Does the technical Does the legal Does organizational system allow? environment permit? policy guide action? ☑Yes, option ☑Yes, after ☑Yes, new law available in Archive-It, Copyright Law interpreted in Permissions since 2010 Amendment, 2013 Policy, 2014 Individual curatorial choice to exclude sites with robots.txt for a particular collection or crawl Elements of Scoping, Process, Context to Document Element Key Questions and Information to Document Motivation What is the purpose of the collection? Has its mandate changed over time? Focus Which geographic, temporal, technical, political, topical and/or social boundaries are defined to scope the collection? Access & Discovery Who is the intended audience? Do they have known characteristics or needs? Which contractual, organizational, legal, or other agreements restrict access? What metadata fields and indexes support discovery? At what degree of granularity (by collection, site, or individual resource)? Which data formats or derivative datasets are available? Seed list What seeds
Recommended publications
  • INST 785 Section 0101 Documentation, Collection, and INST 785 Appraisal of Records Spring 2020
    Course Syllabus – INST 785 Section 0101 Documentation, Collection, and INST 785 Appraisal of Records Spring 2020 Course Description Dr. Eric Hung he / him / his Appraisal is considered to be the archivist’s “first responsibility.” The [email protected] responsibility is “first” because appraisal comes first in the sequence of archival functions and thus influences all subsequent archival activities, and it is “first” in importance because appraisal determines what tiny sliver of Class Meetings: the total human documentary production will actually become “archives” Tuesdays, 6:00-8:45pm and thus a part of society’s history and collective memory. The archivist is HBK 0105 thereby actively shaping the future’s history of our own times. Office Hours The topic of appraisal remains one of considerable controversy in archives. ELMS Chat Office Hours: The archival literature includes debates over the definitions and indicators Mondays, 2:00-3:00pm, of long-term value, the purpose of appraisal, who intervenes in appraisal or by appointment. If you decisions, when in the information life cycle do they intervene, and which want to meet in-person, I methods work for which types of records and which types of organizations. am generally on campus The literature is replete with tensions between the theory and practice of on Tuesdays. appraisal and between questions of universalism versus specificity (by type of record, media, type of organization, time period, country, etc.). Syllabus Policy One of the problems with the literature on appraisal is that there are few This syllabus is a guide for methods for rigorously evaluating the feasibility or effectiveness of different the course and is subject appraisal methodologies.
    [Show full text]
  • Archiving 2016 Preliminary Program
    M ARCHIVING2016 A April 19-22, 2016 • Washington, DC R G www.imaging.org/archiving General Chair: Kari Smith, O MIT Libraries, Institute Archives and Special Collections R P Y R A N I M I L E R P Sponsored by the Society for Imaging Science and Technology April 19-22, 2016 • Washington, DC About the Conference The IS&T Archiving Conference brings together provides a forum to explore new strategies an international community of imaging experts and policies, and reports on successful projects and technicians as well as curators, managers, that can serve as benchmarks in the field. and researchers from libraries, archives, mu- Archiving 2016 is a blend of short courses, seums, records management repositories, in- invited focal papers, keynote talks, and formation technology institutions, and com- peer-reviewed oral and interactive display mercial enterprises to explore and discuss the presentations, offering attendees a unique field of digitization of cultural heritage and opportunity for gaining and exchanging archiving. The conference presents the latest knowledge and building networks among research results on digitization and curation, professionals. Cooperating Societies • American Institute for Conservation Foundation of the American Institute for Conservation (AIC) • ALCTS Association for Library Collections & Technical Services • Coalition for Networked Information (CNI) • Digital Library Federation at CLIR . • Digital Preservation Coalition (DPC) s e g o • IOP/Printing & Graphics Science Group V h p o t s • ISCC – Inter-Society Color Council i r h C • Museum Computer Network (MCN) : o t o h • The Royal Photographic Society P Short courses offer an intimate setting to gain more in-depth knowledge about technical aspects of digital archiving.
    [Show full text]
  • Digital Preservation Workflows for Museum Imaging Environments
    The 11th International Symposium on Virtual Reality, Archaeology and Cultural Heritage VAST (2010) A. Artusi, M. Joly, D. Pitzalis, G. Lucet, and A. Ribes (Editors) Digital preservation workflows for museum imaging environments Michael Ashley1 1 Cultural Heritage Imaging, San Francisco, California, USA From: Principles and Practices of Robust, Photography-based Digital Imaging Techniques for Museums. Contributors: Mark Mudge, Carla Schroer, Graeme Earl, Kirk Martinez, Hembo Pagi, Corey Toler-Franklin, Szymon Rusinkiewicz, Gianpaolo Palma, Melvin Wachowiak, Michael Ashley, Neffra Matthews, Tommy Noble, Matteo Dellepiane. Edited by Mark Mudge and Carla Schroer, Cultural Heritage Imaging. The 11th International Symposium on Virtual Reality, Archaeology and Cultural Heritage VAST (2010) A. Artusi, M. Joly, D. Pitzalis, G. Lucet, and A. Ribes (Editors) This is a pre-print copy of a paper that is currently under review. Please respect the intellectual property of the authors and the efforts of the publishers and reviewers until its publication. If you find any errors that need modification please contact the author at [email protected]. Meanwhile this pdf is distributed to interested readers under a Creative Commons 3.0 license. M. Ashley / Digital Preservation Workflows For Museum Imaging Environments Digital preservation workflows for museum imaging environments Michael Ashley1 1 Cultural Heritage Imaging, San Francisco, California, USA Abstract We discuss and demonstrate practical digital preservation frameworks that protect images throughout the entire production life-cycle. Using off the shelf and open source software coupled with a basic understanding of metadata, it is possible to produce and manage high value digital representations of physical objects that are born archive- ready and long-term sustainable.
    [Show full text]
  • 10 Small Scale Academic Web Archiving: DACHS
    10 Small Scale Academic Web Archiving: DACHS Hanno E. Lecher Leiden University [email protected] 10.1 Why Small Scale Academic Archiving? Considering the complexities of Web archiving and the demands on hard- and software as well as on expertise and personnel, one wonders whether such projects are only feasible for large scale institutions such as national libraries, or whether smaller institutions such as museums, university depart- ments and the like would also be able to perform the tasks required for a Web archive with long-term perspective. Even if the answer to this is yes, the question remains whether this is necessary at all. One could think that the Internet Archive in combination with the efforts of the increasing number of national libraries is already covering many, if not most relevant Web resources. Does academic or other small scale Web archiving make sense at all? Let me begin with this second question. The Internet Archive has done groundbreaking work as the first initiative attempting comprehensive archiv- ing of Web resources. Its success in accomplishing this has been revolution- ary, and it has laid the foundation on which many other projects have built their work. Still, examining what the Internet Archive and other holistic pro- jects1 can achieve it is easy to discover some limitations. Since their focus of collection is very broad, they have to rely on robots for a large part of their collecting activities, automatically grabbing as many Web pages as possible. This kind of capturing is often very superficial, missing parts located further down the tree, many pages being downloaded incompletely, and some file types as well as the hidden Web being ignored altogether.
    [Show full text]
  • Introducticn Tc Ccnservoticn
    introducticntc ccnservoticn UNITED NATTONSEDUCATIONA],, SCIEIilIIFTC AND CULTIJRALOROANIZATTOII AN INIRODUCTION TO CONSERYATIOI{ OF CULTURAT PROPMTY by Berr:ar"d M. Feilden Director of the Internatlonal Centre for the Preservatlon and Restoratlon of Cultural Property, Rome Aprll, L979 (cc-ig/ws/ttt+) - CONTENTS Page Preface 2 Acknowledgements Introduction 3 Chapter* I Introductory Concepts 6 Chapter II Cultural Property - Agents of Deterioration and Loss . 11 Chapter III The Principles of Conservation 21 Chapter IV The Conservation of Movable Property - Museums and Conservation . 29 Chapter V The Conservation of Historic Buildings and Urban Conservation 36 Conclusions ............... kk Appendix 1 Component Materials of Cultural Property . kj Appendix 2 Access of Water 53 Appendix 3 Intergovernmental and Non-Governmental International Agencies for Conservation 55 Appendix k The Conservator/Restorer: A Definition of the Profession .................. 6? Glossary 71 Selected Bibliography , 71*. AUTHOR'S PREFACE Some may say that the attempt to Introduce the whole subject of Conservation of Cultural Propety Is too ambitious, but actually someone has to undertake this task and it fell to my lot as Director of the International Centre for the Study of the Preservation and Restoration of Cxiltural Property (ICCROM). An introduction to conservation such as this has difficulties in striking the right balance between all the disciplines involved. The writer is an architect and, therefore, a generalist having contact with both the arts and sciences. In such a rapidly developing field as conservation no written statement can be regarded as definite. This booklet should only be taken as a basis for further discussions. ACKNOWLEDGEMENTS In writing anything with such a wide scope as this booklet, any author needs help and constructive comments.
    [Show full text]
  • Selection in Web Archives: the Value of Archival Best Practices
    WITTENBERG: SELECTION IN WEB ARCHIVES Selection in Web Archives: The Value of Archival Best Practices Jamie Wittenberg, University of Illinois at Urbana-Champaign, United States of America Abstract: The abundance of valuable material available online has mobilized the development of preservation initiatives at collecting institutions that aim to capture and contextualize web content. Web archiving selection criteria are driven by the limitations inherent in harvesting technologies. Observing core archival principles like provenance and original order when establishing collection development policies for web content will help to ensure that archives continue to assure the authenticity of the materials they steward. Keywords: Web Archives; Archival Theory; Digital Libraries; Internet Content; Selection and Appraisal Introduction The abundance of valuable material available online has mobilized the development of preservation initiatives at collecting institutions that aim to capture and contextualize web content. Methodologies for web collection practices are institution and collection-specific. Among institutions charged with preserving cultural heritage, web archiving has become commonplace. However, the disparity between institutional selection and appraisal criteria reveals the absence of standardization for web archive establishment. The Australian web archive, for example, accessions content that it evaluates as having long-term research value. The Library of Congress web archive, represented by its Minerva team, established a collection
    [Show full text]
  • 2016 Technical Guidelines for Digitizing Cultural Heritage Materials
    September 2016 Technical Guidelines for Digitizing Cultural Heritage Materials Creation of Raster Image Files i Document Information Title Editor Technical Guidelines for Digitizing Cultural Heritage Materials: Thomas Rieger Creation of Raster Image Files Document Type Technical Guidelines Publication Date September 2016 Source Documents Title Editors Technical Guidelines for Digitizing Cultural Heritage Materials: Don Williams and Michael Creation of Raster Image Master Files Stelmach http://www.digitizationguidelines.gov/guidelines/FADGI_Still_Image- Tech_Guidelines_2010-08-24.pdf Document Type Technical Guidelines Publication Date August 2010 Title Author s Technical Guidelines for Digitizing Archival Records for Electronic Steven Puglia, Jeffrey Reed, and Access: Creation of Production Master Files – Raster Images Erin Rhodes http://www.archives.gov/preservation/technical/guidelines.pdf U.S. National Archives and Records Administration Document Type Technical Guidelines Publication Date June 2004 This work is available for worldwide use and reuse under CC0 1.0 Universal. ii Table of Contents INTRODUCTION ........................................................................................................................................... 7 SCOPE .......................................................................................................................................................... 7 THE FADGI STAR SYSTEM .......................................................................................................................
    [Show full text]
  • Art Deaccessioning Policy
    Deaccessioning Policy Document No. CUR 4.0 – Deaccessioning Policy Page 1 Approved: December 2020 Approved by: National Gallery of Australia Council next review due: December 2022 Summary Name of Policy Description of Policy Deaccessioning Policy Policy applies to ☒ NGA wide ☐ Specific (eg. Department) Policy Status ☐ New policy ☒ Revision of Existing Policy (previously Art Acquisition Policy) Approval Authority Director Responsible Officer Assistant Director, Artistic Programs Contact area Artistic Programs Date of Policy Review* October 2022 Related Policies, Procedures, National Gallery Act 1975 Guidelines and Local Protocols Public Governance, Performance and Accountability Act 2013 Council Instructions including Financial Delegations Aboriginal and Torres Strait Islander Cultural Rights and Engagement Policy Due Diligence and Provenance Policy Acquisitions Policy Research Library Collection Development Policy Research Archive Acquisition Policy The Copyright Act 1968 The Privacy Act 1988 Privacy Policy Australian Best Practice Guide to Collecting Cultural Material 2015 Collections Law: Legal issues for Australian Archives, Galleries, Libraries and Museums *Unless otherwise indicated, this policy will still apply beyond the review date. Approvals Position Name Endorsed Date Assistant Director Natasha Bullock Yes Director Nick Mitzevich Yes Council Ryan Stokes Yes Document No. CUR 4.0 – Deaccessioning Policy Page 2 Approved: December 2020 Approved by: National Gallery of Australia Council next review due: December 2022 Table of contents
    [Show full text]
  • Yale University Library Preservation Department
    Yale University Library Preservation Department 45th Annual Report July 2015-June 2016 Roberta Pilette January 17, 2016 Preservation Department Annual Rpt FY2016 1 Yale University Library Preservation Department 45th Annual Report July 2015-June 2016 Roberta Pilette, Director of Preservation and Chief Preservation Officer Murray Harrison, Senior Administrative Assistant Preservation Staffing: July 1, 2015 June 30, 2016 Positions budgeted: C&T 11.00 11.00 M&P 10.47 11.00 Positions filled: C&T 9.00 11.00 M&P 10.47 11.00 OVERVIEW OF THE DEPARTMENT The Yale University Library Preservation Department is responsible for the long-term preservation of all library materials. The Department consists of four units—Preservation Services; Digital Reformatting & Microfilm Services (DRMS); Conservation & Exhibition Services (CES) including Collections Conservation & Housing (CCH), Special Collections Conservation (SCC) and Exhibit Production Support (EPS); and Digital Preservation Services. The Department organizational chart can be found in Appendix I, the annual statistics for the Department can be found in Appendix II. 344 Winchester moving & more construction The construction of that portion of the department associated with the Beinecke Rare Book & Manuscript (BRBL) Technical Services construction was completed during the first quarter of FY16. The move for Preservation Administration, Preservation Services, and Digital Preservation Services took place in August 2015. Those moves went smoothly and the units settled into the new spaces. Digital Preservation Services moved all of their equipment into their new spaces and spent the year making significant use of the enlarged work areas. Digital Preservation and the Born Digital Working group collaborated on the specifications for the new di Bonaventura Digital Archeology and Preservation laboratory.
    [Show full text]
  • 2016 Program
    Culture Builds Communities Preserving the Past, Shaping the Future International Conference of Indigenous Archives, Libraries, and Museums October 9–12, 2016 ▪ Phoenix, Arizona Presented by the Association of Tribal Archives, Libraries, and Museums with funding from the Institute of Museum and Library Services SCHOOL FOR ADVANCED RESEARCH ANNE RAY INTERNSHIPS Interested in working with Native American collections? The Indian Arts Research Center (IARC) at the School for Advanced Research (SAR) in Santa Fe, NM, offers two nine-month paid internships to college graduates or junior museum professionals. Internships include a salary, housing, and book and travel allowances. Interns participate in the daily collections and programming activities and also benefit from the mentorship of the Anne Ray scholar. Deadline to apply March 1 internships.sarweb.org ANNE RAY FELLOWSHIP FOR SCHOLARS Are you a Native American scholar with a master’s or PhD in the arts, humanities, or social sciences who has an interest in mentorship? Apply for a nine-month Anne Ray Fellowship at SAR. The Anne Ray scholar works independently on their own writing or curatorial research projects, while also providing mentorship to the Anne Ray interns working at the IARC. The fellow receives a stipend, housing, and office space. Deadline to apply November 1 annerayscholar.sarweb.org For more information about SAR, please visit www.sarweb.org INNOVATIVE SOCIAL SCIENCE AND NATIVE AMERICAN ART Welcome to the International Conference of Indigenous Archives, Libraries, and Museums Wild Horse Pass Resort & Spa, Phoenix, AZ October 9-12, 2016 About the Program Cover Table of Contents Harry Fonseca was among a critical generation of twentieth- Special Thanks, Page 2 century Native artists who, inspired by tradition, created work that transcended expectations.
    [Show full text]
  • Appraisal Talk in Web Archives Ed Summers
    Appraisal Talk in Web Archives ed summers ABSTRACT The Web is a vast and constantly changing information landscape that by its very nature seems to resist the idea of the archive. But for the last 20 years, archivists and technologists have worked together to build systems for doing just that. While technical infrastructures for performing web archiving have been well studied, surprisingly little is known about the interactions between archi- vists and these infrastructures. How do archivists decide what to archive from the Web? How do the tools for archiving the Web shape these decisions? This study analyzes a series of ethnographic interviews with web archivists to understand how their decisions about what to archive function as part of a community of practice. It uses critical discourse analysis to examine how the participants’ use of language enacts their appraisal decision-making processes. Findings suggest that the politics and positionality of the archive are reflected in the ways that archivists talk about their network of personal and organizational relationships. Appraisal decisions are expressive of the structural relationships of an archives as well as of the archivists’ identities, which form during mentoring relationships. Self-reflection acts as a key method for seeing the ways that interviewers and interviewees work together to construct the figured worlds of the web archive. These factors have implications for the ways archivists communicate with each other and interact with the communities that they document. The results help ground the encounter between archival practice and the architecture of the Web. Archivaria The Journal of the Association of Canadian Archivists Appraisal Talk in Web Archives 71 RÉSUMÉ Le Web est un paysage informationnel vaste et en changement constant qui, par sa nature même, semble s’opposer à l’idée de l’archive.
    [Show full text]
  • Web Archiving: Past Present and Future of Evolving Multimedia Legacy
    IARJSET ISSN (Online) 2393-8021 ISSN (Print) 2394-1588 International Advanced Research Journal in Science, Engineering and Technology Vol. 3, Issue 3, March 2016 Web Archiving: Past Present and Future of Evolving Multimedia Legacy Meenakshi Srivastava1, Dr. S.K. Singh2, Dr. S.Q. Abbas3 Assistant Professor, Amity Institute of Information Technology, Amity University, Lucknow, India1 Professor, Amity Institute of Information Technology, Amity University, Lucknow, India2 Professor, Ambalika Institute of Information technology, Computer Science Department, Lucknow, India3 Abstract: A continuous up gradation of tools and methods is going on for archiving the web in a better way. Although, most accepted practices and common model for web archiving is yet to evolve. Keywords: Web archiving, Harvesting, Multimedia, Technological tool. I. INTRODUCTION II. WORLD WIDE WEB (WWW) Internet has evolved itself at an exponential pace and so Internet has grown enormously since 1990 and hence has evolved its role in our life and society. It’s not an usage of WWW has also increased. An increase in usage exaggeration to mention that evolution of internet has of www has resulted in rise of information available online made several technologies obsolete much before than they and hence the quantity of contents also pertaining to it. were expected to be. Evolution is a part and parcel of life. Increase in usage of internet exposed its weakness also and Any technological tool that does not evolve, improve or biggest weakness which was realized very soon was the upgrade itself is static and would perish on its own. volatility of the contents available on internet [1]. Web has Realizing the fact that evolution is bound to happen, huge data and information which is used by people of methodology of recording the historical data should be varied backgrounds who are professional and designed.
    [Show full text]