Web Archiving
Recommended publications
Module 8 Wiki Guide
Best Practices for Biomedical Research Data Management
Harvard Medical School, The Francis A. Countway Library of Medicine
Module 8 Wiki Guide
Learning Objectives and Outcomes:
1. Emphasize characteristics of long-term data curation and preservation that build on and extend active data management
● It is the purview of permanent archiving and preservation to take over stewardship and ensure that the data do not become technologically obsolete and no longer permanently accessible.
○ The selection of a repository to ensure that certain technical processes are performed routinely and reliably to maintain data integrity
○ Determining the costs and steps necessary to address preservation issues such as technological obsolescence inhibiting data access
○ Consistent, citable access to data and associated contextual records
○ Ensuring that protected data stays protected through repository-governed access control
● Data Management
○ Refers to the handling, manipulation, and retention of data generated within the context of the scientific process
○ Use of this term has become more common as funding agencies require researchers to develop and implement structured plans as part of grant-funded project activities
● Digital Stewardship
○ Contributions to the longevity and usefulness of digital content by its caretakers that may occur within, but often outside of, a formal digital preservation program
○ Encompasses all activities related to the care and management of digital objects over time, and addresses all phases of the digital object lifecycle
2. Distinguish between preservation and curation
● Digital Curation
○ The combination of data curation and digital preservation
○ There tends to be a relatively strong orientation toward authenticity, trustworthiness, and long-term preservation
○ Maintaining and adding value to a trusted body of digital information for future and current use.
The Digital Curation Centre
a centre of expertise in data curation and preservation
The Digital Curation Centre
Hugh Corley, DCC Services and Outreach Liaison Officer
Funders: 1 University College London Information Day :: 16th January 2007

What is an Information Day
• Learn
• Discuss
• Action

Overview
• Digital Curation
• Aims of the DCC
• Work of the DCC

Digital Curation
• Active management of data over life-cycle of scholarly and scientific interest
– reproducibility
– reuse
• Appreciation of differences between disciplines
• Ubiquitous relationship with digital information

UK Digital Curation Centre
• CCLRC
• University of Edinburgh
• HATII
• UKOLN

UK Digital Curation Centre
Support and promote continuing improvement in the quality of data curation and digital preservation activity
Drivers
– increasing awareness that digital assets are reusable
– continuing access vital to ensure contemporary scholarship is reproducible and verifiable
– digital assets are fragile
INST 785 Section 0101: Documentation, Collection, and Appraisal of Records, Spring 2020
Course Syllabus: INST 785 Section 0101, Documentation, Collection, and Appraisal of Records, Spring 2020

Dr. Eric Hung (he / him / his)
[email protected]
Class Meetings: Tuesdays, 6:00-8:45pm, HBK 0105
Office Hours: ELMS Chat Office Hours: Mondays, 2:00-3:00pm, or by appointment. If you want to meet in-person, I am generally on campus on Tuesdays.
Syllabus Policy: This syllabus is a guide for the course and is subject

Course Description
Appraisal is considered to be the archivist’s “first responsibility.” The responsibility is “first” because appraisal comes first in the sequence of archival functions and thus influences all subsequent archival activities, and it is “first” in importance because appraisal determines what tiny sliver of the total human documentary production will actually become “archives” and thus a part of society’s history and collective memory. The archivist is thereby actively shaping the future’s history of our own times.
The topic of appraisal remains one of considerable controversy in archives. The archival literature includes debates over the definitions and indicators of long-term value, the purpose of appraisal, who intervenes in appraisal decisions, when in the information life cycle they intervene, and which methods work for which types of records and which types of organizations. The literature is replete with tensions between the theory and practice of appraisal and between questions of universalism versus specificity (by type of record, media, type of organization, time period, country, etc.).
One of the problems with the literature on appraisal is that there are few methods for rigorously evaluating the feasibility or effectiveness of different appraisal methodologies.
Archiving 2016 Preliminary Program
ARCHIVING 2016 Preliminary Program
April 19-22, 2016 • Washington, DC
www.imaging.org/archiving
General Chair: Kari Smith, MIT Libraries, Institute Archives and Special Collections
Sponsored by the Society for Imaging Science and Technology

About the Conference
The IS&T Archiving Conference brings together an international community of imaging experts and technicians as well as curators, managers, and researchers from libraries, archives, museums, records management repositories, information technology institutions, and commercial enterprises to explore and discuss the field of digitization of cultural heritage and archiving. The conference presents the latest research results on digitization and curation, provides a forum to explore new strategies and policies, and reports on successful projects that can serve as benchmarks in the field. Archiving 2016 is a blend of short courses, invited focal papers, keynote talks, and peer-reviewed oral and interactive display presentations, offering attendees a unique opportunity for gaining and exchanging knowledge and building networks among professionals.

Cooperating Societies
• American Institute for Conservation Foundation of the American Institute for Conservation (AIC)
• ALCTS Association for Library Collections & Technical Services
• Coalition for Networked Information (CNI)
• Digital Library Federation at CLIR
• Digital Preservation Coalition (DPC)
• IOP/Printing & Graphics Science Group
• ISCC – Inter-Society Color Council
• Museum Computer Network (MCN)
• The Royal Photographic Society

Photo: Christoph Voges.

Short courses offer an intimate setting to gain more in-depth knowledge about technical aspects of digital archiving.
Preservation Is Not a Place
Preservation Is Not a Place
Stephen Abrams, Patricia Cruse, John Kunze
California Digital Library, University of California
4th International Digital Curation Conference, December 2008

Abstract
The Digital Preservation Program of the California Digital Library (CDL) is engaged in a process of reinvention involving significant transformations of its outlook, effort, and infrastructure. This includes a re-articulation of its mission in terms of digital curation, rather than preservation; encouraging a programmatic, rather than a project-oriented approach to curation activities; and a renewed emphasis on services, rather than systems. This last shift was motivated by a desire to deprecate the centrality of the repository as place. Having the repository as the locus for curation activity has resulted in the deployment of a somewhat cumbersome monolithic system that falls short of desired goals for responsiveness to rapidly changing user needs and operational and administrative sustainability. The Program is pursuing a path towards a new curation environment based on the principle of devolving curation function to a set of small, simple, loosely-coupled services. In considering this new infrastructure the Program is relying upon a highly deliberative process starting from first principles drawn from library and archival science. This is followed by a stagewise progression of identifying core preservable values, devising strategies promoting those values, defining abstract services embodying those strategies, and, finally, developing systems that instantiate those services. This paper presents a snapshot of the Program’s transformative efforts in its early phase.

The 4th International Digital Curation Conference taking place in Edinburgh, Scotland over 1-3 December 2008 will address the theme Radical Sharing: Transforming Science?
10 Small Scale Academic Web Archiving: DACHS
10 Small Scale Academic Web Archiving: DACHS
Hanno E. Lecher, Leiden University, [email protected]

10.1 Why Small Scale Academic Archiving?
Considering the complexities of Web archiving and the demands on hard- and software as well as on expertise and personnel, one wonders whether such projects are only feasible for large scale institutions such as national libraries, or whether smaller institutions such as museums, university departments and the like would also be able to perform the tasks required for a Web archive with long-term perspective. Even if the answer to this is yes, the question remains whether this is necessary at all. One could think that the Internet Archive in combination with the efforts of the increasing number of national libraries is already covering many, if not most relevant Web resources. Does academic or other small scale Web archiving make sense at all?
Let me begin with this second question. The Internet Archive has done groundbreaking work as the first initiative attempting comprehensive archiving of Web resources. Its success in accomplishing this has been revolutionary, and it has laid the foundation on which many other projects have built their work. Still, examining what the Internet Archive and other holistic projects can achieve, it is easy to discover some limitations. Since their focus of collection is very broad, they have to rely on robots for a large part of their collecting activities, automatically grabbing as many Web pages as possible. This kind of capturing is often very superficial, missing parts located further down the tree, many pages being downloaded incompletely, and some file types as well as the hidden Web being ignored altogether.
Modeling Popularity and Reliability of Sources in Multilingual Wikipedia
Information, Article
Modeling Popularity and Reliability of Sources in Multilingual Wikipedia
Włodzimierz Lewoniewski *, Krzysztof Węcel and Witold Abramowicz
Department of Information Systems, Poznań University of Economics and Business, 61-875 Poznań, Poland; [email protected] (K.W.); [email protected] (W.A.)
* Correspondence: [email protected]
Received: 31 March 2020; Accepted: 7 May 2020; Published: 13 May 2020

Abstract: One of the most important factors impacting the quality of content in Wikipedia is the presence of reliable sources. By following references, readers can verify facts or find more details about the described topic. A Wikipedia article can be edited independently in any of over 300 languages, even by anonymous users, therefore information about the same topic may be inconsistent. This also applies to the use of references in different language versions of a particular article, so the same statement can have different sources. In this paper we analyzed over 40 million articles from the 55 most developed language versions of Wikipedia to extract information about over 200 million references and find the most popular and reliable sources. We presented 10 models for the assessment of the popularity and reliability of the sources based on analysis of meta information about the references in Wikipedia articles, page views and authors of the articles. Using DBpedia and Wikidata we automatically identified the alignment of the sources to a specific domain. Additionally, we analyzed the changes of popularity and reliability in time and identified growth leaders in each of the considered months. The results can be used for quality improvements of the content in different language versions of Wikipedia.
Arhiviranje Weba
Arhiviranje weba (Web Archiving)
Lučić, Lucija
Master's thesis, 2020
Degree grantor: University of Zagreb, Faculty of Humanities and Social Sciences (Sveučilište u Zagrebu, Filozofski fakultet)
Permanent link: https://urn.nsk.hr/urn:nbn:hr:131:448437
Rights: In copyright
Download date: 2021-09-25
Repository: ODRAZ - open repository of the University of Zagreb Faculty of Humanities and Social Sciences

University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information and Communication Sciences, Archival Studies track. Academic year 2019/2020.
Lucija Lučić, Arhiviranje weba (Web Archiving), master's thesis. Mentor: Prof. Hrvoje Stančić, PhD. Zagreb, 2020.

Statement of academic integrity
I declare, and confirm with my signature, that this thesis is the result of my own work, based on research and on published and cited literature. I declare that no part of the thesis was written in an impermissible manner, that is, copied from an uncited work, and that no part of the thesis infringes anyone's copyright. I also declare that no part of the thesis has been used for any other work at any other higher education, research, or educational institution.

Contents
1. Introduction
2. What is web archiving?
Cultural Anthropology and the Infrastructure of Publishing
CULTURAL ANTHROPOLOGY AND THE INFRASTRUCTURE OF PUBLISHING
TIMOTHY W. ELFENBEIN, Society for Cultural Anthropology

This essay is based on an interview conducted by Matt Thompson, portions of which originally appeared on the blog Savage Minds the week after Cultural Anthropology relaunched as an open-access journal. Herein, I elaborate on my original comments to provide a nuts-and-bolts account of what has gone into the journal’s transition. I am particularly concerned to lay bare the publishing side of this endeavor, of what it means for a scholarly society to take on publishing responsibilities, since the challenges of publishing, and the labor and infrastructure involved, are invisible to most authors and readers.

Matt Thompson: When did the Society for Cultural Anthropology (SCA) decide to go the open-access route and what was motivating them?

Tim Elfenbein: Cultural Anthropology was one of the first American Anthropological Association (AAA) journals to start its own website. Kim and Mike Fortun were responsible for the initial site. They wanted to know what extra materials they could put on a website that would supplement the journal’s articles. That experience helped germinate the idea that there was more that the journal could do itself. The Fortuns are also heavily involved in science and technology studies (STS), where discussions about open access have been occurring for a long time. When Anne Allison and Charlie Piot took over as editors of the journal in 2011, they continued to push for an open-access alternative in our publishing program. Although the SCA and others had been making the argument for transitioning at

CULTURAL ANTHROPOLOGY, Vol.
Appraisal Talk in Web Archives Ed Summers
Appraisal Talk in Web Archives
Ed Summers
Archivaria, The Journal of the Association of Canadian Archivists

ABSTRACT
The Web is a vast and constantly changing information landscape that by its very nature seems to resist the idea of the archive. But for the last 20 years, archivists and technologists have worked together to build systems for doing just that. While technical infrastructures for performing web archiving have been well studied, surprisingly little is known about the interactions between archivists and these infrastructures. How do archivists decide what to archive from the Web? How do the tools for archiving the Web shape these decisions? This study analyzes a series of ethnographic interviews with web archivists to understand how their decisions about what to archive function as part of a community of practice. It uses critical discourse analysis to examine how the participants’ use of language enacts their appraisal decision-making processes. Findings suggest that the politics and positionality of the archive are reflected in the ways that archivists talk about their network of personal and organizational relationships. Appraisal decisions are expressive of the structural relationships of an archives as well as of the archivists’ identities, which form during mentoring relationships. Self-reflection acts as a key method for seeing the ways that interviewers and interviewees work together to construct the figured worlds of the web archive. These factors have implications for the ways archivists communicate with each other and interact with the communities that they document. The results help ground the encounter between archival practice and the architecture of the Web.

RÉSUMÉ (translated from French)
The Web is a vast and constantly changing information landscape that, by its very nature, seems to resist the idea of the archive.
SCONUL Focus Number 38 Summer/Autumn 2006
SCONUL Focus Number 38 Summer/Autumn 2006
ISSN 1745-5782 (print) ISSN 1745-5790 (online)
Contents
The 3Ss
The Learning Grid at the University of Warwick: a library innovation to support learning in higher education (Rachel Edwards)
The Learning Gateway: opening the doors to a new generation of learners at St Martin’s College, Carlisle campus (Margaret Weaver)
Middlesex University: the impressive rejuvenation of Hendon campus (Paul Beaty-Pownall)
Poor design equals poor health questionnaire: the final results (Jim Jackson)
Human resourcing in academic libraries: the ‘lady librarian’, the call for flexible staff and the need to be counted (A. D. B. MacLean, N. C. Joint)
Taking steps that make you feel dizzy: personal reflections on module 1 of the Future Leaders programme (John Cox, Annie Kilner, Dilys Young)
Evolution: the Oxford trainee scheme (Gill Powell, Katie Robertson)
A week in the life (Kim McGowan)
Got the knowledge? Focusing on the student: Manchester Metropolitan University’s (MMU) library welcome campaign (David Matthews, Emily Shields, Rosie Jones, Karen Peters)
Ask the audience: e-voting at the University of Leeds (Lisa Foggo, Susan Mottram, Sarah Taylor)
Information literacy, the link between second and tertiary education: project origins and current developments (Christine Irving)
Review of how libraries are currently supporting the research process (Ruth Stubbings, Joyce Bartlett, Sharon Reid)
Researchers, information and libraries: the CONUL national research support survey (John Cox)
Creating a new Social Science Library at Oxford University based on reader consultation (Louise Clarke)
The use of personal scanners and digital cameras within OULS reading rooms (Steve Rose, Gillian Evison)
Copyright, digital resources and IPR at Brunel University (Monique Ritchie)
Secure electronic delivery: ‘get the world’s knowledge with less waiting’ (Alison E.
File Format Guidelines for Management and Long-Term Retention of Electronic Records
FILE FORMAT GUIDELINES FOR MANAGEMENT AND LONG-TERM RETENTION OF ELECTRONIC RECORDS
State Archives of North Carolina, 9/10/2012
Table of Contents
1. GUIDELINES AND RECOMMENDATIONS
2. DESCRIPTION OF FORMATS RECOMMENDED FOR LONG-TERM RETENTION
2.1 Word Processing Documents
2.1.1 PDF/A-1a (.pdf) (ISO 19005-1 compliant PDF/A)
2.1.2 OpenDocument Text (.odt)
2.1.3 Special Note on Google Docs™
2.2 Plain Text Documents
2.2.1 Plain Text (.txt) US-ASCII or UTF-8 encoding
2.2.2 Comma-separated file (.csv) US-ASCII or UTF-8 encoding
2.2.3 Tab-delimited file (.txt) US-ASCII or UTF-8 encoding
2.3