Web Archiving Programs in the U.S

WEB LOGOS, 1/28/2009 LOGO COLORS: dark green: 666633 NOTE: All type is same color; map and dotted line are same light green: CCCC99 color (differing tints of green are allowed; use orange map orange: CC6600 and white dots for Combined logo on dark background) black LIGHT BACKGROUND Center for GLOBAL Center for GLOBAL Side by Side: 75 px high; Research RESOURCES Research RESOURCES also available at 130 px high Libraries NETWORK Libraries NETWORK Combined: Center for GLOBAL 75 px high; also available Research RESOURCES at 130 px high Libraries NETWORK Center for Research Libraries Centerrfo Research Libraries banner: 60 px high GLOBAL RESOURCES NETWORK GLOBAL RESOURCES NETWORK An Evaluation of Web Archiving Programs in the U.S. Relevant to DARK BACKGROUND International and Area StudiesThe Center for GLOBAL Side by Side: THE EXAMPLE OF LATIN AMERICA Research RESOURCES 75 px high; AND THE CARIBBEAN also available Libraries NETWORK at 130 px high By Jeffrey Garrett The Center for GLOBAL Combined: 130 px high; Research RESOURCES also available at 75 px high Libraries NETWORK Centerrfo Research Libraries banner: 60 px high GLOBAL RESOURCES NETWORK SHALLOW BACKGROUNDS Center for GLOBAL Center for GLOBAL Side by Side: RESOURCES RESOURCES 75 px high; Research Research also available Libraries NETWORK Libraries NETWORK at 130 px high The The Combined: Center for GLOBAL Center for GLOBAL 130 px high; also available at 75 px high Research RESOURCES Research RESOURCES Libraries NETWORK Libraries NETWORK Centerrfo Research Libraries Centerrfo Research Libraries banner: 60 px high GLOBAL RESOURCES NETWORK GLOBAL RESOURCES NETWORK Web Archiving Programs: Latin America and the Caribbean, page 1 February 2019 Contents Contents ....................................................................................................................................................... 1 Executive Summary .................................................................................................................................... 3 An Evaluation of Web Archiving Programs in the US Relevant to International and Area Studies: The Example of Latin America and the Caribbean ................................................................................. 5 I. Introduction: Latin America Takes to the Web ............................................................................... 5 II. Conducting Research in the Live and the Past Web of Latin America: A Hypothetical Example .............................................................................................................................................. 11 III. Archiving the Latin American & Caribbean Web ......................................................................... 15 A. Web Archiving at the Library of Congress ................................................................................ 15 B. Web Archiving at Columbia University Libraries: The Human Rights Web Archive............. 23 C. Web Archiving at the University of Texas at Austin: LAGDA and HRDI ................................. 28 IV. An Independent Use Analysis ......................................................................................................... 34 V. Conclusions and Recommendations: Accelerating the Integration and Mainstreaming of Web Archiving ................................................................................................................................... 38 A. Metadata ........................................................................................................................................ 40 B. Finding Aids ................................................................................................................................... 40 C. Citation Standards & Tools ......................................................................................................... 41 D. Self-archiving................................................................................................................................. 43 E. Credibility Enhancement .............................................................................................................. 44 F. Education, Outreach, Promotion ................................................................................................ 46 G. Data Mining & Whole-Collection Analysis ................................................................................ 46 H. Promoting Collaboration ............................................................................................................. 49 VI. Cited Literature ................................................................................................................................. 50 Web Archiving Programs: Latin America and the Caribbean, page 2 February 2019 Web Archiving Programs: Latin America and the Caribbean, page 3 February 2019 Executive Summary In 2016 The Andrew W. Mellon Foundation awarded CRL funding to develop an “integrated, self-sustaining, international cooperative framework to support area and international studies (AIS).” The chief goal of the Global Collections Initiative is to expand electronic access to primary source documentation and data from major world regions, where the information landscape differs from that in the U.S. and Western Europe. The initial phase of the project focused on one region: Latin America and the Caribbean. A major focus of the initiative has been access to materials existing only in digital form. This report evaluates efforts in the U.S. to archive open web content from the Caribbean and Latin America for future use by researchers. The web is booming throughout this world region and has become a fertile resource for research and publication across the disciplinary spectrum. At the same time, the ephemerality of web content, the result of deletion, migration, alteration, or adulteration, collectively known as “reference rot,” has led to a crisis in scholarly communication. Conducting, sharing, and reading web-based research is “like trying to stand on quicksand.” (Lepore 2015) While largely resolved for journals, this crisis “has so far not been adequately addressed for . web-at-large resources.” (Klein et al. 2014) Part I of this report introduces both systemic and region-specific issues of web use (and abuse) which are at the root of the problem; while Part II, using the example of a hypothetical scholar researching the Landless Workers Movement in Brazil, shows how these issues impact actual research in ways unimaginable in the pre-web era. In Part III, three prominent area studies–relevant archival programs are examined: the Library of Congress Web Archiving Program (LCWA); Columbia University’s Human Rights Web Archive (HRWA); and then two programs at the University of Texas at Austin, the Latin American Government Documents Archive (LAGDA) and the Human Rights Documentation Initiative (HRDI). For each, we describe: history; governance; scoping and selection; metadata and search; in-house use analysis; and self-assessed challenges and future hopes. In Part IV, we consider what external evidence exists that these archives are actually being used. We discover that “views” do not necessarily translate into scholarly citations, but that methodological problems make any comprehensive use analysis—recognized by the National Digital Stewardship Alliance as a high priority—difficult at this time. (Bailey et al. 2017, 27) Above all, lack of consensus on how and even whether to cite from web archives compromises any analysis of their research relevance. Web Archiving Programs: Latin America and the Caribbean, page 4 February 2019 Finally, in Part V, this report describes opportunities for moving ahead. These include efforts to standardize metadata across the library/archives divide; to develop better finding aids and expose them to web crawlers; to improve citation standards for web content—especially in popular style manuals and bibliographic software used by students, scholars, and publishers; to introduce certification standards for web archives, thereby enhancing their credibility among skeptical scholars; to push education, outreach, and exchange outside the bubble of web archivists both on campus and beyond, for example at discipline-specific professional meetings; to promote data mining and whole-collection analysis; and finally to advance inter- institutional—and international—collaboration, leveraging the strengths of multiple partners. Web Archiving Programs: Latin America and the Caribbean, page 5 February 2019 An Evaluation of Web Archiving Programs in the US Relevant to International and Area Studies: The Example of Latin America and the Caribbean I. Introduction: Latin America Takes to the Web After a slow start in the 1990s and early 2000s, internet use in Latin America and the Caribbean exploded a little over ten years ago, now exceeding that of the United States and Canada.1 As with so much having to do with very large numbers, a graphic can make this rapid growth dramatically clearer. The image on the left shows city-to-city internet connections between North America, Europe, Africa, and South America in 2007. It is immediately apparent that the bulk of international internet traffic was taking place between North America and Western Europe. The graphic on the right, created by the same researcher using the same mapping algorithm, reflects the density of internet connections just Figure 1: Worldwide City-to-City Internet Connections in 2007 (l.) and 2011 (r.). Courtesy

Web Archiving Programs in the U.S

Web Archiving Supplementary Guidelines

Web Archiving and You Web Archiving and Us

情報管理 O U R Nal of Information Pr Ocessing and Managemen T December

Web Archives for the Analog Archivist: Using Webpages Archived by the Internet Archive to Improve Processing and Description

Wayback Machine Instructions

The Future of Web Citation Practices

The Dbpedia Wayback Machine ∗

Partnership Opportunities with the Internet Archive Web Archiving in Libraries October 21, 2020

Functionalities of Web Archives

Using Wayback Machine for Research

Internet Archive Open Libraries Proposal Macarthur Foundation 100&Change Internet Archive 100&Change

Reference Rot: an Emerging Threat to Transparency in Political Science