Creating (And Commissioning) Ebooks
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
How to Find Free, Reusable Content Online Rhode Island Library
Open Everything: How to find free, reusable content online Rhode Island Library Association Conference 2016, “Color Outside the Lines” Andrée Rathemacher • Julia Lovett • Angel Ferria University of Rhode Island Open Culture General Resources: Sites, Portals & Guides Digital Public Library of America — http://dp.la/ Aims to be a national digital library for the USA. Harvests metadata and content in all formats from other digital libraries and databases (HathiTrust, Internet Archive, state/consortium repositories, govt repositories etc. full partner list here http://dp.la/partners) Does not yet allow searching/filtering by rights information. Europeana — http://www.europeana.eu/portal/ Europe’s portal to cultural collections: “Explore 52,219,831 artworks, artefacts, books, videos and sounds from across Europe.” Can filter search results by reuse rights. Internet Archive — http://archive.org Founded in 1996. A “nonprofit library of millions of free books, movies, software, music, and more.” Searchable by Creative Commons license or Public Domain: See https://archive.org/about/faqs.php#1069 Open Culture — http://www.openculture.com/ Founded in 2006. Brings together free/open resources from around the web. Geared for a popular audience, with frequent blog posts and active social media presence. OpenGLAM Open Collections — http://openglam.org/opencollections/ A searchable index of open cultural her itage collections with freely reusable content. Shared Shelf Commons — http://www.sscommons.org Freely available images and oth er digital content from libraries, archives, and museums participating in Shared Shelf by Artstor. Copyright restrictions vary. Creative Commons Search — https://search.creativecommons.org/ Search CClicensed content from multiple sites such as Flickr, Google, and YouTube. -
Hathitrust Preferred Internet Archive Book Package Overview
HathiTrust Preferred Internet Archive Book Package Overview & Background As a by-product of the Internet Archive scanning process, a variety of different files and formats are available to everyone, everywhere. This differs from the Google output, which offers no file-level variations or options. However, this also means that files chosen for ingest into the HathiTrust repository must be carefully selected, with an eye towards both near-term and long-term utility. The process of selecting files that is described below attempted to balance the following important criteria: a baseline, cross-partner standard; functional consistency with the Google work products; a desire to keep the highest quality master images; a disinclination to discard useful information; and an attempt to minimize overall package size to reduce storage costs. Ingest into the HathiTrust repository will require pre-processing of the original file set described below in order to normalize files to an expected format. This normalization will allow HathiTrust processes to accommodate content from all partners. This process is currently in development and a link to the documentation of the process will be included here, once it is finalized. File Selection Criteria In the following section, the files selected for ingest into the HathiTrust repository are identified, along with a justification for why they were selected. Also listed are files that are available from the Internet Archive, but have not been selected. A description of each file can be found in the All Available Files & Characteristics section below. All files below are named using the Internet Archive identifier, preceding the underscore (ex. -
Overview of the INEX 2009 Book Track
Overview of the INEX 2009 Book Track Gabriella Kazai1, Antoine Doucet2, Marijn Koolen3, and Monica Landoni4 1 Microsoft Research, United Kingdom [email protected] 2 University of Caen, France [email protected] 3 University of Amsterdam, Netherlands [email protected] 4 University of Lugano [email protected] Abstract. The goal of the INEX 2009 Book Track is to evaluate ap- proaches for supporting users in reading, searching, and navigating the full texts of digitized books. The investigation is focused around four tasks: 1) the Book Retrieval task aims at comparing traditional and book-specific retrieval approaches, 2) the Focused Book Search task eval- uates focused retrieval approaches for searching books, 3) the Structure Extraction task tests automatic techniques for deriving structure from OCR and layout information, and 4) the Active Reading task aims to explore suitable user interfaces for eBooks enabling reading, annotation, review, and summary across multiple books. We report on the setup and the results of the track. 1 Introduction The INEX Book Track was launched in 2007, prompted by the availability of large collections of digitized books resulting from various mass-digitization projects [1], such as the Million Book project5 and the Google Books Library project6. The unprecedented scale of these efforts, the unique characteristics of the digitized material, as well as the unexplored possibilities of user interactions present exciting research challenges and opportunities, see e.g. [3]. The overall goal of the INEX Book Track is to promote inter-disciplinary research investigating techniques for supporting users in reading, searching, and navigating the full texts of digitized books, and to provide a forum for the exchange of research ideas and contributions. -
The Internet Archive: an Interview with Brewster Kahle Brewster Kahle and Ana Parejo Vadillo
The Internet Archive: An Interview with Brewster Kahle Brewster Kahle and Ana Parejo Vadillo Rumour has it that one of the candidates for Librarian of Congress is Brewster Kahle, the founder and director of the non-profit digital library Internet Archive.1 That he may be considered for the post is a testament to Kahle’s commitment to mass digitization, the cornerstone of modern librarianship. A visionary of the digital preservation of knowledge and an outspo- ken advocate of the open access movement (the memorial for the Internet activist Aaron Swartz was held at the Internet Archive’s headquarters in San Francisco), Kahle has been part of the many ventures that have created our cyber age. At MIT, he was on the project team of Thinking Machines, a precursor of the World Wide Web. In 1989 he created WAIS (Wide Area Information Server), the first electronic publishing system, which was designed to search and make information available. He left Thinking Machines to focus on his newly founded company, WAIS, Inc., which was sold to AOL two years later for a reported $15 million. In 1996 he co- founded Alexa Internet, which was built on the principles of collecting Web traffic data and analysis.2 The company was named after the Library of Alexandria, the largest repository of knowledge in the ancient world, to highlight the potential of the Internet to become such a custodian. It was sold for c. $250 million in stock to Amazon, which uses it for data mining. Alongside Alexa Internet, in 1996 Kahle founded the Internet Archive to archive Web culture (Fig. -
February 2005)
TechNews November 2006 TechNews is a technology, news and analysis service aimed at those in the education sector keen to stay informed about technology developments, trends and issues. Please navigate the newsletter by clicking on items within the table of contents. Networking and wireless ........................................................................................................... 2 Analysis: Trusted Computing and Network Access Control............................................................................. 2 Networking and wireless news ................................................................................................. 4 Becta Infrastructure Services Framework ............................................................................................................... 4 802.11n update ....................................................................................................................................................... 4 Predicted growth in GPS-based services................................................................................................................ 4 Mobile WiMAX......................................................................................................................................................... 5 Short range wireless developments ........................................................................................................................ 5 4G progress ........................................................................................................................................................... -
Gen 102 Finding Full-Text Books Online
Finding Full-Text Books Online Internet Archive www.archive.org The Internet Archive is building a digital library of Internet sites and other cultural artifacts in digital form, including video, audio, texts and the wayback machine. .Texts includes books and journals .Searches bibliographic information (see Open Library below for inside the book searching) .Displays text or image .Print from pdf (pdf may not be searchable) Open Library openlibrary.org Creating One web page for every book ever published. A project of Internet Archive .Books (Searches Internet Archive) .Displays text or image .Download or Print? o Searches bibliographic information, to identify books of interest Click on Subject to search by subject heading and can then keyword search within the heading (census, maps, Carey’s American pocket atlas) o Searching inside the book: openlibrary.org/search/inside Google Books books.google.com Google’s mission is to organize the world‘s information and make it universally accessible and useful. Books and Journals .Searches inside the book: use google search tools, use limiters in sidebar .Print from PDF—pdf downloaded is not searchable, must search in google books .Views: o Snippett o Preview o Download/pdf .Displays images (with some text in preview mode) .MORE .My Library .Order ebook, buy book, or get from a library Making of America quod.lib.umich.edu/m/moagrp/ Primary sources in American social history from the 19th century .Searches inside the book or document .Displays text or image or pdf .Can print one page at a time (best from pdf) .No download of complete book or article Family History Archive from Brigham Young University www.lib.byu.edu/fhc/index.php .search bib or inside book, pdf display, print, download Hathi Trust www.hathitrust.org A partnership of major research institutions and libraries to preserve and make books accessible. -
On the Interoperability of Ebook Formats
It is widely seen as a serious problem that European as well as international customers who have bought an ebook from one of the international ebook retailers implicitly subscribe to this retailer as their sole future ebook On the Interoperability supplier, i.e. in effect, they forego buying future ebooks from any other supplier. This is a threat to the qualified European book distribution infrastructure and hence the European book culture, since subscribers to one of these of eBook Formats ebook ecosystems cannot buy future ebooks from privately owned community-located bricks & mortar booksellers engaging in ebook retailing. This view is completely in line with the Digital Agenda of the European Commission calling in Pillar II for “an effective interoperability Prof. Christoph Bläsi between IT products and services to build a truly digital society. Europe must ensure that new IT devices, applications, data repositories and services interact seamlessly anywhere – just like the Internet.” Prof. Franz Rothlauf This report was commissioned from Johannes Gutenberg University Johannes Gutenberg-Universität Mainz – Germany Mainz by the European and International Booksellers Federation. EIBF is very grateful to its sponsors, namely the Booksellers Association of Denmark, the Booksellers Association of the Netherlands and the Booksellers Association of the UK & Ireland, whose financial contribution made this project possible. April 2013 European and International Booksellers Federation rue de la Science 10 – 1000 Brussels – Belgium – [email protected] -
Rethink Web Archiving! ! Helen Hockx-Yu, Director of Global Web Services Internet Archive
Rethink Web Archiving! ! Helen Hockx-Yu, Director of Global Web Services Internet Archive DPC Students Conference January 2016 About Me • Digital preservation / Web Archiving • Project / Programme / Operation/Service management • IT related • 2003-2007: Programme Manager, Digital Preservation and Shared Services, JISC • 2007-2008: Planets Project Manager, British Library • 2008 – 2015: Web Archiving Programme Manager & Head of Web Archiving, British Library • September 2015 – Present: Director of Global Web Services, Internet Archive 20 years of Web Archiving • Started by the Internet Archive in 1996 • Increased awareness • Legal issues much better understood • Growing community • 68 initiatives across 33 countries • 534 billions of web-archived files since 1996 (17 PB) • Scholarly use of web archives • Many challenges Internet Archive • A not-for-profit digital library founded in 1996 by Brewster Kahle • Contains 24+PB of data and is growing • Digitised books, manuscripts and other texts • Movies & music • TV news archive: https://archive.org/details/tv • Software • Archived webpages • Over 2 million registered users https://archive.org/about/stats.php • Started web archiving in 1996. Wayback released in 2001 • Largest publicly available web archive in existence • 450+ Billion URLs, 100+ million websites • content in 40+ Languages • 600,000 visit / day • We collect a broad snapshot of the web every 60 days, +1billion ULRs/week • Also crawl wikipedia, news, RSS feeds, YouTube etc Archive-IT • Subscription service launched in February 2006 -
Overview of the INEX 2010 Book Track: at the Mercy of Crowdsourcing
Overview of the INEX 2010 Book Track: At the Mercy of Crowdsourcing Gabriella Kazai1, Marijn Koolen2, Antoine Doucet3, and Monica Landoni4 1 Microsoft Research, United Kingdom [email protected] 2 University of Amsterdam, Netherlands [email protected] 3 University of Caen, France [email protected] 4 University of Lugano [email protected] Abstract. The goal of the INEX 2010 Book Track is to evaluate ap- proaches for supporting users in reading, searching, and navigating the full texts of digitized books. The investigation is focused around four tasks: 1) the Book Retrieval (Best Books to Reference) task aims at comparing traditional and book-specific retrieval approaches, 2) the Fo- cused Book Search (Prove It) task evaluates focused retrieval approaches for searching books, 3) the Structure Extraction task tests automatic techniques for deriving structure from OCR and layout information, and 4) the Active Reading task aims to explore suitable user interfaces for eBooks enabling reading, annotation, review, and summary across mul- tiple books. We report on the setup and the results of the track. 1 Introduction The INEX Book Track was launched in 2007, prompted by the availability of large collections of digitized books resulting from various mass-digitization projects [1], such as the Million Book project5 and the Google Books Library project6. The unprecedented scale of these efforts, the unique characteristics of the digitized material, as well as the unexplored possibilities of user interactions present exciting research challenges and opportunities, see e.g. [4]. The overall goal of the INEX Book Track is to promote inter-disciplinary research investigating techniques for supporting users in reading, searching, and navigating the full texts of digitized books, and to provide a forum for the exchange of research ideas and contributions. -
Hachette Book Group V. Internet Archive
Case 1:20-cv-04160 Document 1 Filed 06/01/20 Page 1 of 53 UNITED STATES DISTRICT COURT SOUTHERN DISTRICT OF NEW YORK - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - x HACHETTE BOOK GROUP, INC., : HARPERCOLLINS PUBLISHERS LLC, JOHN WILEY & SONS, INC., and PENGUIN RANDOM : 20 Civ. _____________ HOUSE LLC, : ECF Case Plaintiffs, : : COMPLAINT -against- : TRIAL BY JURY DEMANDED : INTERNET ARCHIVE and DOES 1 through 5, : inclusive, : Defendants. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - x Plaintiffs Hachette Book Group, Inc. (“Hachette”), HarperCollins Publishers LLC (“HarperCollins”), John Wiley & Sons, Inc. (“Wiley”), and Penguin Random House LLC (“Penguin Random House”), by and through their attorneys Davis Wright Tremaine LLP and Oppenheim + Zebrak, LLP, for their Complaint, hereby allege against Defendant Internet Archive (“IA” or “Defendant”) and Does 1 through 5 as follows: NATURE OF THE ACTION 1. Plaintiffs Hachette, HarperCollins, Penguin Random House, and Wiley (collectively, “Plaintiffs” or “Publishers”) bring this copyright infringement action against IA in connection with website operations it markets to the public as “Open Library” and/or “National Emergency Library.” Plaintiffs are four of the world’s preeminent publishing houses. Collectively, they publish some of the most successful and leading authors in the world, investing in a wide range of fiction and nonfiction books for the benefit of readers everywhere. All of the Plaintiffs are member companies of the Association of American Publishers, the mission of which is to be the voice of American publishing on matters of law and public policy. 1 Case 1:20-cv-04160 Document 1 Filed 06/01/20 Page 2 of 53 2. Defendant IA is engaged in willful mass copyright infringement. Without any license or any payment to authors or publishers, IA scans print books, uploads these illegally scanned books to its servers, and distributes verbatim digital copies of the books in whole via public-facing websites. -
Forcepoint DLP Supported File Formats and Size Limits
Forcepoint DLP Supported File Formats and Size Limits Supported File Formats and Size Limits | Forcepoint DLP | v8.8.1 This article provides a list of the file formats that can be analyzed by Forcepoint DLP, file formats from which content and meta data can be extracted, and the file size limits for network, endpoint, and discovery functions. See: ● Supported File Formats ● File Size Limits © 2021 Forcepoint LLC Supported File Formats Supported File Formats and Size Limits | Forcepoint DLP | v8.8.1 The following tables lists the file formats supported by Forcepoint DLP. File formats are in alphabetical order by format group. ● Archive For mats, page 3 ● Backup Formats, page 7 ● Business Intelligence (BI) and Analysis Formats, page 8 ● Computer-Aided Design Formats, page 9 ● Cryptography Formats, page 12 ● Database Formats, page 14 ● Desktop publishing formats, page 16 ● eBook/Audio book formats, page 17 ● Executable formats, page 18 ● Font formats, page 20 ● Graphics formats - general, page 21 ● Graphics formats - vector graphics, page 26 ● Library formats, page 29 ● Log formats, page 30 ● Mail formats, page 31 ● Multimedia formats, page 32 ● Object formats, page 37 ● Presentation formats, page 38 ● Project management formats, page 40 ● Spreadsheet formats, page 41 ● Text and markup formats, page 43 ● Word processing formats, page 45 ● Miscellaneous formats, page 53 Supported file formats are added and updated frequently. Key to support tables Symbol Description Y The format is supported N The format is not supported P Partial metadata -
IDOL Keyview Filter SDK 12.8 C Programming Guide
IDOL KeyView Software Version 12.8 Filter SDK C Programming Guide Document Release Date: February 2021 Software Release Date: February 2021 Filter SDK C Programming Guide Legal notices Copyright notice © Copyright 2016-2021 Micro Focus or one of its affiliates. The only warranties for products and services of Micro Focus and its affiliates and licensors (“Micro Focus”) are as may be set forth in the express warranty statements accompanying such products and services. Nothing herein should be construed as constituting an additional warranty. Micro Focus shall not be liable for technical or editorial errors or omissions contained herein. The information contained herein is subject to change without notice. Documentation updates The title page of this document contains the following identifying information: l Software Version number, which indicates the software version. l Document Release Date, which changes each time the document is updated. l Software Release Date, which indicates the release date of this version of the software. To check for updated documentation, visit https://www.microfocus.com/support-and-services/documentation/. Support Visit the MySupport portal to access contact information and details about the products, services, and support that Micro Focus offers. This portal also provides customer self-solve capabilities. It gives you a fast and efficient way to access interactive technical support tools needed to manage your business. As a valued support customer, you can benefit by using the MySupport portal to: l Search for knowledge documents of interest l Access product documentation l View software vulnerability alerts l Enter into discussions with other software customers l Download software patches l Manage software licenses, downloads, and support contracts l Submit and track service requests l Contact customer support l View information about all services that Support offers Many areas of the portal require you to sign in.