Metadata: a Primer for Indexers
Total Page:16
File Type:pdf, Size:1020Kb
METADATA: A PRIMER FOR INDEXERS Metadata: A Primer for Indexers BY DEBBIE OLSON Introduction data that fits the definition. Metadata. It’s a term being used widely and often • “Digital libraries” made up of born-digital infor - in the library (digital libraries, cataloging), business mation or materials originally in another format Debbie Olson provides freelance (knowledge management), and web design (infor - brought together online via scanning or other indexing, library and archival services mation architecture and taxonomy development) technologies. Such initiatives are seen in the tra - and lives in Syracuse, NY. She is the worlds. Its most basic definition – one that you are ditional library environments but organizations current Webmaster for the Web Index - probably already familiar with – is “information also use digital library concepts to document ing SIG and has recently audited about information.” It’s the mantra of the informa - their in-house information. classes in information architecture and tion world. But here’s a more detailed definition • The information contained in the metadata from Syracuse University. from the National Information Standards Organiza - <head></head> element of a web page, e.g. key - She can be reached at [email protected] tion (NISO) where the key to metadata is that it is words. or www.olson-info.com . “structured” information (2004): • The technical details associated with photos Metadata is structured information that from a digital camera, e.g. width, height, pixels describes, explains, locates, or otherwise per inch, resolution. makes it easier to retrieve, use, or manage • The information about music downloaded from an information resource. 1 an online source e.g. artist, length, genre. • The properties of a text document, e.g. type of Its nuances now cast a broader net in terms of file, location, creator. meaning and application. It’s partially about a Some types of metadata may be automatically change in terminology stemming from a change in embedded in a record such as in the case of photos technology but it’s also about a new way of looking taken from a digital camera or in downloaded at information. And as standards and best practices music. It can also be proprietary, as in the case of evolve, information can be more easily accessed and the metadata stored in a text editor, or non-propri - shared across communities and platforms. etary, such as that stored in the header of an HTML Indexers are already creators of metadata and document. Other metadata is added by humans. have important skills and experience to bring to the Tagging photos on a gallery website such as Flickr is broader discussion. We can start by developing a one example .2 basic understanding of the definitions, standards, schemas, applications, and best practices associated Types of Metadata with the world of metadata. Metadata is generally broken down into four or Applications for Metadata five categories. Administrative metadata includes rights and preservation metadata (sometimes sepa - Metadata creation is not limited to text-based rate) and contains information regarding the cre - material. It can be applied to three-dimensional ation, management, and accessibility of the material. objects, images, audio, video materials, geospatial Technical metadata (sometimes included with admin - data, PowerPoint presentations, lesson plans, web - istrative) contains information about how the item sites, computer file systems, and just about anything or piece of information was generated or transferred, we want in order to find it faster and more effi - e.g. camera settings for a photograph that are auto - ciently in the future. A few examples include: matically embedded in the file/object. Structural • The traditional library card catalog – certainly metadata refers to how the information or object is one of the earliest, mainstream forms of meta - to be compiled, e.g. filenames for chapters of a 18 JANUARY – MARCH 2009 METADATA: A PRIMER FOR INDEXERS book. Descriptive metadata, the type of meta - munity can be viewed in the <head></head> the genre term to input in the “type” element. data we’re most familiar with and indexers are section of the page info and page source codes It can be adapted or modified for non-DCMES most likely to work with, includes elements of the home page for IBM. 7 Dublin Core is projects. such as titles, genres, authors/creators, and declared as the namespace , where the name of The Tools of Metadata Creation subjects. 3 the schema and the web location (URL) where Metadata Standards and Schemas it resides is given. For example, IBM uses six Metadata can be input by hand or with the Dublin Core elements that are noted by the assistance of tools such as templates and edi - Metadata standards and schemas provide a “dc” prefix: dc.publisher, dc.language, dc.date, tors. Schemas may also have tools created way to format and exchange metadata and dc.rights, dc.subject, and dc.type. W3C-DTF, a especially for them to assist with creating and improve interoperability , or the ability to be syntax standard for expressing the date and encoding metadata for the Web. One example understood and shared across information time, is used for configuring the date and is DC-dot, a Web-based application that will communities and platforms. Metadata stan - dc.subject refers to internal IBM taxonomies . generate DCMES metadata in HTML or XML dards address how to describe content and DCMES, however, has become too simplistic for embedding in the <head></head> of a Web what must be included, such as a title, creator, for some communities. The fifteen elements page. 10 date, and subject. Standards also provide the were refined and expanded with additional There are also crosswalks , tools to assist with rules, or syntax , for how values , e.g. , the name properties but other standards and schemas converting one schema to another through of a particular title or a subject heading, are have been developed too. A few examples mapping one set of elements to another or via constructed or formatted. Metadata schemas , as include: Categories for the Description of Works an intermediary, e.g., one schema is mapped to opposed to schemas for encoding material for of Art (CDWA) and the VRA Core Categories DCMES then mapped to another schema from the Web such as XML, include specific sets of (VRA) for artwork and other visual media; DCMES. It is important to keep in mind that elements, e.g., title, creator, date, and subject. IEEE Learning Object Metadata (LOM) and with any of these tools a certain amount of In the library community, the Anglo-Ameri - Gateway to Educational Materials (GEM) for editing will be necessary to meet the needs of a can Cataloging Rules (AACR) provides the rules educational materials; Encoded Archival Descrip - particular project or community. to describe the content of a bibliographic tion (EAD) for archival finding aids; Content Metadata may also be created through a record while MAchine Readable Cataloging Standards for Digital Geospatial Metadata content management system (CMS) which (MARC) provides the schema, or set of ele - (CSDGM) for scientific material; and ONline provides for the authoring, publishing, archiv - ments, to markup and encode this biblio - Information eXhange (ONIX ) and the Text ing, and searching of documents. Many corpo - graphic information in an electronic format, Encoding Initiative (TEI) for booksellers and rations are using CMSs to manage websites e.g., 245 is the title field. In the archival com - publishers. Elements from various schemas and internal information 11 and many cultural munity, Describing Archives: A Content Standard may also be combined to create an application institutions are using digital management soft - (DACS) , provides the standards for describing profile , a formal set of guidelines on how a par - ware packages, e.g., CONTENTdm®, to cata - archival materials while Encoded Archival ticular community is using the elements of one log their collections. 12 Open source (free) Description (EAD) provides the set of elements or several schemas. systems are also available to manage content necessary to markup and encode the material, The public sector is also involved and and metadata. such as the components of a finding aid, for an invested in developing standards and schemas Indexers as Metadata Creators online environment. to manage the ever-expanding amount of Current metadata standards and schemas online government information. The Australian In expanding our knowledge and vocabu - were developed and became more widespread Government Locator Service (AGLS), the e- lary to include the world of metadata we can primarily in the mid-1990s with the creation Government Metadata Standard (e-GMS) (UK), take a step towards not only maintaining, but of the Dublin Core Metadata Element Set and the Government of Canada Records Man - more importantly, advancing our role in the (DCMES) . Sometimes simply known as agement Metadata Standard (GC RMMS) are information world in our own eyes and those “Dublin Core,” it is named after Dublin, Ohio, examples. The U.S. federal government recom - of the wider information community. We can the location where the original workshop con - mends the use of metadata as a part of its best begin by familiarizing ourselves with the basics vened and where OCLC, the library cataloging practices for government websites . 8 Many of of metadata and its applications, standards, services company, is headquartered. The initia - these