Granzow-de la Cerda & Beach • Acquiring specimen data from herbarium labels TAXON 59 (6) • December 2010: 1830–1842 METHODS AND TECHNIQUES Semi-automated workflows for acquiring specimen data from label images in herbarium collections Íñigo Granzow-de la Cerda1,3 & James H. Beach2 1 University of Michigan Herbarium, 3600 Varsity Dr., Ann Arbor, Michigan 48108, U.S.A. 2 Biodiversity Institute, University of Kansas, 1345 Jayhawk Boulevard, Lawrence, Kansas 66045, U.S.A. 3 Current address: Departament de Biologia Animal, Biologia Vegetal i Ecologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain Author for correspondence: Íñigo Granzow-de la Cerda,
[email protected] Abstract Computational workflow environments are an active area of computer science and informatics research; they promise to be effective for automating biological information processing for increasing research efficiency and impact. In this project, semi-automated data processing workflows were developed to test the efficiency of computerizing information contained in herbarium plant specimen labels. Our test sample consisted of mexican and Central American plant specimens held in the University of michigan Herbarium (mICH). The initial data acquisition process consisted of two parts: (1) the capture of digital images of specimen labels and of full-specimen herbarium sheets, and (2) creation of a minimal field database, or “pre-catalog”, of records that contain only information necessary to uniquely identify specimens. For entering “pre-catalog” data, two methods were tested: key-stroking the information (a) from the specimen labels directly, or (b) from digital images of specimen labels. In a second step, locality and latitude/longitude data fields were filled in if the values were present on the labels or images.