Central Archiving Platform (CAP)
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  ISSN: 2348-1773 (Online) Volume 3, Issue 1 (January-June, 2016), PpJournal of Information Management ISSN: 2348-1765 (Print), ISSN: 2348-1773 (Online) Volume 3, Issue 1 (January-June, 2016), pp. 71-79 © Society for Promotion of Library Professionals (SPLP) http:// www.splpjim.org ______________________________________________________________________________ DATA CURATION: THE PROCESSING OF DATA Krishna Gopal Librarian , Kendriya Vidyalaya NTPC Dadri, GB Nagar [email protected] ABSTRACT Responsibility for data curation rests, in different ways, on a number of different professional roles. Increasingly, within the library, data curation responsibilities are being associated with specific jobs (with titles like “data curator” or “data curation specialist”), and the rise of specialized training programs within library schools has reinforced this process by providing a stream of qualified staff to fill these roles. At the same time, other kinds of library staff have responsibilities that may dovetail with (or even take the place of) these specific roles: for instance, metadata librarians are strongly engaged in curatorial processes, as are repository managers and subject librarians who work closely with data creators in specific fields. Different techniques and different organizations are engaged in data curation. Finally, it is increasingly being recognized that the data creators themselves (faculty researchers or library staff) have a very important responsibility at the outset: to follow relevant standards, to document their work and methods, and to work closely with data curation specialists so that their data is curatable over the long term. Keywords:Data Storage, Data Management, Data curation, Data annotation, Data preservation 1. INTRODUCTION Data Curation is a term used to indicate management activities related to organization and integration of data collected from various sources, annotation of the data, publication and presentation of the data such that the value of the data is maintained over time and the data remains available for reuse and preservation.
- 
												  Download Date 04/10/2021 06:52:09Reunión subregional de planificación de ODINCARSA (Red de Datos e Información Oceanográficos para las Regiones del Caribe y América del Sur), Universidad Autónoma de Baja California (UABC) Ensenada, Mexico, 7-10 December 2009, Item Type Report Publisher UNESCO Download date 04/10/2021 06:52:09 Item License http://creativecommons.org/licenses/by-nc/3.0/ Link to Item http://hdl.handle.net/1834/5678 Workshop Report No. 225 Informes de reuniones de trabajo Nº 225 Reunión subregional de planificación de ODINCARSA (Red de Datos e Información Oceanográficos para las Regiones del Caribe y América del Sur) Universidad Autónoma de Baja California (UABC) Ensenada (México) 7-10 de diciembre de 2009 ODINCARSA (Ocean Data and Information Network for the Caribbean and South America region) Latin America sub- regional Planning Meeting Universidad Autónoma de Baja California (UABC) Ensenada, Mexico, 7-10 December 2009 UNESCO 2010 IOC Workshop Report No. 225 Oostende, 23 February 2010 English and Spanish Workshop Participants For bibliographic purposes this document should be cited as follows: ODINCARSA (Ocean Data and Information Network for the Caribbean and South America region) Latin America sub-regional Planning Meeting, Universidad Autónoma de Baja California (UABC), Ensenada, Mexico, 7-10 December 2009 Paris, UNESCO, 23 February 2010 (IOC Workshop Report No. 225) (English and Spanish) La Comisión Oceanográfica Intergubernamental (COI) de la UNESCO celebra en 2010 su 50º Aniversario. Desde la Expedición Internacional al Océano Índico en 1960, en la que la COI asumió su función de coordinadora principal, se ha esforzado por promover la investigación de los mares, la protección del océano y la cooperación internacional.
- 
												  Vittorio CannasCurriculum Vitae Vittorio Cannas INFORMAZIONI PERSONALI Vittorio Cannas Viale Giuseppe Serrecchia 16, 00015, Monterotondo (RM), Italia +39 06 90626740 +39 339 6071094 [email protected] [email protected] Sesso Maschile | Data di nascita 22/11/1968 | Nazionalità Italiana ESPERIENZA PROFESSIONALE Da 07/2015 Presidente SpacEarth Technology Srl (http://www.spacearth.net/ ) Spin-off dell’Istituto Nazionale di Geofisica e Vulcanologia(http://www.ingv.it/en/) Attività realizzate: ▪ Sviluppo business e networking con aziende e organismi di ricerca a livello internazionale nei settori: aerospazio, minerario e ambient. ▪ Gestione dei rapporti con reti di imprese: Cluster Aerospaziale della Sardegna, Associazione Lazio Connect (Settore aerospaziale), Rete d’imprese ATEN-IS Lazio nel settore aerospaziale, Associazione Italia-Cina CinItaly nei settori aerospaziale, ICT e Ambiente. ▪ Gestione progetti europei di R&S (H2020, ESA, EIT) nei settori Aerospazio, Minerario e Ambiente: ▪ Scrittura di business plan e analisi di mercato propedeutiche allo sviluppo business. ▪ Trasferimento tecnologico, brevettazione e valorizzazione dei risultati della ricerca ▪ Internazionalizzazione e strumenti di finanza innovativa ▪ Scouting, selezione e scrittura di proposte di progetto e offerte commerciali in risposta a bandi e gare regionali, nazionali ed internazionali. ▪ Esperto valutatore di progetti POR-FESR nel settore Smart Cities&Communities per la regione Sicilia. ▪ Docente di strumenti di finanza innovativa e di sviluppo manageriale per le start-up ▪ Docente di sistemi satellitari a supporto dell’agricoltura, dei rischi ambientali e del cambiamento climatico Attività o settore: Sviluppo business e formazione a livello nazionale ed internazionale verso clienti dei settori aerospazio, ambiente e minerario. Da 07/2014 Senior Advisor Leoni Corporate Advisors (http://www.corporate-advisors.eu/) – Milan, Italy.
- 
												  Web Data Extraction, Applications and Techniques: a SurveyWeb Data Extraction, Applications and Techniques: A Survey Emilio Ferraraa,∗, Pasquale De Meob, Giacomo Fiumarac, Robert Baumgartnerd aCenter for Complex Networks and Systems Research, Indiana University, Bloomington, IN 47408, USA bUniv. of Messina, Dept. of Ancient and Modern Civilization, Polo Annunziata, I-98166 Messina, Italy cUniv. of Messina, Dept. of Mathematics and Informatics, viale F. Stagno D'Alcontres 31, I-98166 Messina, Italy dLixto Software GmbH, Austria Abstract Web Data Extraction is an important problem that has been studied by means of different scientific tools and in a broad range of applications. Many approaches to extracting data from the Web have been designed to solve specific problems and operate in ad-hoc domains. Other approaches, instead, heavily reuse techniques and algorithms developed in the field of Information Extraction. This survey aims at providing a structured and comprehensive overview of the literature in the field of Web Data Extraction. We provided a simple classification framework in which existing Web Data Extraction applications are grouped into two main classes, namely applications at the Enterprise level and at the Social Web level. At the Enterprise level, Web Data Extraction techniques emerge as a key tool to perform data analysis in Business and Competitive Intelligence systems as well as for business process re-engineering. At the Social Web level, Web Data Extraction techniques allow to gather a large amount of structured data continuously generated and disseminated by Web 2.0, Social Media and Online Social Network users and this offers unprecedented opportunities to analyze human behavior at a very large scale. We discuss also the potential of cross-fertilization, i.e., on the possibility of re-using Web Data Extraction techniques originally designed to work in a given domain, in other domains.
- 
												  Design and Implementation of a Nosql Database for Decision Support in R&D ManagementUniversity of Magdeburg School of Computer Science Databases D B and Software S E Engineering Master's Thesis Design and Implementation of a NoSQL Database for Decision Support in R&D Management Author: Prem Sagar Jeevangekar April 23, 2019 Advisors: Prof. Dr. rer. nat. habil. Gunter Saake Dipl.-Inf. Wolfram Fenske Department of Computer Science Dr. rer. nat. Matthias Plaue MAPEGY GmbH Jeevangekar, Prem Sagar: Design and Implementation of a NoSQL Database for Decision Support in R&D Management Master's Thesis, University of Magdeburg, 2019. Abstract The concept of database is introduced in the early 1960s. The relational database came into the picture in the early 1970s. The relational database had a great im- pact on data storage ever since. However, due to an increase in data in the modern database world, it leads to developing more efficient database technologies to deal with the usage of exponentially increased data. When working with the structured datasets, relational databases are more reliable and efficient. However, the database lacks its efficiency when huge unstructured data is produced from real-world ap- plications. To overcome the problems faced by the relational databases, compa- nies started looking for more reliable, flexible, highly scalable and high-performance databases. In the early 2000s, the NoSQL databases were introduced and gained huge popularity in a short period of time. The main aim of the thesis is to design and implement the NoSQL database (Mon- goDB) and investigate its performance in terms of query execution speed. This thesis provides an overview of one of the relational database (PostgreSQL) and investigate the performance of NoSQL database (MongoDB).
- 
												  CDR Data Management Plan: Data Survey and System DesignFinal Report CDR Data Management Plan: Data Survey and System Design Submitted to the Texas General Land Office by The University of Texas at Austin Center for Space Research Work leading to the report received funding from the Texas General Land Office through GLO Contract No. 18-229-000-B854. October 2020 Table of Contents Executive Summary ........................................................................................................................ 4 Prologue .......................................................................................................................................... 8 Chapter 1. Survey of Data for Response, Recovery and Mitigation ............................................ 10 Geodetic Control .................................................................................................................................... 13 Bathymetry .............................................................................................................................................. 14 Topography ............................................................................................................................................. 15 High-Resolution Baseline Imagery ......................................................................................................... 16 Hydrography ........................................................................................................................................... 18 Soils .......................................................................................................................................................
- 
												  Thesis ReferenceThesis Development of methods for the automated annotation of prokaryotic proteomes GATTIKER, Alexandre Abstract L'annotation attachée aux données génomiques représente une aide considérable pour la compréhension de la fonction des gènes. Des méthodes ont été développées pour augmenter l'efficacité, l'étendue, la fiabilité et la cohérence de l'annotation des protéomes procaryotes au sein de la base de connaissances Swiss-Prot. La superposition de méthodes automatisées au processus d'annotation classique n'amène aucune réduction de qualité mais rend au contraire les données plus cohérentes et reproductibles. La complémentarité entre l'expertise des annotateurs et des méthodes automatiques a été maximisée en utilisant un système de règles d'annotation maintenues manuellement. Des outils ont été créés pour maximiser le débit du travail des experts travaillant sur la base de connaissance et un système automatique permet l'annotation de sous-ensembles particuliers des protéomes procaryotes. Le système a été ensuite étendu pour gérer également les protéines d'origine eucaryote et pour annoter partiellement des protéines complexes sur la base de leurs domaines et sites fonctionnels. Reference GATTIKER, Alexandre. Development of methods for the automated annotation of prokaryotic proteomes. Thèse de doctorat : Univ. Genève, 2005, no. Sc. 3616 URN : urn:nbn:ch:unige-7263 DOI : 10.13097/archive-ouverte/unige:726 Available at: http://archive-ouverte.unige.ch/unige:726 Disclaimer: layout of this document may differ from the published version. 1 / 1 UNIVERSITÉ DE GENÈVE Département de biologie structurale FACULTÉ DE MÉDECINE et bioinformatique Professeur Amos Bairoch Département d’informatique FACULTÉ DES SCIENCES Professeur Ron D. Appel Development of methods for the automated annotation of prokaryotic proteomes THÈSE présentée à la Faculté des sciences de l’Université de Genève pour obtenir le grade de Docteur ès sciences, mention bioinformatique par ALEXANDRE GATTIKER de Küsnacht (ZH) Thèse N o 3616 GENÈVE 2005 La Faculté des sciences, sur le préavis de Messieurs A.
- 
												  Web Scraping, Applications and ToolsWEB SCRAPING, APPLICATIONS AND TOOLS European Public Sector Information Platform Topic Report No. 2015 / 10 Web Scraping: Applications and Tools Author: Osmar Castrillo-Fernández Published: December 2015 ePSIplatform Topic Report No. 2015 / 10 , December 2015 1 WEB SCRAPING, APPLICATIONS AND TOOLS Table of Contents Table of Contents ...................................................................................................................... 2 Keywords ................................................................................................................................... 3 Abstract/ Executive Summary ................................................................................................... 3 1 Introduction ............................................................................................................................ 4 2 What is web scraping? ............................................................................................................ 6 The necessity to scrape web sites and PDF documents ........................................................ 6 The API mana ........................................................................................................................ 7 3 Web scraping tools ................................................................................................................. 8 Partial tools for PDF extraction ............................................................................................. 8 Partial tools to extract from HTML .......................................................................................
- 
												  Industrial Equipment Manufacturing Success CasesIndustrial Equipment Manufacturing Success Cases Electronic Manufacturing Semiconductor Food and Beverage Components Automated Guided Vehicle Test and Measurements www.advantech.com Table of Contents Founded in 1983, Advantech is a leader in providing trusted, innovative embedded and automation products and solutions. Advantech offers comprehensive system integration, hardware and software, customer-centric design services, as well as global logistics support, all backed by industry-leading front and back office e-business solutions. We cooperate closely with our partners to help provide complete solutions for a wide variety of applications across a diverse range of industries. Advantech has always been an innovator in the development and manufacturing of top-quality, high-performance computing platforms, and our mission is to continue to drive innovation by offering trustworthy automation products and services. With Advantech, there is no limit to the applications and innovations our products make possible. Electronic Semiconductor Food and Manufacturing Beverage Production Line Solution for Mobile 4 Internet Solution for PCB Equipment 22 Automatic Vision Inspection Solution 38 Phone Ceramic Covers for Product Traceability in the Food High-Speed Adhesive Dispenser 24 and Beverage Industry Motor Control Integrated Solution for 6 Solution High-Precision Dual-Channel Adhesive Advantech Empowers Smart Pharma 40 Dispensers Advantech IC Test Handler Machine 26 Manufacturing Equipment and Automation Solution Machinery A FOG Vision Control Solution
- 
												  Pre-Workshop DocumentASCR Workshop on In Situ Data Management Bethesda North Marriott Hotel and Conference Center 5701 Marinelli Road, Rockville, MD 20852 January 28 - 29, 2019 https://www.orau.gov/insitudata2019/ Pre-Workshop Document 1. Introduction The Department of Energy (DOE) Office of Advanced Scientific Computing Research (ASCR) will convene a workshop on In Situ Data Management (ISDM) on January 28-29, 2019 at the Bethesda North Marriott in Rockville, MD. This document provides background information on ISDM, information about the purpose of workshop and expected outcomes, as well as some logistics information for participants. Purpose of the Workshop Scientific computing will increasingly incorporate a number of different tasks that need to be managed along with the main simulation tasks. For example, this year’s SC18 agenda featured in situ infrastructures, in situ analytics, big data analytics, workflows, data intensive science, machine learning, deep learning, and graph analytics—all nontraditional applications unheard of in an HPC conference just a few years ago. Perhaps most surprising, more than half of the 2018 Gordon Bell finalists featured some form of artificial intelligence, deep learning, graph analysis, or experimental data analysis in conjunction with or instead of a single computational model that solves a system of differential equations. We define ISDM as the practices, capabilities, and procedures to control the organization of data and enable the coordination and communication among heterogeneous tasks, executing simultaneously in an HPC system, cooperating toward a common objective. This workshop considers In Situ Data Management beyond the traditional roles of accelerating simulation I/O and visualizing simulation results, to more broadly support future scientific computing needs.
- 
												  Lessons on Data Management Practices for Local Data IntermediariesLESSONS ON DATA MANAGEMENT PRACTICES FOR LOCAL DATA INTERMEDIARIES AUGUST 2017 Rob Pitingolo ACKNOWLEDGMENTS I appreciate the thoughtful suggestions on the draft guide from Minh Wendt and her colleagues in the Office of Minority Health at the US Department of Health and Human Services, Bob Gradeck of the University of Pittsburgh Center for Social and Urban Research, Noah Urban and Stephanie Quesnelle of Data Driven Detroit, and April Urban of Case Western Reserve University. Leah Hendey and Kathryn Pettit (technical reviewer) at the Urban Institute also provided valuable comments. We are grateful to the staff at Data Driven Detroit, NeighborhoodInfo DC, Metropolitan Area Planning Council, and the Community Research Institute at Grand Valley State University for generously sharing information for the case studies. This guide was supported by the Office of Minority Health at the US Department of Health and Human Services. The views expressed are those of the author and do not necessarily represent those of the Office of Minority Health, Data Driven Detroit, the Urban Institute, its trustees, or its funders. Copyright © 2017 by the Urban Institute. Permission is granted for reproduction of this file, with attribution to the Urban Institute. Table of Contents Introduction ................................................................................................................... 3 Section 1: The ETL Process ............................................................................................ 5 Getting organized .................................................................................................................
- 
												  Recent Findings in Intelligent Computing Techniques Proceedings of the 5Th ICACNI 2017, Volume 2 Advances in Intelligent Systems and ComputingAdvances in Intelligent Systems and Computing 708 Pankaj Kumar Sa · Sambit Bakshi Ioannis K. Hatzilygeroudis Manmath Narayan Sahoo Editors Recent Findings in Intelligent Computing Techniques Proceedings of the 5th ICACNI 2017, Volume 2 Advances in Intelligent Systems and Computing Volume 708 Series editor Janusz Kacprzyk, Systems Research Institute, Polish Academy of Sciences, Warsaw, Poland e-mail: [email protected] The series “Advances in Intelligent Systems and Computing” contains publications on theory, applications, and design methods of Intelligent Systems and Intelligent Computing. Virtually all disciplines such as engineering, natural sciences, computer and information science, ICT, economics, business, e-commerce, environment, healthcare, life science are covered. The list of topics spans all the areas of modern intelligent systems and computing such as: computational intelligence, soft computing including neural networks, fuzzy systems, evolutionary computing and the fusion of these paradigms, social intelligence, ambient intelligence, computational neuroscience, artificial life, virtual worlds and society, cognitive science and systems, Perception and Vision, DNA and immune based systems, self-organizing and adaptive systems, e-Learning and teaching, human-centered and human-centric computing, recommender systems, intelligent control, robotics and mechatronics including human-machine teaming, knowledge-based paradigms, learning paradigms, machine ethics, intelligent data analysis, knowledge management, intelligent agents, intelligent decision making and support, intelligent network security, trust management, interactive entertainment, Web intelligence and multimedia. The publications within “Advances in Intelligent Systems and Computing” are primarily proceedings of important conferences, symposia and congresses. They cover significant recent developments in the field, both of a foundational and applicable character. An important characteristic feature of the series is the short publication time and world-wide distribution.