The Effect of Knowledge Management Systems On

Total Page:16

File Type:pdf, Size:1020Kb

The Effect of Knowledge Management Systems On THE ARTS This PDF document was made available from www.rand.org as a public CHILD POLICY service of the RAND Corporation. CIVIL JUSTICE EDUCATION ENERGY AND ENVIRONMENT Jump down to document6 HEALTH AND HEALTH CARE INTERNATIONAL AFFAIRS NATIONAL SECURITY The RAND Corporation is a nonprofit research POPULATION AND AGING organization providing objective analysis and effective PUBLIC SAFETY solutions that address the challenges facing the public SCIENCE AND TECHNOLOGY and private sectors around the world. SUBSTANCE ABUSE TERRORISM AND HOMELAND SECURITY TRANSPORTATION AND INFRASTRUCTURE Support RAND WORKFORCE AND WORKPLACE Browse Books & Publications Make a charitable contribution For More Information Visit RAND at www.rand.org Explore Pardee RAND Graduate School View document details Limited Electronic Distribution Rights This document and trademark(s) contained herein are protected by law as indicated in a notice appearing later in this work. This electronic representation of RAND intellectual property is provided for non-commercial use only. Unauthorized posting of RAND PDFs to a non-RAND Web site is prohibited. RAND PDFs are protected under copyright law. Permission is required from RAND to reproduce, or reuse in another form, any of our research documents for commercial use. For information on reprint and linking permissions, please see RAND Permissions. This product is part of the Pardee RAND Graduate School (PRGS) dissertation series. PRGS dissertations are produced by graduate fellows of the Pardee RAND Graduate School, the world’s leading producer of Ph.D.’s in policy analysis. The dissertation has been supervised, reviewed, and approved by the graduate fellow’s faculty committee. The Effect of Knowledge Management Systems on Organizational Performance Do Soldier and Unit Counterinsurgency Knowledge and Performance Improve Following “Push” or “Adaptive-Push” Training? S. Jamie Gayton This document was submitted as a dissertation in June 2009 in partial fulfillment of the requirements of the doctoral degree in public policy analysis at the Pardee RAND Graduate School. The faculty committee that supervised and approved the dissertation consisted of Bryan W. Hallmark (Chair), James T. Quinlivan, and Matthew W. Lewis. PARDEE RAND GRADUATE SCHOOL The Pardee RAND Graduate School dissertation series reproduces dissertations that have been approved by the student’s dissertation committee. The views expressed herein are those of the author and do not reflect the position of the United States Military Academy, the Department of the Army, or the Department of Defense. The RAND Corporation is a nonprofit research organization providing objective analysis and effective solutions that address the challenges facing the public and private sectors around the world. RAND’s publications do not necessarily reflect the opinions of its research clients and sponsors. R® is a registered trademark. All rights reserved. No part of this book may be reproduced in any form by any electronic or mechanical means (including photocopying, recording, or information storage and retrieval) without permission in writing from RAND. Published 2009 by the RAND Corporation 1776 Main Street, P.O. Box 2138, Santa Monica, CA 90407-2138 1200 South Hayes Street, Arlington, VA 22202-5050 4570 Fifth Avenue, Suite 600, Pittsburgh, PA 15213-2665 RAND URL: http://www.rand.org To order RAND documents or to obtain additional information, contact Distribution Services: Telephone: (310) 451-7002; Fax: (310) 451-6915; Email: [email protected] - iii - PREFACE My interest in Knowledge Management Systems stems from my experience commanding a battalion during Operation Iraqi Freedom III in Baghdad, Iraq. Prior to deployment, I wanted to provide my leaders the most up to date information available on the enemy’s use of improvised explosive devices (IEDs). In order to acquire and organize this type of information, I went to the Army’s Center for Army Lessons Learned (CALL) website and began searching through myriad data on the subject. When I called the organization to find their synthesized, pruned, “push” version of the slides that would contain the information I requested, I was directed back to the myriad data. I was stymied. Despite tremendous amounts of available data, the organization was counting on an inefficient strategy in which each leader used his time to synthesize and prune the data to prepare it for his subordinates. I noted at the time that a more efficient and effective means of distributing tactical information must be possible and planned to engage this possibility at some point in the future. Upon reassignment to the Pardee RAND Graduate School to pursue a doctoral degree, I had the opportunity to study the possibility of changing aspects of the Army’s knowledge management systems. There are leaders in the Army who are proponents of the SWfF concept and accept without empirical support that organizations that use the SWfF tools, including push tools, will increase their performance. At the same time, there are other leaders who question the need for the SWfF organization and in particular the underlying assumption of increased performance of units as they use the tools. My knowledge management research opportunity came in the form of a RAND Arroyo project to conduct an empirical assessment of the Stryker Warfighters’ Forum (SWfF) organization’s contribution to learning for soldiers and units preparing for a contingency deployment. During my project research, I became specifically interested in how the delivery method of information would affect performance. This dissertation was submitted to the Pardee RAND Graduate School in 2009 in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Policy Analysis. The dissertation examined the relationship between the use of knowledge management systems and the subsequent gain in knowledge by individuals and the increase in performance by organizations. Specifically, I empirically assessed the “push” delivery - iv - method (leaders identify and push relevant information to subordinates) and “adaptive-push” delivery method (leaders and subordinates interact to identify relevant information and leaders push to subordinates). I examined this relationship by empirically assessing the knowledge gain in soldiers and the performance increase by units through two studies. In the first study, soldiers received information using a push delivery method and in the second study they received information through an adaptive-push delivery method. - v - CONTENTS Preface........................................................................................................................................................... iii Figures............................................................................................................................................................ix Tables .............................................................................................................................................................xi Abstract ....................................................................................................................................................... xiii Acknowledgments.........................................................................................................................................xv Acronyms and Glossary ...............................................................................................................................xix 1. Specific Aims and Research Questions......................................................................................................1 Introduction.......................................................................................................................................1 Broad policy issue and policy questions ...........................................................................................2 SWfF policy issue, policy questions, and research questions ...........................................................5 Specific aims and desired outcomes of KMS research .....................................................................6 2. Background, Significance, Motivation, and Literature Review ...........................................................8 Army’s success with major combat operations.................................................................................9 Counterinsurgency and stability operations become the norm........................................................10 The Army transitions to a new vehicle – Stryker - and establishes SWfF......................................11 SWfF history, mission, and supporting tasks..................................................................................13 SWfF oversight, direction, and a desire for assessment/validation.................................................15 SWfF validation significance ....................................................................................................16 The Army needs a better method to capture untapped experience..................................................17 Units have limited time to prepare for future deployments .......................................................17 The expertise developed during deployments is an untapped pool of knowledge to train soldiers that is not effectively utilized today........................................................................18 KMS could be a method to systematically capture, synthesize, and transfer
Recommended publications
  • Ts 124 623 V9.3.0 (2011-10)
    ETSI TS 124 623 V9.3.0 (2011-10) Technical Specification Digital cellular telecommunications system (Phase 2+); Universal Mobile Telecommunications System (UMTS); LTE; Extensible Markup Language (XML) Configuration Access Protocol (XCAP) over the Ut interface for Manipulating Supplementary Services (3GPP TS 24.623 version 9.3.0 Release 9) 3GPP TS 24.623 version 9.3.0 Release 9 1 ETSI TS 124 623 V9.3.0 (2011-10) Reference RTS/TSGC-0124623v930 Keywords GSM,LTE,UMTS ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Siret N° 348 623 562 00017 - NAF 742 C Association à but non lucratif enregistrée à la Sous-Préfecture de Grasse (06) N° 7803/88 Important notice Individual copies of the present document can be downloaded from: http://www.etsi.org The present document may be made available in more than one electronic version or in print. In any case of existing or perceived difference in contents between such versions, the reference version is the Portable Document Format (PDF). In case of dispute, the reference shall be the printing on ETSI printers of the PDF version kept on a specific network drive within ETSI Secretariat. Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at http://portal.etsi.org/tb/status/status.asp If you find errors in the present document, please send your comment to one of the following services: http://portal.etsi.org/chaircor/ETSI_support.asp Copyright Notification No part may be reproduced except as authorized by written permission.
    [Show full text]
  • Knowledge Extraction for Hybrid Question Answering
    KNOWLEDGEEXTRACTIONFORHYBRID QUESTIONANSWERING Von der Fakultät für Mathematik und Informatik der Universität Leipzig angenommene DISSERTATION zur Erlangung des akademischen Grades Doctor rerum naturalium (Dr. rer. nat.) im Fachgebiet Informatik vorgelegt von Ricardo Usbeck, M.Sc. geboren am 01.04.1988 in Halle (Saale), Deutschland Die Annahme der Dissertation wurde empfohlen von: 1. Professor Dr. Klaus-Peter Fähnrich (Leipzig) 2. Professor Dr. Philipp Cimiano (Bielefeld) Die Verleihung des akademischen Grades erfolgt mit Bestehen der Verteidigung am 17. Mai 2017 mit dem Gesamtprädikat magna cum laude. Leipzig, den 17. Mai 2017 bibliographic data title: Knowledge Extraction for Hybrid Question Answering author: Ricardo Usbeck statistical information: 10 chapters, 169 pages, 28 figures, 32 tables, 8 listings, 5 algorithms, 178 literature references, 1 appendix part supervisors: Prof. Dr.-Ing. habil. Klaus-Peter Fähnrich Dr. Axel-Cyrille Ngonga Ngomo institution: Leipzig University, Faculty for Mathematics and Computer Science time frame: January 2013 - March 2016 ABSTRACT Over the last decades, several billion Web pages have been made available on the Web. The growing amount of Web data provides the world’s largest collection of knowledge.1 Most of this full-text data like blogs, news or encyclopaedic informa- tion is textual in nature. However, the increasing amount of structured respectively semantic data2 available on the Web fosters new search paradigms. These novel paradigms ease the development of natural language interfaces which enable end- users to easily access and benefit from large amounts of data without the need to understand the underlying structures or algorithms. Building a natural language Question Answering (QA) system over heteroge- neous, Web-based knowledge sources requires various building blocks.
    [Show full text]
  • D 5.2.2 Semantic Annotation 2
    SEMANTIC ANNOTATION 2 MARCH 2017 DELIVERABLE Project Acronym: SDI4Apps Grant Agreement number: 621129 Project Full Title: Uptake of Open Geographic Information Through Innovative Services Based on Linked Data D5.2.2 SEMANTIC ANNOTATION 2 Revision no. 03 Authors: Otakar Čerba (University of West Bohemia) Project co-funded by the European Commission within the ICT Policy Support Programme Dissemination Level P Public X C Confidential, only for members of the consortium and the Commission Services D5.2.2 Semantic Annotation 2 REVISION HISTORY Revision Date Author Organisation Description 01 07/03/2016 Otakar Čerba UWB Initial draft 02 29/03/2016 Martin Tuchyna, SAZP/CCSS Internal review Karel Charvát 03 30/03/2016 Otakar Čerba UWB Final version Statement of originality: This deliverable contains original unpublished work except where clearly indicated otherwise. Acknowledgement of previously published material and of the work of others has been made through appropriate citation, quotation or both. Disclaimer: Views expressed in this document are those of the individuals, partners or the consortium and do not represent the opinion of the Community. Page 3 of 15 © SDI4Apps Consortium 2017 D5.2.2 Semantic Annotation 2 TABLE OF CONTENTS Revision History ................................................................................................................. 3 Table of Contents .............................................................................................................. 4 List of Figures ..................................................................................................................
    [Show full text]
  • Applications of Smart Points of Interest
    Applications of Smart Points Of Interest Otakar Čerba University of West Bohemia, Plzeň, Czech Republic Outline I About Smart Points of Interest (SPOI) I Examples of applications – map client I Future steps of SPOI development Motivation – Why did we start to develop SPOI 1. Request for open data for tourism, including promotion of services or regions → data focused on interesting places or features (points of interest) 2. Need to interconnect this data with existing data, but also non-structured information (photos, maps, text documents, records. ) → Linked Data approach 3. Demand on seamless data (without state borders) → one global dataset enabling any type of data querying (GeoSPARQL) 4. Low costs on data production and management → data from various existing resources, but published in the uniform data model and with use standards 5. Possibility of an attractive visualization (maps), integration to external application and simple editing → support by independent simple tools based on standards Smart Points of Interest (SPOI) I ~27 000 000 points published as 5* Linked Open Data I Data access I Map client (within editing) I SPARQL endpoint (results exported to JSON, CSV, RDF. ) I Web page: gis.zcu.cz/spoi I SPOI description I Links to map client and SPARQL endpoint I Data model, harmonization scheme, list of changes, metadata DOAP a VoID SPOI Populating I 49 external data resources I Global datasets – OpenStreetMap, Natural Earth, GeoNames.org I Local data from Citadel on the Move project I Local data – small regions from Italy, Belgium,
    [Show full text]
  • Collaborative Writing with Web 2.0 Technologies: Education Students’ Perceptions
    Journal of Information Technology Education: Volume 10, 2011 Innovations in Practice Collaborative Writing with Web 2.0 Technologies: Education Students’ Perceptions Cornelia Brodahl, Said Hadjerrouit, and Nils Kristian Hansen University of Agder, Kristiansand, Norway [email protected] [email protected] [email protected] Executive Summary Web 2.0 technologies are becoming popular in teaching and learning environments. Among them several online collaborative writing tools, like wikis and blogs, have been integrated into educa- tional settings. Research has been carried out on a wide range of subjects related to wikis, while other, comparable tools like Google Docs and EtherPad remain largely unexplored in the litera- ture. This work presents a case study investigating education students’ perceptions of collabora- tive writing using Google Docs and EtherPad. Both tools provide opportunity for multiple users to work on the same document simultaneously, have a separate space for written metacommuni- cation, and are promoted by software designers to be fairly intuitive to adopt without prior train- ing. The work investigates if perceptions depend on factors such as gender, age, digital compe- tence, interest in digital tools, educational settings, and choice of writing tool, and examines if the tools are easy to use and effective in group work. This paper focuses on quantitative results of survey questionnaires. Further qualitative analysis will be presented in a later paper. The theoretical framework is drawn from two learning theories, the social-constructivist learning theory and the community of practice, and their relationships to collaborative tools. Related re- search literature is characterized by a number of issues: positive elements of use, advantages of using Web 2.0 technologies, critical issues regarding the pedagogical value of Web 2.0, and the role of the teacher in using these technologies.
    [Show full text]
  • A Framework Supporting Development of Ontology-Based Web Applications
    University of New Orleans ScholarWorks@UNO University of New Orleans Theses and Dissertations Dissertations and Theses 12-17-2010 A Framework Supporting Development of Ontology-Based Web Applications Shireesha Tankashala University of New Orleans Follow this and additional works at: https://scholarworks.uno.edu/td Recommended Citation Tankashala, Shireesha, "A Framework Supporting Development of Ontology-Based Web Applications" (2010). University of New Orleans Theses and Dissertations. 103. https://scholarworks.uno.edu/td/103 This Thesis-Restricted is protected by copyright and/or related rights. It has been brought to you by ScholarWorks@UNO with permission from the rights-holder(s). You are free to use this Thesis-Restricted in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s) directly, unless additional rights are indicated by a Creative Commons license in the record and/or on the work itself. This Thesis-Restricted has been accepted for inclusion in University of New Orleans Theses and Dissertations by an authorized administrator of ScholarWorks@UNO. For more information, please contact [email protected]. A Framework Supporting Development of Ontology-Based Web Applications A Thesis Submitted to the Graduate Faculty of the University of New Orleans in partial fulfillment of the requirements for the degree of Master of Science in Computer Science by Shireesha M Tankashala G.Narayanamma Institute of Technology, India, 2006 December 2010 ACKNOWLEDGEMENT I would like to express my heartfelt gratitude to my major professor Dr. Shengru Tu for his belief and substantial support throughout my work.
    [Show full text]
  • Voidext: Vocabulary and Patterns for Enhancing Interoperable Datasets with Virtual Links
    VoIDext: Vocabulary and Patterns for Enhancing Interoperable Datasets with Virtual Links Tarcisio M. de Farias1;2( ), Kurt Stockinger4, and Christophe Dessimoz1;2;3 1 SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland 2 University of Lausanne, Switzerland 3 University College London, UK 4 Zurich University of Applied Sciences, Winterthur, Switzerland [email protected] Abstract. Semantic heterogeneity remains a problem when interoper- ating with data from sources of different scopes and knowledge domains. Causes for this challenge are context-specific requirements (i.e. no \one model fits all"), different data modelling decisions, domain-specific pur- poses, and technical constraints. Moreover, even if the problem of se- mantic heterogeneity among different RDF publishers and knowledge domains is solved, querying and accessing the data of distributed RDF datasets on the Web is not straightforward. This is because of the com- plex and fastidious process needed to understand how these datasets can be related or linked, and consequently, queried. To address this issue, we propose to extend the existing Vocabulary of Interlinked Datasets (VoID) by introducing new terms such as the Virtual Link Set concept and data model patterns. A virtual link is a connection between re- sources such as literals and IRIs (Internationalized Resource Identifier) with some commonality where each of these resources is from a differ- ent RDF dataset. The links are required in order to understand how to semantically relate datasets. In addition, we describe several benefits of using virtual links to improve interoperability between heterogenous and independent datasets. Finally, we exemplify and apply our approach to multiple world-wide used RDF datasets.
    [Show full text]
  • Raising Risk Awareness on the Adoption of Web 2.0 Technologies in Decision Making Processes
    Future Internet 2012, 4, 700-718; doi:10.3390/fi4030700 OPEN ACCESS future internet ISSN 1999-5903 www.mdpi.com/journal/futureinternet Article Raising Risk Awareness on the Adoption of Web 2.0 Technologies in Decision Making Processes Marco Prandini 1;* and Marco Ramilli 2 1 DEIS, Universita` di Bologna, Viale del Risorgimento 2, Bologna 40136, Italy 2 DEIS, Universita` di Bologna, Via Venezia 52, Cesena 47521, Italy; E-Mail:[email protected] * Author to whom correspondence should be addressed; E-Mail: [email protected]; Tel.: +39-051-209-3867; Fax: +39-051-209-3073. Received: 27 April 2012; in revised form: 14 July 2012 / Accepted: 19 July 2012 / Published: 9 August 2012 Abstract: In the recent past, the so-called “Web 2.0” became a powerful tool for decision making processes. Politicians and managers, seeking to improve participation, embraced this technology as if it simply were a new, enhanced version of the World Wide Web, better suited to retrieve information, opinions and feedbacks from the general public on subjects like laws, acts and policies. This approach was often naive, neglecting the less-obvious aspects of the technology, and thus bringing on significant security problems. This paper shows how, in the end, the result could easily be the opposite of what was desired. Malicious attackers, in fact, could quite easily exploit the vulnerabilities in these systems to hijack the process and lead to wrong decisions, also causing the public to lose trust in the systems themselves. Keywords: Web 2.0; e-government; security; decision making 1. Introduction In 1999 Darcy DiNucci introduced the term Web 2.0 in her article entitled: “Fragmented Future” [1].
    [Show full text]
  • Lecture Notes
    cis3.5 fall2009 lecture II.1 what is a network? topics: • when computers talk to each other, this is called a network • internet overview • the network can have different kinds of computers and peripherals attached to it • creating graphics on the internet with “Processing” • networks in which computers are physically connected to each other in the close resources: geographical proximity are called local area networks (LANs) • Processing web site: http://www.processing.org/ • other networks are called wide area networks (WANs) • getting started tutorial: http://www.processing.org/learning/gettingstarted/ • the internet is a wide area network • drawing tutorial: http://www.processing.org/learning/drawing/ • the internet is an open system = “a system whose architecture is not a secret” • reference: http://www.processing.org/reference/index.html • protocol = set of rules for how computers communicate with each other; for example: – TCP: transmission control protocol (computer ↔ computer) – IP: internet protocol (computer ↔ computer) – HTTP: hypertext transfer protocol (computer ↔ browser) – FTP: file transfer protocol (computer ↔ computer) – SMTP: simple mail transfer protocol (computer ↔ mail client) cis3.5-fall2009-sklar-lecII.1 1 cis3.5-fall2009-sklar-lecII.1 2 what is the internet? some internet facilities • history • the world wide web – ARPAnet (circa 1971): used “NCP” – HTML = hypertext markup language – TCP (1974): hardware independent, open – hyperlink – internet was standardized in September 1981 – browser • the internet is NOT the
    [Show full text]
  • HTTP Protocol Binding
    ONEM2M TECHNICAL SPECIFICATION Document Number TS-0009-V2.6.1 Document Name: HTTP Protocol Binding Date: 2016-August-30 Abstract: HTTP Protocol Binding TS This Specification is provided for future development work within oneM2M only. The Partners accept no liability for any use of this Specification. The present document has not been subject to any approval process by the oneM2M Partners Type 1. Published oneM2M specifications and reports for implementation should be obtained via the oneM2M Partners' Publications Offices. © oneM2M Partners Type 1 (ARIB, ATIS, CCSA, ETSI, TIA, TSDSI, TTA, TTC) Page 1 of 20 This is a draft oneM2M document and should not be relied upon; the final version, if any, will be made available by oneM2M Partners Type 1. About oneM2M The purpose and goal of oneM2M is to develop technical specifications which address the need for a common M2M Service Layer that can be readily embedded within various hardware and software, and relied upon to connect the myriad of devices in the field with M2M application servers worldwide. More information about oneM2M may be found at: http//www.oneM2M.org Copyright Notification No part of this document may be reproduced, in an electronic retrieval system or otherwise, except as authorized by written permission. The copyright and the foregoing restriction extend to reproduction in all media. © 2016, oneM2M Partners Type 1 (ARIB, ATIS, CCSA, ETSI, TIA, TSDSI, TTA, TTC). All rights reserved. Notice of Disclaimer & Limitation of Liability The information provided in this document is directed solely to professionals who have the appropriate degree of experience to understand and interpret its contents in accordance with generally accepted engineering or other professional standards and applicable regulations.
    [Show full text]
  • Server Program Analysis Based on HTTP Protocol Min CHE1 Ming Fu TUO1 1College of Science, Air Force Engineering University 710051 Xi’An, China
    MATEC Web of Conferences 63, 05023 (2016) DOI: 10.1051/matecconf/20166305023 MMME 2016 Server Program Analysis Based on HTTP Protocol Min CHE1 Ming Fu TUO1 1College of Science, Air Force Engineering University 710051 Xi’an, China Abstract—The overwhelming majority of Web developments are built based on Web application of HTTP protocol. To analyze the server program based on the HTTP protocol and the process of interaction between the browser and server, a simple HTTP-based server program is taken as an example to describe how to use the theories of multi- threading and asynchronous operationˈand the core section of HTTP-based server program is gradually resolved with Winsock programming. This method can achieve the purpose to avoid blocking the calling threadˈand improve the responsiveness of a server program. Keywords- HTTP; HTTP protocol; Server; WinSock 1 INTRODUCTION 2.2. HTTP response consists of three components namely the status line,message header,and response body . Hypertext Transfer Protocol (HTTP) is the protocol of web’s application layer. It is the web's core issue. HTTP can be The format of status line is: HTTP-Version Status-Code Rea- realized in the web client’s program or server’s program. The son-Phrase CRLF. HTTP-Version indicates the version of programs running on different end systems of client and server HTTP protocol used in web server; Status-Code indicates the can exchange with each other through exchanging HTTP status code of a response sent back from web server; Reason- message. Web Page is also called document composed of Phrase indicates the text description of status code.
    [Show full text]
  • Creating Void Descriptions for Web-Scale Data
    Creating voiD Descriptions for Web-scale Data Christoph B¨ohm, Johannes Lorey, Felix Naumann Hasso Plattner Institute, Prof.-Dr.-Helmert-Str. 2-3, 14482 Potsdam, Germany fi[email protected] Abstract When working with large amounts of crawled semantic data as provided by the Billion Triple Challenge (BTC), it is desirable to present the data in a manner best suited for end users. This includes conceiving and presenting explanatory metainformation. The Vocabulary of Interlinked Data (voiD) has been proposed as a means to annotate sets of RDF resources to facilitate not only human understanding, but also query optimization. In this article we introduce tools that automatically generate voiD descriptions for large datasets. Our approach comprises different means to identify (sub)datasets and annotate the derived subsets according to the voiD specification. Due to the complexity of Web-scale Linked Data, all algorithms used for partitioning and augmenting are implemented in a cloud environment utilizing the MapReduce paradigm. We employed the Billion Triple Challenge 2010 dataset [6] to evaluate our approach, and present the results in this article. We have released a tool named voiDgen to the public that allows the generation of metainformation for such large datasets. Keywords: Semantic Web; Vocabulary of Interlinked Data; Semantic Data Profiling; RDF Metadata Generation; Cloud Computing 1. Introduction of well-defined metadata expedites the causes of data in- terconnectivity and semantic integration. The Vocabulary Open data emerges from a variety of sources, e.g., gov- of Interlinked Datasets (voiD) addresses this need. ernment agencies, bio-science institutes, social networks, or community-driven knowledge bases.
    [Show full text]