Enterprise Data Analysis and Design

Total Page:16

File Type:pdf, Size:1020Kb

Enterprise Data Analysis and Design Enterprise Data Analysis and Design Lecture 1: Introduction to Database Technology Johannes Gehrke [email protected] http://www.cs.cornell.edu/johannes Course Goals • Architectures of modern enterprise information systems • Understand the functionality of modern database and data mining systems • Understand where database systems and data mining fit into an enterprise information system • Learn to ask the right questions • Learn how to use several important tools • Data modeling (DeZign for Databases) • Data mining (SAS Enterprise Miner) NBA 518 Spring 2004: Lecture 1 2 DeZign for Databases NBA 518 Spring 2004: Lecture 1 3 NBA 518: Enterprise Data Design and Analysis 1 SAS Enterprise Miner NBA 518 Spring 2004: Lecture 1 4 Course Outline • 1/26 Database Management Systems • 1/28 Enterprise Information Architectures • 2/2, 2/4, and 2/9: Data Modeling • 2/11, 2/16, 2/18, 2/23, and 2/25: Data Mining • 3/2 OLAP • 3/4 Web Services • 3/9 Future Trends NBA 518 Spring 2004: Lecture 1 5 Course Mechanics • Temporary course homepage: http://www.cs.cornell.edu/johannes/teaching/NBA518 • Slides will be online the morning before each lecture • Readings for each class will be available online • Office hours: • Tuesdays 1:30-2:30, Upson Hall 4105B • Mondays and Wednesdays from 2:00 – start of class in Sage Hall Atrium • Always welcome to ask questions via email ([email protected]) • Ask questions after the lecture NBA 518 Spring 2004: Lecture 1 6 NBA 518: Enterprise Data Design and Analysis 2 Grading • Five homework assignments: • Enterprise architectures (15%) • Data modeling (15%) • Data mining I: Classification (15%) • Data mining II: Clustering and Associations (15%) • A complete case study (in groups of 2-3 students, 20%) • Class participation (20%): Quality and not quantity counts NBA 518 Spring 2004: Lecture 1 7 Introduction: About the Instructor Johannes Gehrke is an Assistant Professor in the Department of Computer Science at Cornell University. He obtained his Ph.D. in computer science from the University of Wisconsin-Madison in 1999; his graduate studies were supported by a Fulbright fellowship and an IBM fellowship. Johannes' research interests are in the areas of data mining, data stream processing, and distributed data management for sensor networks and peer-to-peer networks. Johannes has received a National Science Foundation Career Award, an Arthur P. Sloan Fellowship, an IBM Faculty Award, and the Cornell College of Engineering James and Mary Tien Excellence in Teaching Award. He is the author of numerous publications on data mining and database systems, and he co-authored the undergraduate textbook Database Management Systems (McGrawHill (2002), currently in its third edition), used at universities all over the world. Johannes has served as Program Co-Chair of the 2001 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, Tutorial Chair for the 2001 IEEE International Conference on Data Mining, Area Chair for the Twentieth International Conference on Machine Learning, co-Chair of the 2003 ACM SIGKDD Cup, and he is serving as Program co-Chair of the 2004 ACM SIGKDD Conference. Johannes has given courses and tutorials on data mining and data stream processing at international conferences and on Wall Street, and he has extensive industry experience as technical advisor. NBA 518 Spring 2004: Lecture 1 8 Introduction: Students NBA 518 Spring 2004: Lecture 1 9 NBA 518: Enterprise Data Design and Analysis 3 Goal of This Lecture • Understand the basic functionality of a database system NBA 518 Spring 2004: Lecture 1 10 The Big Picture WWW Site Internal User Visitor INTRANET, VPN THE WEB Internal Main Public Web Server Web Server Memory Cache Data Business Warehouse Transaction Application DBMS Server Server NBA 518 Spring 2004: Lecture 1 11 Why Database Systems? Discuss with your neighbor: What functionality is required from database systems in the following application scenarios: • EBay (www.ebay.com) • Barnes and Noble (www.bn.com) • General Motors (www.gm.com) • The Protein Data Bank (http://www.rcsb.org/pdb) • Sprint (www.sprint.com) • Your cell phone NBA 518 Spring 2004: Lecture 1 12 NBA 518: Enterprise Data Design and Analysis 4 Why Store Data in a DBMS? • Benefits • Transactions (concurrent data access, recovery from system crashes) • High-level abstractions for data access, manipulation, and administration • Data integrity and security • Performance and scalability NBA 518 Spring 2004: Lecture 1 13 A Digress – What Is a Transaction? The execution of a program that performs a function by accessing a database. Examples: • Reserve an airline seat. Buy an airline ticket. • Withdraw money from an ATM. • Verify a credit card sale. • Order an item from an Internet retailer. • Download a video clip and pay for it. • Play a bid at an on-line auction. NBA 518 Spring 2004: Lecture 1 14 Transactions • A transaction is an atomic sequence of actions • Each transaction must leave the system in a consistent state (if system is consistent when the transaction starts). • The ACID Properties: • Atomicity • Consistency • Isolation • Durability NBA 518 Spring 2004: Lecture 1 15 NBA 518: Enterprise Data Design and Analysis 5 Example Transaction: Online Store Your purchase transaction: • Atomicity: Either the complete purchase happens, or nothing • Consistency: The inventory and internal accounts are updated correctly • Isolation: It does not matter whether other customers are also currently making a purchase • Durability: Once you have received the order confirmation number, your order information is permanent, even if the site crashes NBA 518 Spring 2004: Lecture 1 16 Transactions (Contd.) A transaction will commit after completing all its actions, or it could abort (or be aborted by the DBMS) after executing some actions. NBA 518 Spring 2004: Lecture 1 17 Example Transaction: ATM You withdraw money from the ATM machine • Atomicity • Consistency • Isolation • Durability Commit versus Abort? What are reasons for commit or abort? NBA 518 Spring 2004: Lecture 1 18 NBA 518: Enterprise Data Design and Analysis 6 Transactions: Examples Give examples of transactions in the following applications. Which of the ACID properties are needed? • EBay (www.ebay.com) • Barnes and Noble (www.bn.com) • General Motors (www.gm.com) • The Protein Data Bank (http://www.rcsb.org/pdb) • Sprint (www.sprint.com) • Your cell phone NBA 518 Spring 2004: Lecture 1 19 What Makes Transaction Processing Hard • Reliability - system should rarely fail • Availability - system must be up all the time • Response time - within 1-2 seconds • Throughput - thousands of transactions/second • Scalability - start small, ramp up to Internet-scale • Security – for confidentiality and high finance • Configurability - for above requirements + low cost • Atomicity - no partial results • Durability - a transaction is a legal contract • Distribution - of users and data NBA 518 Spring 2004: Lecture 1 20 Reliability and Availability • Reliability - system should rarely fail • Availability - system must be up all the time Downtime Availability 1 hour/day 95.8% 1 hour/week 99.41% 1 hour/month 99.86% 1 hour/year 99.9886% 1 minute/day 99.9988% 1 hour/20years 99.99942% 1 minute/week 99.99983% NBA 518 Spring 2004: Lecture 1 21 NBA 518: Enterprise Data Design and Analysis 7 Performance • Response time - within 1-2 seconds • Throughput - thousands of transactions/second • Scalability - start small, ramp up to Internet- scale NBA 518 Spring 2004: Lecture 1 22 What Makes TP Important? • It is at the core of electronic commerce • Most medium-to-large businesses use TP for their production systems. The business can’t operate without it. • It is a huge slice of the computer system market — over $50B/year. Probably the single largest application of computers. NBA 518 Spring 2004: Lecture 1 23 TP System Infrastructure • User’s viewpoint • Enter a request from a browser or other display device • The system performs some application-specific work, which includes database accesses • Receive a reply (usually, but not always) • The TP system ensures that each transaction • is an independent unit of work • executes exactly once, and • produces permanent results. • TP system makes it easy to program transactions • TP system has tools to make it easy to manage NBA 518 Spring 2004: Lecture 1 24 NBA 518: Enterprise Data Design and Analysis 8 TP System Infrastructure End-User Presentation Manager Front-End (Client) requests Workflow Control (routes requests) Back-End Transaction Program (Server) NBADatabase 518 Spring 2004: System Lecture 1 25 System Characteristics • Typically < 100 transaction types per application • Transaction size has high variance. Typically, • 0-30 disk accesses • 10K - 1M instructions executed • 2-20 messages • A large-scale example: airline reservations • 150,000 active display devices • plus indirect access via Internet travel agents • thousands of disk drives • 3000 transactions per second, peak NBA 518 Spring 2004: Lecture 1 26 Exercise • Reliability - system should rarely fail • Availability - system must be up all the time • Response time - within 1-2 seconds • Throughput - thousands of transactions/second • Scalability - start small, ramp up to Internet-scale • Security – for confidentiality and high finance • Configurability - for above requirements + low cost • Atomicity - no partial results • Durability - a transaction is a legal contract • Distribution - of users and data • Question: Think of a TP System that you know of, and discuss with your
Recommended publications
  • Big Data Velocity in Plain English
    Big Data Velocity in Plain English John Ryan Data Warehouse Solution Architect Table of Contents The Requirement . 1 What’s the Problem? . .. 2 Components Needed . 3 Data Capture . 3 Transformation . 3 Storage and Analytics . 4 The Traditional Solution . 6 The NewSQL Based Solution . 7 NewSQL Advantage . 9 Thank You. 10 About the Author . 10 ii The Requirement The assumed requirement is the ability to capture, transform and analyse data at potentially massive velocity in real time. This involves capturing data from millions of customers or electronic sensors, and transforming and storing the results for real time analysis on dashboards. The solution must minimise latency — the delay between a real world event and it’s impact upon a dashboard, to under a second. Typical applications include: • Monitoring Machine Sensors: Using embedded sensors in industrial machines or vehicles — typically referred to as The Internet of Things (IoT) . For example Progressive Insurance use real time speed and vehicle braking data to help classify accident risk and deliver appropriate discounts. Similar technology is used by logistics giant FedEx which uses SenseAware technology to provide near real-time parcel tracking. • Fraud Detection: To assess the risk of credit card fraud prior to authorising or declining the transaction. This can be based upon a simple report of a lost or stolen card, or more likely, an analysis of aggregate spending behaviour, aligned with machine learning techniques. • Clickstream Analysis: Producing real time analysis of user web site clicks to dynamically deliver pages, recommended products or services, or deliver individually targeted advertising. Big Data: Velocity in Plain English eBook 1 What’s the Problem? The primary challenge for real time systems architects is the potentially massive throughput required which could exceed a million transactions per second.
    [Show full text]
  • PS Non-Standard Database Systems Overview
    PS Non-Standard Database Systems Overview Daniel Kocher March 2019 is document gives a brief overview on three categories of non-standard database systems (DBS) in the context of this class. For each category, we provide you with a motivation, a discussion about the key properties, a list of commonly used implementations (proprietary and open-source/freely-available), and recommendation(s) for your project. For your convenience, we also recap some relevant terms (e.g., ACID, OLTP, ...). Terminology OLTP Online transaction processing. OLTP systems are highly optimized to process (1) short queries (operating only on a small portion of the database) and (2) small transac- tions (inserting/updating few tuples per relation). Typically, OLTP includes insertions, updates, and deletions. e main focus of OLTP systems is on low response time and high throughput (in terms of transactions per second). Databases in OLTP systems are highly normalized [5]. OLAP Online analytical processing. OLAP systems analyze complex data (usually from a data warehouse). Typically, queries in an OLAP system may run for a long time (as opposed to queries in OLTP systems) and the number of transactions is usually low. Fur- thermore, typical OLAP queries are read-only and touch a large portion of the database (e.g., scan or join large tables). e main focus of OLAP systems is on low response time. Databases in OLAP systems are oen de-normalized intentionally [5]. ACID Traditional relational database systems guarantee ACID properties: – Atomicity All or no operation(s) of a transaction are reected in the database. – Consistency Isolated execution of a transaction preserves consistency of the database.
    [Show full text]
  • Performance Analysis of Blockchain Platforms
    UNLV Theses, Dissertations, Professional Papers, and Capstones August 2018 Performance Analysis of Blockchain Platforms Pradip Singh Maharjan Follow this and additional works at: https://digitalscholarship.unlv.edu/thesesdissertations Part of the Computer Sciences Commons Repository Citation Maharjan, Pradip Singh, "Performance Analysis of Blockchain Platforms" (2018). UNLV Theses, Dissertations, Professional Papers, and Capstones. 3367. http://dx.doi.org/10.34917/14139888 This Thesis is protected by copyright and/or related rights. It has been brought to you by Digital Scholarship@UNLV with permission from the rights-holder(s). You are free to use this Thesis in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s) directly, unless additional rights are indicated by a Creative Commons license in the record and/ or on the work itself. This Thesis has been accepted for inclusion in UNLV Theses, Dissertations, Professional Papers, and Capstones by an authorized administrator of Digital Scholarship@UNLV. For more information, please contact [email protected]. PERFORMANCE ANALYSIS OF BLOCKCHAIN PLATFORMS By Pradip S. Maharjan Bachelor of Computer Engineering Tribhuvan University Institute of Engineering, Pulchowk Campus, Nepal 2012 A thesis submitted in partial fulfillment of the requirements for the Master of Science in Computer Science Department of Computer Science Howard R. Hughes College of Engineering The Graduate College University of Nevada, Las Vegas August 2018 c Pradip S. Maharjan, 2018 All Rights Reserved Thesis Approval The Graduate College The University of Nevada, Las Vegas May 4, 2018 This thesis prepared by Pradip S.
    [Show full text]
  • Evaluating and Comparing Oracle Database Appliance Performance Updated for Oracle Database Appliance X8-2-HA
    Evaluating and Comparing Oracle Database Appliance Performance Updated for Oracle Database Appliance X8-2-HA ORACLE WHITE PAPER | JANUARY 2020 DISCLAIMER The performance results in this paper are intended to give an estimate of the overall performance of the Oracle Database Appliance X8-2-HA system. The results are not a benchmark, and cannot be used to characterize the relative performance of Oracle Database Appliance systems from one generation to another, as the workload characteristics and other environmental factors including firmware, OS, and database patches, which includes Spectre/Meltdown fixes, will vary over time. For an accurate comparison to another platform, you should run the same tests, with the same OS (if applicable) and database versions, patch levels, etc. Do not rely on older tests or benchmark runs, as changes made to the underlying platform in the interim may substantially impact results. EVALUATING AND COMPARING ORACLE DATABASE APPLIANCE PERFORMANCE Table of Contents Introduction and Executive Summary 1 Audience 2 Objective 2 Oracle Database Appliance Deployment Architecture 3 Oracle Database Appliance Configuration Templates 3 What is Swingbench? 4 Swingbench Download and Setup 4 Configuring Oracle Database Appliance for Testing 5 Configuring Oracle Database Appliance System for Testing 5 Benchmark Setup 5 Database Setup 6 Schema Setup 6 Workload Setup and Execution 10 Benchmark Results 11 Workload Performance 11 Database and Operating System Statistics 12 Average CPU Busy 12 REDO Writes 13 Transaction Rate 13 Important Considerations for Performing the Benchmark 14 Conclusion 15 Appendix A Swingbench configuration files 16 Appendix B The loadgen.pl file 19 Appendix C The generate_awr.sh script 24 References 25 EVALUATING AND COMPARING ORACLE DATABASE APPLIANCE PERFORMANCE Introduction and Executive Summary Oracle Database Appliance X8-2-HA is a highly available Oracle database system.
    [Show full text]
  • 60-539: Emerging Non-Traditional Database Systems (Data Warehousing and Mining)
    60-539: Emerging non-traditional database systems (Data Warehousing and Mining): sdb1 warehouse sdb2 sdb3 Dr. C.I. Ezeife School of Computer Science, University of Windsor, Canada. Email: [email protected] 60-539 Dr. C.I. Ezeife @c 2021 1 PART I (DATABASE MANAGEMENT SYSTEM OVERVIEW) DBMS (PART 1) OVERVIEW Components of a DBMS DBMS Data model Data Definition and Manipulation Language File Organization Techniques Query Optimization and Evaluation Facility Database Design and Tuning Transaction Processing Concurrency Control Database Security and Integrity Issues 60-539 Dr. C.I. Ezeife © 2021 2 DBMS OVERVIEW(What are?) What is a database? : It is a collection of data, typically describing the activities of one or more related organizations, e.g., a University, an airline reservation or a banking database. What is a DBMS?: A DBMS is a set of software for creating, querying, managing and keeping databases. Examples of DBMS’s are DB2, Informix, Sybase, Oracle, MYSQL, Microsoft Access (relational). Alternative to Databases: Storing all data for university, airline and banking information in separate files and writing separate program for each data file. What is a Data Warehouse?:A subject-oriented, historical, non- volatile database integrating a number of data sources. What is Data Mining?:Is data analysis for finding interesting trends or patterns in large dataset to guide decisions about future activities. 60-539 Dr. C.I. Ezeife © 2021 3 DBMS OVERVIEW(Evolution of Information Technology) 1960s and earlier: Primitive file processing: data are collected in files and manipulated with programs in Cobol and other languages. Disadv.: any change in storage structure of data requires changing the program (i.e., no logical or physical data independence).
    [Show full text]
  • A Survey of Ledger Technology-Based Databases
    future internet Article A Survey of Ledger Technology-Based Databases Dénes László Fekete 1 and Attila Kiss 1,2,* 1 Department of Information Systems, ELTE Eötvös Loránd University, 1117 Budapest, Hungary; [email protected] 2 Department of Informatics, J. Selye University, 945 01 Komárno, Slovakia * Correspondence: [email protected] Abstract: The spread of crypto-currencies globally has led to blockchain technology receiving greater attention in recent times. This paper focuses more broadly on the uses of ledger databases as a traditional database manager. Ledger databases will be examined within the parameters of two categories. The first of these are Centralized Ledger Databases (CLD)-based Centralised Ledger Technology (CLT), of which LedgerDB will be discussed. The second of these are Permissioned Blockchain Technology-based Decentralised Ledger Technology (DLT) where Hyperledger Fabric, FalconDB, BlockchainDB, ChainifyDB, BigchainDB, and Blockchain Relational Database will be examined. The strengths and weaknesses of the reviewed technologies will be discussed, alongside a comparison of the mentioned technologies. Keywords: ledger technology; permissioned blockchain; centralised ledger database 1. Introduction Citation: Fekete, D.L.; Kiss, A. A In recent years, we have been witness to the rise of multiple new ledger technologies. Survey of Ledger Technology-Based Since Satosi Nakamoto wrote the Bitcoin white paper [1] in 2009, many different systems Databases. Future Internet 2021, 13, that endeavour to provide decentralization and trustless collaboration have been developed 197. https://doi.org/10.3390/ based on blockchain technology. Of late, a great amount of research and academia has been fi13080197 focused on the further development and reformation of the basis of blockchain in an attempt to solve the issues that continue to arise [2–4].
    [Show full text]
  • CSE 344 Final Examination Name
    CSE 344 Final Examination Monday, December 10, 2018, 2:30-4:20 Name: Question Points Score 1 30 2 20 3 20 4 30 5 20 6 35 7 45 Total: 200 • This exam is CLOSED book and CLOSED devices. • You are allowed TWO, HAND-WRITTEN letter-size sheets with notes (both sides). • You have 110 minutes; • Answer the easy questions before you spend too much time on the more difficult ones. • Good luck! 1 CSE 344 Final Dec 10, 2018, 2:30-4:20 1 Relational Data Model 1. (30 points) You are now a Climate Scientist, and run simulations to predict future changes in the climate. Your job is to improve the simulator in order to improve the climate predictions. Every time you have a new version of the simulator, you run it, and it will generate a dataset where, for each year, it records the predicted temperature at each location of your simulated area. You analyze the results, improve the simulator, and repeat. As a good scientist, you keep the results of all simulations in a relation with the following schema: Sim(version, year, loc, temp) The data types of Version and year are integers; temp is a real number; and loc is a text. (a) (5 points) Write a SQL CREATE TABLE statement to create the Sim relation. Identify the key in Sim, and create the key in your SQL statement. Choose float and for the real number type and text for the text type. Page 2 CSE 344 Final Dec 10, 2018, 2:30-4:20 Sim(version, year, loc, temp) (b) (5 points) Simulation version 13 is wrong, due to a bug in the simulator.
    [Show full text]
  • Extending ACID Semantics to the File System
    Extending ACID Semantics to the File System CHARLES P. WRIGHT IBM T. J. Watson Research Center RICHARD SPILLANE, GOPALAN SIVATHANU, and EREZ ZADOK Stony Brook University An organization’s data is often its most valuable asset, but today’s file systems provide few facilities to ensure its safety. Databases, on the other hand, have long provided transactions. Transactions are useful because they provide atomicity, consistency, isolation, and durability (ACID). Many applications could make use of these semantics, but databases have a wide variety of non-standard interfaces. For example, applications like mail servers currently perform elaborate error handling to ensure atomicity and consistency, because it is easier than using a DBMS. A transaction-oriented programming model eliminates complex error-handling code, because failed operations can simply be aborted without side effects. We have designed a file system that exports ACID transactions to user-level applications, while preserving the ubiquitous and convenient POSIX interface. In our prototype ACID file system, called Amino, updated applications can protect arbitrary sequences of system calls within a transaction. Unmodified applications operate without any changes, but each system call is transaction protected. We also built a recoverable memory library with support for nested transactions to allow applications to keep their in-memory data structures consistent with the file system. Our performance evaluation shows that ACID semantics can be added to applications with acceptable overheads.
    [Show full text]
  • Chapter 17: Parallel Databases
    ChapterChapter 18:18: ParallelParallel DatabasesDatabases IntroductionIntroduction • Parallel machines are becoming quite common and affordable – Prices of microprocessors, memory and disks have dropped sharply – Recent desktop computers feature multiple processors and this trend is projected to accelerate • Databases are growing increasingly large – large volumes of transaction data are collected and stored for later analysis. – multimedia objects like images are increasingly stored in databases • Large-scale parallel database systems increasingly used for: – storing large volumes of data – processing time-consuming decision-support queries – providing high throughput for transaction processing ParallelismParallelism inin DatabasesDatabases • Data can be partitioned across multiple disks for parallel I/O. • Individual relational operations (e.g., sort, join, aggregation) can be executed in parallel – data can be partitioned and each processor can work independently on its own partition. • Queries are expressed in high level language (SQL, translated to relational algebra) – makes parallelization easier. • Different queries can be run in parallel with each other. Concurrency control takes care of conflicts. • Thus, databases naturally lend themselves to parallelism. I/OI/O ParallelismParallelism • Reduce the time required to retrieve relations from disk by partitioning • The relations on multiple disks. • Horizontal partitioning – tuples of a relation are divided among many disks such that each tuple resides on one disk. • Partitioning techniques (number of disks = n): Round-robin: Send the I th tuple inserted in the relation to disk i mod n. Hash partitioning: – Choose one or more attributes as the partitioning attributes. – Choose hash function h with range 0…n - 1 – Let i denote result of hash function h applied to the partitioning attribute value of a tuple.
    [Show full text]
  • Performance Benchmark Postgresql / Mongodb Performance Benchmark Postgresql / Mongodb
    PERFORMANCE BENCHMARK POSTGRESQL / MONGODB // CONTENTS // ABOUT THIS BENCHMARK 3 Introduction 3 OnGres Ethics Policy 4 Authors 4 // EXECUTIVE SUMMARY. BENCHMARKS KEY FINDINGS 5 Transactions benchmark 5 OLTP Benchmark 6 OLAP Benchmark 6 // METHODOLOGY AND BENCHMARKS 7 Introduction and objectives 7 Benchmarks performed 7 About the technologies involved 8 Automated infrastructure 9 // TRANSACTIONS BENCHMARK 12 Benchmark description 12 MongoDB transaction limitations 14 Discussion on transaction isolation levels 14 Benchmark results 17 // OLTP BENCHMARK 25 Benchmark description 25 Initial considerations 26 Benchmark results 29 // OLAP BENCHMARK 38 Benchmark description 38 Benchmark results 45 2/46 // ABOUT THIS BENCHMARK Introduction Benchmarking is hard. Benchmarking databases, harder. Benchmarking databases that follow different approaches (relational vs document) is even harder. There are many reasons why this is true, widely discussed in the industry. Notwithstanding all these difficulties, the market demands these kinds of benchmarks. Despite the different data models that MongoDB and PostgreSQL expose, many developers and organizations face a challenge when choosing between the platforms. And while they can be compared on many fronts, performance is undoubtedly one of the main differentiators — arguably the main one. How then do you leverage an informative benchmark so that decisions can be made about the choice of a given technology, while at the same time presenting a fair arena in which the technologies compete in an apples-to-apples scenario? To fulfill these goals, this benchmark has been executed based on the following criteria: • Transparency and reproducibility. The framework that has been programmed and used to run the benchmarks is fully automated and is published as open source.
    [Show full text]
  • Hyperledger Blockchain Performance Metrics
    Hyperledger Blockchain Performance Metrics ABOUT HYPERLEDGER About This Paper Hyperledger This is the first white paper from the Hyperledger Performance and Scale Working (http://hyperledger.org/) is Group (https://wiki.hyperledger.org/groups/pswg/performance-and-scale-wg). an open source effort created The purpose of this document is to define the basic terms and key metrics to advance cross-industry that should be used to evaluate the performance of a blockchain and then blockchain technologies. communicate the results. This paper also serves as a platform-agnostic resource It is a global collaboration for technical blockchain developers and managers interested in using industry- including leaders in banking, standard nomenclature. finance, Internet of Things, While we appreciate that there may be discrete definitions for the terms “blockchain” manufacturing, supply chains, and “Distributed Ledger Technology (DLT),” for the purposes of this paper we will and technology. The Linux treat both terms synonymously and use the term “blockchain” throughout. Foundation, the nonprofit organization enabling mass This document provides some guidance on selecting and evaluating workloads. innovation through open source, We expect that refinements to these definitions and new blockchain-specific hosts Hyperledger. The Linux metrics will warrant future revisions of this document. Foundation also enables a In future documents, the Working Group plans to discuss workloads in greater worldwide developer community detail and to offer additional guidance on standard procedures and emerging to work together and share best practices for evaluating blockchain performance. To provide your feedback ideas, infrastructure, and code. and stay informed about subsequent versions of this paper, please join us in the Performance and Scale Working Group (https://wiki.hyperledger.org/groups/ pswg/performance-and-scale-wg).
    [Show full text]
  • Administration Guide Version 10.0
    Oracle® GoldenGate for Mainframe Administration Guide Version 10.0 October 2009 Administration Guide, version 10.0 Copyright © 1995, 2009 Oracle and/or its affiliates. All rights reserved. This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is prohibited. The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing. If this software or related documentation is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, the following notice is applicable: U.S. GOVERNMENT RIGHTS Programs, software, databases, and related documentation and technical data delivered to U.S. Government customers are "commercial computer software" or "commercial technical data" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, the use, duplication, disclosure, modification, and adaptation shall be subject to the restrictions and license terms set forth in the applicable Government contract, and, to the extent applicable by the terms of the Government contract, the additional rights set forth in FAR 52.227-19, Commercial Computer Software License (December 2007). Oracle USA, Inc., 500 Oracle Parkway, Redwood City, CA 94065.
    [Show full text]