Query Analysis on a Distributed Graph Database Student: Bc

Total Page:16

File Type:pdf, Size:1020Kb

Query Analysis on a Distributed Graph Database Student: Bc ASSIGNMENT OF MASTER’S THESIS Title: Query Analysis on a Distributed Graph Database Student: Bc. Lucie Svitáková Supervisor: Ing. Michal Valenta, Ph.D. Study Programme: Informatics Study Branch: Web and Software Engineering Department: Department of Software Engineering Validity: Until the end of summer semester 2019/20 Instructions 1. Research current practicies of data storage in distributed graph databases. 2. Choose an existing database system for your implementation. 3. Analyse how to extract necessary information for further query analysis and implement that extraction. 4. Create a suitable method to store the data extracted in step 3. 5. Propose a method for a more efficient redistribution of the data. References Will be provided by the supervisor. Ing. Michal Valenta, Ph.D. doc. RNDr. Ing. Marcel Jiřina, Ph.D. Head of Department Dean Prague October 5, 2018 Master’s thesis Query Analysis on a Distributed Graph Database Bc. et Bc. Lucie Svit´akov´a Department of Software Engineering Supervisor: Ing. Michal Valenta, Ph.D. January 8, 2019 Acknowledgements I would like to thank my supervisor Michal Valenta for his valuable suggestions and observations, my sister S´arkaSvit´akov´aforˇ spelling corrections, my friends Martin Horejˇsand Tom´aˇsNov´aˇcekfor their valuable comments and my family for their continual motivation and support. Declaration I hereby declare that the presented thesis is my own work and that I have cited all sources of information in accordance with the Guideline for adhering to ethical principles when elaborating an academic final thesis. I acknowledge that my thesis is subject to the rights and obligations stip- ulated by the Act No. 121/2000 Coll., the Copyright Act, as amended. In accordance with Article 46(6) of the Act, I hereby grant a nonexclusive au- thorization (license) to utilize this thesis, including any and all computer pro- grams incorporated therein or attached thereto and all corresponding docu- mentation (hereinafter collectively referred to as the “Work”), to any and all persons that wish to utilize the Work. Such persons are entitled to use the Work in any way (including for-profit purposes) that does not detract from its value. This authorization is not limited in terms of time, location and quan- tity. However, all persons that makes use of the above license shall be obliged to grant a license at least in the same scope as defined above with respect to each and every work that is created (wholly or in part) based on the Work, by modifying the Work, by combining the Work with another work, by including the Work in a collection of works or by adapting the Work (including trans- lation), and at the same time make available the source code of such work at least in a way and scope that are comparable to the way and scope in which the source code of the Work is made available. In Prague on January 8, 2019 . Czech Technical University in Prague Faculty of Information Technology c 2019 Lucie Svit´akov´a.All rights reserved. This thesis is school work as defined by Copyright Act of the Czech Republic. It has been submitted at Czech Technical University in Prague, Faculty of Information Technology. The thesis is protected by the Copyright Act and its usage without author’s permission is prohibited (with exceptions defined by the Copyright Act). Citation of this thesis Svit´akov´a,Lucie. Query Analysis on a Distributed Graph Database. Mas- ter’s thesis. Czech Technical University in Prague, Faculty of Information Technology, 2019. Abstrakt Pˇrestoˇzednes existuje nˇekolik produkt˚ugrafov´ych datab´az´ıurˇcen´ych pro dis- tribuovan´aprostˇred´ı, jsou v nich obvykle data distribuov´anana jednotliv´e fyzick´euzly n´ahodnˇe,bez pozdˇejˇs´ırevize zat´ıˇzen´ıa moˇzn´ereorganizace dat. Tato pr´aceanalyzuje souˇcasn´epraktiky ukl´ad´an´ıdat v distribuovan´ych grafo- v´ych datab´az´ıch. Podle t´etoanal´yzypr´acenavrhuje a implementuje nov´y modul pro grafov´yv´ypoˇcetn´ıframework TinkerPop, kter´yloguje provoz vy- generovan´ydotazy uˇzivatel˚u. Je rovnˇeˇznaimplementov´anasamostatn´aap- likace pro ukl´ad´an´ıtakov´ychto logovan´ych dat do datab´azeJanusGraph. Tento program rovnˇeˇzspouˇst´ıredistribuˇcn´ıalgoritmus navrhuj´ıc´ıefektivnˇejˇs´ıuloˇzen´ı dat v clusteru. Existuj´ıc´ı algoritmus od Vaquera a spol., kompatibiln´ı se syst´ememPregel, je aplikov´ans podstatn´ymirozˇs´ıˇren´ımi.V´ysledkem je n´avrh reorganizace dat s 70–80% zlepˇsen´ımkomunikace mezi uzly clusteru. Takov´yto v´ysledekje porovnateln´ys jinou zn´amoumetodou Ja-be-Ja, kter´avˇsakvyˇza- duje v´yraznˇevyˇsˇs´ıv´ypoˇcetn´ıprostˇredky. Na druhou stranu metoda v t´eto pr´acizav´ad´ımalou disbalanci na uzlech clusteru. Nakonec tato pr´aceuv´ad´ı doporuˇcen´ıpro moˇzn´abudouc´ırozˇs´ıˇren´ıa vylepˇsen´ı. Kl´ıˇcov´a slova grafov´edatab´aze,rozdˇelen´ıgrafu, redistribuce, NoSQL, Ja- nusGraph, TinkerPop, Pregel vii Abstract Although products of graph databases intended for distributed environments already exist, the data are usually distributed randomly on particular physical hosts without any subsequent load examination and possible data reorganiza- tion. This thesis analyses the current practices of data storage in distributed graph databases. According to this analysis, it designs and implements a new module of the general graph computing framework TinkerPop for logging the traffic generated with user queries. A separate implementation for storage of such logged data in the JanusGraph database is provided. It also executes a redistribution algorithm proposing a more efficient distribution of data. An existing Pregel-compliant algorithm of Vaquero et al. with substantial en- hancements is applied. Results with 70–80% improvement of communication among physical hosts of the cluster are obtained, which is comparable to an- other well-known method Ja-be-Ja with much higher computational demands. On the other hand, the method in this thesis imposes a necessary slight im- balance of the cluster nodes. Finally, this thesis introduces suggestions for future extensions and enhancements. Keywords graph databases, graph partitioning, redistribution, NoSQL, Ja- nusGraph, TinkerPop, Pregel viii Contents Introduction 1 1 Theory of Graph Databases 3 1.1 Databases . 3 1.2 Graph Databases . 7 2 Graph Partitioning 13 2.1 Algorithms . 14 2.2 Pregel-Compliant Algorithms . 19 3 Technology 29 3.1 Distributed Graph Databases . 29 3.2 Apache TinkerPop . 37 4 Analysis and Design 43 4.1 Requirements Analysis . 44 4.2 Extracted Data Storage . 45 4.3 Architecture . 49 4.4 Graph Redistribution Manager . 50 4.5 Algorithm Design . 53 5 Implementation 57 5.1 Graph Database Selection . 57 5.2 Query Data Extraction . 58 5.3 Graph Redistribution Manager . 65 6 Evaluation 71 6.1 Datasets . 71 6.2 Evaluation Methods . 77 6.3 Settings . 78 ix 6.4 Results . 79 6.5 Summary . 86 7 Enhancements 89 7.1 Query Data Extraction . 89 7.2 Graph Redistribution Manager . 89 7.3 Algorithm . 90 7.4 Experiments . 91 Conclusion 93 Bibliography 95 A README 105 A.1 Processed Result Logging . 105 A.2 Graph Redistribution Manager . 111 B Evaluation Results 117 B.1 Twitter Dataset . 117 B.2 Road Network Dataset . 126 C Contents of CD 135 x List of Figures 1.1 DBMS popularity changes since 2013 . 7 1.2 Example of a property graph . 9 1.3 Example of an RDF graph . 9 2.1 The multilevel approach to graph partitioning . 17 2.2 Maximum Value Example in Pregel Model. 20 2.3 Label propagation without and with oscillation prevented. 22 2.4 Examples of two potential color exchanges . 26 3.1 Local Server Mode of JanusGraph with storage backend . 31 3.2 Remote Server Mode of JanusGraph with storage backend . 31 3.3 Remote Server Mode with Gremlin Server of JanusGraph with stor- age backend . 31 3.4 Embedded Mode of JanusGraph with storage backend . 32 3.5 JanusGraph data layout . 33 3.6 OrientDB APIs . 34 3.7 Apache TinkerPop framework . 38 3.8 OLAP processing in Apache TinkerPop . 41 4.1 Architecture design . 51 4.2 Activity diagram of the main process in the Graph Redistribution Manager . 52 5.1 Query Data Extraction in different connection modes. 59 5.2 Structure of the Processed Result Logging module . 61 5.3 Structure of the Graph Redistribution Manager project . 66 6.1 Degree distribution of the Twitter dataset . 74 6.2 Degree distribution of the Road network dataset . 76 6.3 Development of cross-node communication improvement with var- ious cooling factors . 80 xi 6.4 Effect of cooling factor on cross-node communication and number of iterations. 81 6.5 Effect of imbalance factor on cross-node communication, number of iterations and capacity usage. 82 6.6 Effect of adoption factor on cross-node communication and number of iterations. 84 6.7 Effect of adoption factor on cross-node communication and number of iterations. 85 B.1 Development of cross-node communication improvement with var- ious cooling factors . 117 B.2 Twitter dataset: effects of cooling factor on improvement, number of iterations and capacity usage . 118 B.3 Twitter dataset: effects of imbalance factor on improvement, num- ber of iterations and capacity usage . 120 B.4 Twitter dataset: effects of adoption factor on improvement, num- ber of iterations and capacity usage . 122 B.5 Twitter dataset: effects of number of nodes in a cluster on improve- ment, number of iterations and capacity usage . 124 B.6 Road network dataset: effects of cooling factor on improvement, number of iterations and capacity usage . 126 B.7 Road network dataset: effects of imbalance factor on improvement, number of iterations and capacity usage . 128 B.8 Road network dataset: effects of adoption factor on improvement, number of iterations and capacity usage . 130 B.9 Road network dataset: effects of number of nodes in a cluster on improvement, number of iterations and capacity usage .
Recommended publications
  • Powerdesigner 16.6 Data Modeling
    SAP® PowerDesigner® Document Version: 16.6 – 2016-02-22 Data Modeling Content 1 Building Data Models ...........................................................8 1.1 Getting Started with Data Modeling...................................................8 Conceptual Data Models........................................................8 Logical Data Models...........................................................9 Physical Data Models..........................................................9 Creating a Data Model.........................................................10 Customizing your Modeling Environment........................................... 15 1.2 Conceptual and Logical Diagrams...................................................26 Supported CDM/LDM Notations.................................................27 Conceptual Diagrams.........................................................31 Logical Diagrams............................................................43 Data Items (CDM)............................................................47 Entities (CDM/LDM)..........................................................49 Attributes (CDM/LDM)........................................................55 Identifiers (CDM/LDM)........................................................58 Relationships (CDM/LDM)..................................................... 59 Associations and Association Links (CDM)..........................................70 Inheritances (CDM/LDM)......................................................77 1.3 Physical Diagrams..............................................................82
    [Show full text]
  • Demonstrate an Understanding of Computer Database Management Systems 114049
    Demonstrate an understanding of Computer Database Management Systems 114049 PURPOSE OF THE UNIT STANDARD This unit standard is intended: To provide a fundamental knowledge of the areas covered For those working in, or entering the workplace in the area of Information Technology As additional knowledge for those wanting to understand the areas covered People credited with this unit standard are able to: Describe data management issues and how it is addressed by a DBMS. Describe commonly implemented features of commercial computer DBMS`s Describe different type of DBMS`s Review DBMS end-user tools The performance of all elements is to a standard that allows for further learning in this area. LEARNING ASSUMED TO BE IN PLACE AND RECOGNITION OF PRIOR LEARNING The credit value of this unit is based on a person having prior knowledge and skills to: Demonstrate an understanding of fundamental mathematics (at least NQF level 3). Demonstrate PC competency skills (End-User Computing unit Standards, at least up to NQF level 3) NC: IT: SYSTEMS DEVELOPMENT AUTHOR: LEARNER MANUAL REL DATE: 27/01/2020 REV DATE: 01/01/2023 DOC REF: 48872 LM MOD 5 V-1 PAGE 50 INDEX Competence Requirements Page Unit Standard 114049 alignment index Here you will find the different outcomes explained which you need to be 52 proved competent in, in order to complete the Unit Standard 114049. Unit Standard 114049 54 Describe data management issues 57 Commonly implemented features of commercial database management systems 63 Different types of DBMS’s 67 Review DBMS end-user tools 73 Self-assessment Once you have completed all the questions after being facilitated, you need to check the progress you have made.
    [Show full text]
  • An Evaluation of the Design Space for Scalable Data Loading Into Graph Databases
    Otto-von-Guericke-Universit¨at Magdeburg Faculty of Computer Science Databases D B and Software S E Engineering Master's Thesis An Evaluation of the Design Space for Scalable Data Loading into Graph Databases Author: Jingyi Ma February 23, 2018 Advisors: M.Sc. Gabriel Campero Durand Data and Knowledge Engineering Group Prof. Dr. rer. nat. habil. Gunter Saake Data and Knowledge Engineering Group Ma, Jingyi: An Evaluation of the Design Space for Scalable Data Loading into Graph Databases Master's Thesis, Otto-von-Guericke-Universit¨at Magdeburg, 2018. Abstract In recent years, computational network science has become an active area. It offers a wealth of tools to help us gain insight into the interconnected systems around us. Graph databases are non-relational database systems which have been developed to support such network-oriented workloads. Graph databases build a data model based on graph abstractions (i.e. nodes/vertexes and edges) and can use different optimizations to speed up the basic graph processing tasks, such as traversals. In spite of such benefits, some tasks remain challenging in graph databases, such as the task of loading the complete dataset. The loading process has been considered to be a performance bottleneck, specifically a scalability bottleneck, and application developers need to conduct performance tuning to improve it. In this study, we study some optimization alternatives that developers have for load data into a graph databases. With this goal, we propose simple microbenchmarks of application-level load optimizations and evaluate these optimizations experimentally for loading real world graph datasets. We run our tests using JanusGraphLab, a JanusGraph prototype.
    [Show full text]
  • OLAP, Relational, and Multidimensional Database Systems
    OLAP, Relational, and Multidimensional Database Systems George Colliat Arbor Software Corporation 1325 Chcseapeakc Terrace, Sunnyvale, CA 94089 Introduction Many people ask about the difference between implementing On-Line Analytical Processing (OLAP) with a Relational Database Management System (ROLAP) versus a Mutidimensional Database (MDD). In this paper, we will show that an MDD provides significant advantages over a ROLAP such as several orders of magnitude faster data retrieval, several orders of magnitude faster calculation, much less disk space, and le,~ programming effort. Characteristics of On-Line Analytical Processing OLAP software enables analysts, managers, and executives to gain insight into an enterprise performance through fast interactive access to a wide variety of views of data organized to reflect the multidimensional aspect of the enterprise data. An OLAP service must meet the following fundamental requirements: • The base level of data is summary data (e.g., total sales of a product in a region in a given period) • Historical, current, and projected data • Aggregation of data and the ability to navigate interactively to various level of aggregation (drill down) • Derived data which is computed from input data (performance rados, variance Actual/Budget,...) • Multidimensional views of the data (sales per product, per region, per channel, per period,..) • Ad hoc fast interactive analysis (response in seconds) • Medium to large data sets ( 1 to 500 Gigabytes) • Frequently changing business model (Weekly) As an iUustration we will use the six dimension business model of a hypothetical beverage company. Each dimension is composed of a hierarchy of members: for instance the time dimension has months (January, February,..) as the leaf members of the hierarchy, Quarters (Quarter 1, Quarter 2,..) as the next level members, and Year as the top member of the hierarchy.
    [Show full text]
  • L21: Data Models & Relational Algebra
    L21: Data models & Relational Algebra CS3200 Database design (fa18 s2) https://northeastern-datalab.github.io/cs3200/ Version 11/26/2018 1 Announcements! • Exam 2 comments - Based on feedback: Separate "select all" and negative points • We may have "select all" again: but minimum points is 0 • We may have negative points again: but for MCQ only (select one single answer) • Please don't yet submit HW7 - We are adopting Gradescope to handle your submissions too! • Today - Exam 2 take-aways - Data models - Relational Algebra 2 Schedule 3 From 1 = Q3 to 11 = Q13 (thus add 2 to the number) 4 5 Grading from the "other side" 6 Q6 7 Q6 8 Q12 Answer: only the first! Also see post on Piazza: https://piazza.com/class/jj55fszwtpj7fx?cid=135 9 10 L21: A short history of data models Based on article "What goes around comes around", Hellerstein, Stonebraker, 2005. Several slides courtesy of Dan Suciu CS3200 Database design (fa18 s2) https://northeastern-datalab.github.io/cs3200/ Version 11/26/2018 11 Hierarchical data Source: https://en.wikipedia.org/wiki/Nested_set_model 12 Hierarchies are powerful, but can be misleading... 13 “Data Model” • Applications need to model real-world data - Typically includes entities and relationships between them - Entities: e.g. students, courses, products, clients - Relationships: e.g. course registrations, product purchases • A data model enables a user to define the data using high-level constructs without worrying about many low-level details of how data will be stored on disk 14 Levels of Abstraction schema as seen
    [Show full text]
  • On Active Deductive Databases
    On Active Deductive Databases The Statelog Approach Georg Lausen Bertram Ludascher Wolfgang May Institut fur Informatik Universitat Freiburg Germany flausenludaeschmayginfo rmat ikun ifr eibur gde Abstract After briey reviewing the basic notions and terminology of active rules and relating them to pro duction rules and deductive rules resp ectively we survey a number of formal approaches to active rules Subsequentlywe present our own stateoriented logical approach to ac tive rules which combines the declarative semantics of deductive rules with the p ossibility to dene up dates in the style of pro duction rules and active rules The resulting language Statelog is surprisingly simple yet captures many features of active rules including comp osite eventde tection and dierent coupling mo des Thus it can b e used for the formal analysis of rule prop erties like termination and expressivepo wer Finally weshowhow nested transactions can b e mo deled in Statelog b oth from the op erational and the mo deltheoretic p ersp ective Intro duction Motivated by the need for increased expressiveness and the advent of new appli cations rules have b ecome very p opular as a paradigm in database programming since the late eighties Min Today there is a plethora of quite dierent appli cation areas and semantics for rules From a birdseye view deductive and active rules may b e regarded as two ends of a sp ectrum of database rule languages Deductive Rules Production Rules Active Rules higher level lower level stratied well RDL pro ARDL Ariel Starburst Postgres
    [Show full text]
  • Data-Centric Graphical User Interface of the ATLAS Event Index Service
    EPJ Web of Conferences 245, 04036 (2020) https://doi.org/10.1051/epjconf/202024504036 CHEP 2019 Data-centric Graphical User Interface of the ATLAS Event Index Service Julius Hrivnᡠcˇ1,∗, Evgeny Alexandrov2, Igor Alexandrov2, Zbigniew Baranowski3, Dario Barberis4, Gancho Dimitrov3, Alvaro Fernandez Casani5, Elizabeth Gallas6, Carlos Gar- cía Montoro5, Santiago Gonzalez de la Hoz5, Andrei Kazymov2, Mikhail Mineev2, Fedor Prokoshin2, Grigori Rybkin1, Javier Sanchez5, Jose Salt5, Miguel Villaplana Perez7 1Université Paris-Saclay, CNRS/IN2P3, IJCLab, 91405 Orsay, France 2Joint Institute for Nuclear Research, 6 Joliot-Curie St., Dubna, Moscow Region, 141980, Russia 3CERN, 1211 Geneva 23, Switzerland 4Physics Department of the University of Genoa and INFN Sezione di Genova, Via Dodecaneso 33, I-16146 Genova, Italy 5Instituto de Fisica Corpuscular (IFIC), Centro Mixto Universidad de Valencia - CSIC, Valencia, Spain 6Department of Physics, Oxford University, Oxford, United Kingdom 7Department of Physics, University of Alberta, Edmonton AB, Canada Abstract. The Event Index service of the ATLAS experiment at the LHC keeps references to all real and simulated events. Hadoop Map files and HBase tables are used to store the Event Index data, a subset of data is also stored in the Oracle database. Several user interfaces are currently used to access and search the data, from a simple command line interface, through a programmable API, to sophisticated graphical web services. It provides a dynamic graph-like overview of all available data (and data collections). Data are shown together with their relations, like paternity or overlaps. Each data entity then gives users a set of actions available for the referenced data. Some actions are provided directly by the Event Index system, others are just interfaces to different ATLAS services.
    [Show full text]
  • Multidimensional Network Analysis
    Universita` degli Studi di Pisa Dipartimento di Informatica Dottorato di Ricerca in Informatica Ph.D. Thesis Multidimensional Network Analysis Michele Coscia Supervisor Supervisor Fosca Giannotti Dino Pedreschi May 9, 2012 Abstract This thesis is focused on the study of multidimensional networks. A multidimensional network is a network in which among the nodes there may be multiple different qualitative and quantitative relations. Traditionally, complex network analysis has focused on networks with only one kind of relation. Even with this constraint, monodimensional networks posed many analytic challenges, being representations of ubiquitous complex systems in nature. However, it is a matter of common experience that the constraint of considering only one single relation at a time limits the set of real world phenomena that can be represented with complex networks. When multiple different relations act at the same time, traditional complex network analysis cannot provide suitable an- alytic tools. To provide the suitable tools for this scenario is exactly the aim of this thesis: the creation and study of a Multidimensional Network Analysis, to extend the toolbox of complex network analysis and grasp the complexity of real world phenomena. The urgency and need for a multidimensional network analysis is here presented, along with an empirical proof of the ubiquity of this multifaceted reality in different complex networks, and some related works that in the last two years were proposed in this novel setting, yet to be systematically defined. Then, we tackle the foundations of the multidimensional setting at different levels, both by looking at the basic exten- sions of the known model and by developing novel algorithms and frameworks for well-understood and useful problems, such as community discovery (our main case study), temporal analysis, link prediction and more.
    [Show full text]
  • What Is Database? Types and Examples
    What is database? Types and Examples Visit our site for more information: www.examplanning.com Facebook Page: https://www.facebook.com/examplanning10/ Twitter: https://twitter.com/examplanning10 TABLE OF CONTENTS Sr. Description 1 What is database? 2 Different definitions of database 3 Growth of Database 4 Elements of Database 5 Components of database 6 Database System Environment 7 Types of Databas 8 Characteristics of database 9 Advantages of Database 10 Disadvantages of Database What is Database? A database is a collection of information or data which are organized in such a way that it can be easily accessed, managed and retrieved. Database is abbreviated ad DB. Different definitions of database. “a usually large collection of data organized especially for rapid search and retrieval (as by a computer) an online database” (merriam-webster) “a comprehensive collection of related data organized for convenient access, generally in a computer.” (dictionary) A database is an organized collection of data. (Wikipedia) What is data? It is used as both singular and plural form. It can be a quantity, symbol or character on which operations are performed. Data is information which are converted into digital form. Growth of Database Database was evolved in 1960's started with the hierarchical database. Relational database was invented by EF Codd in 1970s while object oriented database was invented in 1980s. In 1990s object oriented database rose with the growth of object oriented programming languages. Nowadays, databases with SQL and NoSQL are popular. Elements of Database Database elements are fields, rows, columns, tables. All these are building blocks of database.
    [Show full text]
  • Release Notes Date Published: 2021-03-25 Date Modified
    Cloudera Runtime 7.2.8 Release Notes Date published: 2021-03-25 Date modified: https://docs.cloudera.com/ Legal Notice © Cloudera Inc. 2021. All rights reserved. The documentation is and contains Cloudera proprietary information protected by copyright and other intellectual property rights. No license under copyright or any other intellectual property right is granted herein. Copyright information for Cloudera software may be found within the documentation accompanying each component in a particular release. Cloudera software includes software from various open source or other third party projects, and may be released under the Apache Software License 2.0 (“ASLv2”), the Affero General Public License version 3 (AGPLv3), or other license terms. Other software included may be released under the terms of alternative open source licenses. Please review the license and notice files accompanying the software for additional licensing information. Please visit the Cloudera software product page for more information on Cloudera software. For more information on Cloudera support services, please visit either the Support or Sales page. Feel free to contact us directly to discuss your specific needs. Cloudera reserves the right to change any products at any time, and without notice. Cloudera assumes no responsibility nor liability arising from the use of products, except as expressly agreed to in writing by Cloudera. Cloudera, Cloudera Altus, HUE, Impala, Cloudera Impala, and other Cloudera marks are registered or unregistered trademarks in the United States and other countries. All other trademarks are the property of their respective owners. Disclaimer: EXCEPT AS EXPRESSLY PROVIDED IN A WRITTEN AGREEMENT WITH CLOUDERA, CLOUDERA DOES NOT MAKE NOR GIVE ANY REPRESENTATION, WARRANTY, NOR COVENANT OF ANY KIND, WHETHER EXPRESS OR IMPLIED, IN CONNECTION WITH CLOUDERA TECHNOLOGY OR RELATED SUPPORT PROVIDED IN CONNECTION THEREWITH.
    [Show full text]
  • Oracle Rdb™ SQL Reference Manual Volume 4
    Oracle Rdb™ SQL Reference Manual Volume 4 Release 7.3.2.0 for HP OpenVMS Industry Standard 64 for Integrity Servers and OpenVMS Alpha operating systems August 2016 ® SQL Reference Manual, Volume 4 Release 7.3.2.0 for HP OpenVMS Industry Standard 64 for Integrity Servers and OpenVMS Alpha operating systems Copyright © 1987, 2016 Oracle Corporation. All rights reserved. Primary Author: Rdb Engineering and Documentation group This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is prohibited. The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing. If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, the following notice is applicable: U.S. GOVERNMENT RIGHTS Programs, software, databases, and related documentation and technical data delivered to U.S. Government customers are "commercial computer software" or "commercial technical data" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, the use, duplication, disclosure, modification, and adaptation shall be subject to the restrictions and license terms set forth in the applicable Government contract, and, to the extent applicable by the terms of the Government contract, the additional rights set forth in FAR 52.227-19, Commercial Computer Software License (December 2007).
    [Show full text]
  • Curriculum Vitae Michael Kay Phd FBCS
    Curriculum Vitae Michael Kay PhD FBCS Michael Kay is widely known in the XML world as an expert on the XML processing languages XSLT and XQuery. This reputation derives from his book XSLT Programmer's Reference, from the open-source Saxon XSLT and XQuery processor which he developed, and from his work within the W3C consortium as a member of the XSLT and XQuery working groups, and as editor of several of the specifications. His expertise in the field was recognized in 2005 when he was awarded the XML Cup, an annual international award made to individuals who have made outstanding contributions to the XML Community. Before moving into the XML field in 1998, Michael Kay specialized in database and information management technology. He designed a series of successful software products for the computer manufacturer ICL, and was one of the company's most senior engineers advising the company and it customers on technology strategy. Michael Kay is a member of the XML Guild, a loose federation of leading independent XML consultants who pool knowledge and experience to tackle the most challenging XML-related problems. Personal Details Name Michael Howard Kay Address 52 Matlock Road, Caversham Heights, Reading, Berks, UK. RG4 7BS Phone +44 118 948 3589 Email [email protected] Date of Birth 1951-10-11 Nationality British (born in Germany) Languages English, German, some French Career Summary Feb 2004 - present Director, Saxonica Limited Development and Marketing of commercial Saxon-SA product Ongoing development of open source Saxon-B product Independent
    [Show full text]