Applying ER Model

Total Page:16

File Type:pdf, Size:1020Kb

Applying ER Model Applying ER Model • Problem Domain: The Winter Olympics • Fans: • Get country medal counts • See who won an event • Find out how a favorite athelete is doing • Scheduler: • Set times for competitions • Assign competitors to heats, brackets, etc • Officials: • Record Results (winners, times, medals, etc). 2/11/20102/9/2010 1 Topics for Today • Data Models • Network Data Models • Hierarchical Data Models • Learning objectives • Explain the technical landscape from which the relational data model emerged. • When we present the relational model, articulate the most salient points that distinguish it from what preceded it. 2/11/2010 2 Network Data Model • Basics • Think of this as key/data pairs where the data can contain a reference to another key/data pair (either in the same database or a different database). • Data are represented by collections of records (i.e., a database of key/data pairs). • Relationships are expressed via links between records. • Navigational style of access: you find a record and then traverse links among records to find other information. • Analogies with ER modelingentity? set • Collectionsrelationship? of records == • Links == 2/11/2010 3 Network Model Details • Records composed of attributes • Attributes are single-valued. • A link links only two records. • Design tool for the network model is a data-structure diagram (very similar to an ER diagram) • Think of a data-structure diagram as an ER diagram where relationships are expressed as lines connecting entities (that is, the relationship isn’t depicted explicitly). 2/11/2010 4 Data Structure Diagram ER Diagram PID name Patient Schedule Appointment phone date Data structure Diagram scheduled PID name phone date 2/11/2010 5 Tertiary Relationships • Relationships that relate more than two records are represented by a separate record that simply consists of links. ER Diagram PID name Patient Schedule Appointment phone date name Doctor 2/11/2010 6 Link Records Data structure Diagram Patient Doctor Appointment PID name phone name date DoctorLink PatientLink ApptLink schedule record 2/11/2010 7 Navigating a Network Database Doctor Appointment name schedule schedule record Fri AM schedule record Fri AM Sat PM schedule record Fri AM Patient PID name phone PID name phone PID name phone 2/11/2010 8 Standardization of Network Model • CODASYL DBTG 1971: First database standard • Written in the late 60’s by the Database Task Group (DBTG). • Only allow many:one links (rep 1:1 as many:1) • Disallowed many:many links. • Implement many:many using dummy records. • Canonical many:one relationship is called a DBTG set. relationship owner A B member/child • Each set has only one owner and zero or more members. • Each member may occur only once in each set, but a member can belong to multiple different sets. 2/11/2010 9 More on CODASYL • Like in Berkeley DB, manipulate data using APIs embedded in a host language. • Record selection: DBP->get • Iteration: DBP->put • Iterate over members • Find owner (given a member): secondary index lookup • Implementation • Links are pointers (in-memory). • Links are what in Berkeley DB-like solution? Primary keys • Why only many:one (not many:many)? • Implementation artifact: How do you store many pointers in a single record? Create a linked list through the members. • Makes many:many difficult. 2/11/2010 10 Hierarchical Model • Similar to the network model except that records are organized as collections of trees rather than as arbitrary graphs. • Basics • Data are represented by collections of records. • Relationships are expressed via links between records. • Analogies • Collections of records: entity set • Links: relations 2/11/2010 11 Hierarchical Model: Details • Records composed of attributes. • All members of a set (e.g., DBTG set) are thought of as having a common root, and therefore form a tree. • Consider the data structure diagram for several records in our patient database (i.e., a diagram in the Network Model): Patients Appointments 001 David 6-1234 9@2/16/10 002 Peter 6-5678 10@2/17/10 003 Ryan 6-9012 11@2/18/10 12@2/19/10 2/11/2010 12 Rooted Trees • Database is a collection of trees. • Each tree is organized hierarchically. • The Hierarchy is: Patients • The data looks like: Appointments Insurance 001 David 6-1234 9@2/16/10 Harvard Pilgrim 002 Peter 6-5678 10@2/17/10 HUGHP Tufts 003 Ryan 6-9012 11@2/18/10 Tufts 12@2/19/10 2/11/2010 13 Yet another tree • This time the hierarchy is: Appointments • And the data looks like: Patients 9@2/16/10 001 David 6-1234 10@2/17/10 002 Peter 6-5678 11@2/18/10 003 Ryan 6-9012 12@2/19/10 003 Ryan 6-9012 2/11/2010 14 Implementation Issues • This model is inherently redundant. • Records are represented in multiple trees. • Redundancy results in: • Inconsistency • Inefficiency • Solution is the virtual record: • Virtual record is a reference to a record. • Rather than storing the entire record multiple times, maintain a pointer to the record. • Rather than navigating through a network as in the network model, we navigate through trees 2/11/2010 15 Real World Systems • IBM’s IMS (Information Management System) is the oldest and perhaps among the most widely used database systems. • IMS databases were some of the largest databases. • IMS first had to deal with concurrency, recovery, integrity, efficient query processing, etc. • IMS FastPath was an optimized version that kept the most active parts of the database in main memory (forerunner of modern main memory database systems). • Data manipulation language similar to that for Network; has differences, called DL/1. 2/11/2010 16.
Recommended publications
  • Development Team Principal Investigator Prof
    Paper No: 6 Remote Sensing & GIS Applications in Environmental Sciences Module: 22 Hierarchical, network and relational data Development Team Principal Investigator Prof. R.K. Kohli & Prof. V.K. Garg & Prof. Ashok Dhawan Co- Principal Investigator Central University of Punjab, Bathinda Dr. Puneeta Pandey Paper Coordinator Centre for Environmental Sciences and Technology Central University of Punjab, Bathinda Dr Dinesh Kumar, Content Writer Department of Environmental Sciences Central University of Jammu Content Reviewer Dr. Puneeta Pandey Central University of Punjab, Bathinda Anchor Institute Central University of Punjab 1 Remote Sensing & GIS Applications in Environmental Sciences Environmental Hierarchical, network and relational data Sciences Description of Module/- Subject Name Environmental Sciences Paper Name Remote Sensing & GIS Applications in Environmental Sciences Module Name/Title Hierarchical, network and relational data Module Id EVS/RSGIS-EVS/22 Pre-requisites Introductory knowledge of computers, basic mathematics and GIS Objectives To understand the concept of DBMS and database Models Keywords Fuzzy Logic, Decision Tree, Vegetation Indices, Hyper-spectral, Multispectral 2 Remote Sensing & GIS Applications in Environmental Sciences Environmental Hierarchical, network and relational data Sciences Module 29: Hierarchical, network and relational data 1. Learning Objective The objective of this module is to understand the concept of Database Management System (DBMS) and database Models. The present module explains in details about the relevant data models used in GIS platform along with their pros and cons. 2. Introduction GIS is a very powerful tool for a range of applications across the disciplines. The capacity of GIS tool to store, retrieve, analyze and model the information comes from its database management system (DBMS).
    [Show full text]
  • The Entity-Relationship Model — 'A3s
    .M414 INST. OCT 26 1976 WORKING PAPER ALFRED P. SLOAN SCHOOL OF MANAGEMENT THE ENTITY-RELATIONSHIP MODEL — 'A3S. iflST. l^CH. TOWARD A UNIFIED VIEW OF DATA* OCT 25 197S [ BY PETER PIN-SHAN CHEN WP 839-76 MARCH, 1976 MASSACHUSETTS INSTITUTE OF TECHNOLOGY 50 MEMORIAL DRIVE CAMBRIDGE, MASSACHUSETTS 02139 THE ENTITY-RELATIONSHIP MODEL — — A3S. iHST.TiiCH. TOWARD A UNIFIED VIEW OF DATA* OCT 25 1976 BY PETER PIN-SHAN CHEN WP 839-76 MARCH, 1976 * A revised version of this paper will appear in the ACM Transactions on Database Systems. M.I.T. LlBaARiE; OCT 2 6 1976 ' RECEIVE J I ABSTRACT: A data model, called the entity-relationship model^ls proposed. This model incorporates some of the Important semantic information in the real world. A special diagramatic technique is introduced as a tool for data base design. An example of data base design and description using the model and the diagramatic technique is given. Some implications on data integrity, information retrieval, and data manipulation are discussed. The entity-relationship model can be used as a basis for unification of different views of data: the network model, the relational model, and the entity set model. Semantic ambiguities in these models are analyzed. Possible ways to derive their views of data from the entity-relationship model are presented. KEY WORDS AND PHRASES: data base design, logical view of data, semantics of data, data models, entity-relationship model, relational model. Data Base Task Group, network model, entity set model, data definition and manipulation, data integrity and consistency. CR CATEGORIES: 3.50, 3.70, 4.33, 4.34.
    [Show full text]
  • DATA INTEGRATION GLOSSARY Data Integration Glossary
    glossary DATA INTEGRATION GLOSSARY Data Integration Glossary August 2001 U.S. Department of Transportation Federal Highway Administration Office of Asset Management NOTE FROM THE DIRECTOR Office of Asset Management, Infrastructure Core Business Unit, Federal Highway Administration his glossary is one of a series of documents on data integration being published by the Federal Highway Administration’s Office of Asset T Management. It defines in simple and understandable language a broad set of the terminologies used in information management, particularly in regard to database management and integration. Our objective is to provide convenient reference material that can be used by individuals involved in data integration activities. The glossary is limited to the more fundamental concepts and taxonomies applied in transportation database management and is not intended to be comprehensive. The importance of data integration in implementing Asset Management processes cannot be overstated. My office will continue to provide information and support to all transportation agencies as they work to integrate their data. Madeleine Bloom Director, Office of Asset Management DATA INTEGRATION GLOSSARY 3 LIST OF TERMS Aggregate data . .7 Dynamic segmentation . .12 Application . .7 Enterprise . .12 Application integration . .7 Enterprise application integration (EAI) . .12 Application program interface (API) . .7 Enterprise resource planning (ERP) . .12 Archive . .7 Entity . .12 Asset Management . .7 Entity relationship (ER) diagram . .12 Atomic data . .7 Executive information system (EIS) . .13 Authorization request . .7 Extensibility . .13 Bulk data transfer . .7 Geographic information system (GIS) . .13 Business process . .7 Graphical user interface (GUI) . .13 Business process reengineering (BPR) . .7 Information systems architecture . .13 Communications protocol . .7 Interoperable database . .13 Computer Aided Software Engineering (CASE) tools .
    [Show full text]
  • Describing Data Patterns. a General Deconstruction of Metadata Standards
    Describing Data Patterns A general deconstruction of metadata standards Dissertation in support of the degree of Doctor philosophiae (Dr. phil.) by Jakob Voß submitted at January 7th 2013 defended at May 31st 2013 at the Faculty of Philosophy I Humboldt-University Berlin Berlin School of Library and Information Science Reviewer: Prof. Dr. Stefan Gradman Prof. Dr. Felix Sasaki Prof. William Honig, PhD This document is licensed under the terms of the Creative Commons Attribution- ShareAlike license (CC-BY-SA). Feel free to reuse any parts of it as long as attribution is given to Jakob Voß and the result is licensed under CC-BY-SA as well. The full source code of this document, its variants and corrections are available at https://github.com/jakobib/phdthesis2013. Selected parts and additional content are made available at http://aboutdata.org A digital copy of this thesis (with same pagination but larger margins to fit A4 paper format) is archived at http://edoc.hu-berlin.de/. A printed version is published through CreateSpace and available by Amazon and selected distributors. ISBN-13: 978-1-4909-3186-9 ISBN-10: 1-4909-3186-4 Cover: the Arecibo message, sent into empty space in 1974 (image CC-BY-SA Arne Nordmann, http://commons.wikimedia.org/wiki/File:Arecibo_message.svg) CC-BY-SA by Widder (2010) Abstract Many methods, technologies, standards, and languages exist to structure and de- scribe data. The aim of this thesis is to find common features in these methods to determine how data is actually structured and described. Existing studies are limited to notions of data as recorded observations and facts, or they require given structures to build on, such as the concept of a record or the concept of a schema.
    [Show full text]
  • Week 4 Tutorial - Conceptual Design
    Week 4 Tutorial - Conceptual Design Tutorial 4 – Conceptual Design Remember we can classify an ERDs at one of three levels: 1. Conceptual ERD - an ERD drawing which makes no assumptions about the Data Model which the system will be implemented in 2. Logical ERD (also called a Data Structure Diagram) - an ERD created from the Conceptual ERD which is redrawn in terms of a selected Data Model eg. Relational. The major issues at this stage, if a Relational Data Model is selected, are: relationships are represented by FK's M:N relationships are replaced by 1:M relationships 3. Physical ERD (also represented by a schema file) - an ERD created from the Logical ERD which is redrawn in terms of a selected DBMS eg. Oracle. Most of the issues at this stage of translation will relate to Oracle data types and the Oracle create table/index syntax. Conceptual and Logical ERDs are drawn in terms of a portable ('idealised') data type, now you must translate these types to those supported by your selected DBMS software. For DBMS you may draw ERDs using any of the following (or any combination of the following): (i) using a general drawing package - hand drawn submissions are not acceptable, however you may make use of any standard drawing package to create your diagrams. This approach will support all three ERD levels, you will need to manually make the appropriate translations. (ii) Microsoft VISIO - VISIO is available in the on-campus labs and also available for you (as a Monash student) to install at home. If you wish to install VISIO at home please speak to your lecturer who will explain the local arrangements.
    [Show full text]
  • I Introduction Entity Relationship Model: Basic Concepts Mapping Ca
    PGDCA PAPER : DBMS LESSON NO. : 7 ENTITY RELATIONSHIP MODEL -I Introduction Entity Relationship Model: Basic Concepts Mapping Cardinalities Entity relationship Diagram Weak and Strong Entity sets Aggregation Summary Questionnaires 7.1 Introduction: The Entity-Relationship Model is a high-level conceptual data model developed by Chem in 1976 to facilitate database design. The E-R Model is shown diagrammatically using E-R diagrams which represents the elements of the conceptual model that show the meanings and the relationships between those elements independent of any particular DBMS and implementation details. Cardinality of a relationship between entities is calculated by measuring how many instances of one entity are related to a single instance of another. One of the main limitations of the E-R Model is that it cannot express relationship among relationships. So to represent these relationships among relationships. We combine the entity sets and their relationship to form a higher level entity set. This process of combining entity sets and their relationships to form a high entity set so as to represent relationships among relationships is called Aggregation. 7.2 Entity Relationship Model The entity-relationship model is based on the perception of a real world that consists of a set of basic objects called entities, and of relationships among these objects. It was developed to facilitate database design by allowing the specification of an enterprise schema which represents the overall logical structure of a database. The E-R Model is extremely useful in mapping the meanings and interactions of real-world enterprises into a conceptual schema. Entity – relationship model was originally proposed by Peter in 1976 as a way to unify the network and relational database views.
    [Show full text]
  • The Entity-Relationship Model : Toward a Unified View of Data
    LIBRARY OF THE MASSACHUSETTS INSTITUTE OF TECHNOLOGY .28 1414 MAY 18 1977 Ji/BRARie*. Center for Information Systems Research Massachusetts Institute of Technology Alfred P Sloan School of Management 50 Memorial Drive Cambridge, Massachusetts, 02139 617 253-1000 THE ENTITY-RELATIONSHIP MODEL: TOWARD A UNIFIED VIEW OF DATA By Peter Pin-Shan Chen CISR No. 30 WP 913-77 March 1977 .(Vvsi4 M.I.T. LIBRARIES MAY 1 8 1977 RE'JtlvLiJ The Entity-Relationship Model—Toward a Unified View of Data PETER PIN-SHAN CHEN Massachusetts Institute of Technology A data model, called the entity-relationship model, is proposed. This model incorporates some of the important semantic information about the real world. A special diagrammatic technique is introduced as a tool for database design. An example of database design and description using the model and the diagrammatic techniciuc is given Some implications for data integrity, infor- mation retrieval, and data manipulation are discussed. The entity-relationship model can be used as a basis for unification of dilTcrent views of data: the network model, the relational model, and the entity set model. Semantic ambiguities in lhe.se models are analyzed. Po.ssible ways to derive their views of data from the entity-relationship model are presented. Key Words and Phrases: database design, logical view of data, semantics of data, data models, entity-relationship model, relational model. Data Base Task Group, network model, entity set model, data definition and manipulation, data integrity atid consistency CR Categories: S.oO, 3.70, 4.33, 4.34 1. INTRODUCTION The logical view of data has been an important i.s.siie in recent years.
    [Show full text]
  • Molecular Objects, Abstract Data Types, and Data Models: a Framework ’
    Molecular Objects, Abstract Data Types, and Data Models: A Framework ’ D.S. Batoty Alejandro P. Buchmann Department of Computer Sciencee IIMAS The University of Tezae at Austin National University of Mezico Austin, Tezas 78712 ABSTRACT ples include linear and multidimensional arrays, polygons, and complex numbers. ADT-INGRES is an implementation Molecular objects occur frequently in CAD and engineering of this proposal ([Ong84]). applications. At higher levels of abstraction they are Su et al. ([Su83], [Bro83a]) have proposed that a set of treated as atomic unit,s of data; at lower levels they are system-defined types (e.g., set, vector, matrix, time series) defined in terms of a set of tuples possibly from different are required for DBMS support of statistical database appli- relations. System R’s complex objects are examples of cations. In addition, they proposed ways in which these molecular objects. types could be used to define new data types. A relation, In this paper, we present a framework for studying a gen- for example, could be defined as a data type and a column eralized concept of molecular objects. We show that of a relation could be a system-defined type or a relation. abstract data types unify this framework, which itself The resulting data types and their attendant operators are encompasses some recent data modeling contributions by called ‘complex data types’. researchers at IBM San Jose, Berkeley, Boeing, and Florida. The second distinct approach does not deal with A programming language/data structure paradigm is seen columns of a relation, but rather treats collections of as a way of developing and testing the power of logical data heterogeneous tuples as objects.
    [Show full text]
  • Submitted in Fulfilment of the Requirements for the Degree of in the At
    THE IMPACT OF THE DAT A MANAGEMENT APPROACH ON INFORMATION SYSTEMS AUDITING by DON FRIEDRICH FORSTENBURG submitted in fulfilment of the requirements for the degree of MASTER OF ACCOUNTING SCIENCE in the DEPARTMENT OF APPLIED ACCOUNT ANCY at the UNIVERSITY OF SOUTH AFRICA SUPERVISOR: PROF E SADLER JOINT SUPERVISOR: PROF C HATTINGH NOVEMBER 1992 ii ACKNOWLEDGEMENTS I would like to take the opportunity to express my appreciation to those who contributed to the success of this study. Firstly, I would like to acknowledge the important role that Mr. P.J. de Bruyn and a previous colleague (who wants to remain anonymous) have played. Without them the study would definitely not have progressed beyond the inception stage. Secondly, to Mr. S. Loubser, who did pioneering work in the field of data resource management in South Africa and who introduced me to the subject of data management. To my supervisor, Professor E. Sadler and joint supervisor, Professor C. Hattingh, I am indebted for their ideas and support and providing feedback when I needed it. They played an invaluable role in facilitating the completion of this research study. Finally, I thank my wife and children for their understanding, patience and support during the completion of this thesis. Above all, to God be the honour and glory. PRETORIA 30 November 1992 ----~_...,._ ,.....__ :.~·. __ .. ,,,-- + 1...r RS> /..ccr·t ,. , ~::~:if~;... lfllllllllf llfllllllllllfllll~lllllllf 01491971 iii SUMMARY In establishing the impact of formal data management practices on systems and systems development auditing in the context of a corporate data base environment; the most significant aspects of a data base environment as well as the concept of data management were researched.
    [Show full text]
  • Heuristic Optimization of Physical Data Bases: Using a Generic and Abstract Design Model By: Prashant Palvia Palvia, P
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by The University of North Carolina at Greensboro Heuristic Optimization of Physical Data Bases: Using a Generic and Abstract Design Model By: Prashant Palvia Palvia, P. "Heuristic Optimization of Physical Databases; Using a Generic & Abstract Design Model," Decision Sciences, Summer 1988, Vol. 19, No. 3, pp. 564-579. Made available courtesy of Wiley-Blackwell: The definitive version is available at http://www3.interscience.wiley.com/ ***Reprinted with permission. No further reproduction is authorized without written permission from Wiley-Blackwell. This version of the document is not the version of record. Figures and/or pictures may be missing from this format of the document.*** Abstract: Designing efficient physical data bases is a complex activity, involving the consideration of a large number of factors. Mathematical programming-based optimization models for physical design make many simplifying assumptions; thus, their applicability is limited. In this article, we show that heuristic algorithms can be successfully used in the development of very good, physical data base designs. Two heuristic optimization algorithms are proposed in the contest of a genetic and abstract model for physical design. One algorithm is based on generic principles of heuristic optimization. The other is based on capturing and using problem- specific information in the heuristics. The goodness of the algorithms is demonstrated over a wide range of problems and factor values. Subject Areas: Heuristics, Management Information Systems, and Simulation. Article: INTRODUCTION Data base design is a challenging activity and involves two phases: logical and physical design.
    [Show full text]
  • Describing Data Patterns. a General Deconstruction of Metadata Standards 2013
    Repositorium für die Medienwissenschaft Jakob Voß Describing Data Patterns. A general deconstruction of metadata standards 2013 https://doi.org/10.25969/mediarep/4151 Veröffentlichungsversion / published version Hochschulschrift / academic publication Empfohlene Zitierung / Suggested Citation: Voß, Jakob: Describing Data Patterns. A general deconstruction of metadata standards. Berlin: Humboldt-Universität zu Berlin 2013. DOI: https://doi.org/10.25969/mediarep/4151. Erstmalig hier erschienen / Initial publication here: https://doi.org/10.18452/16794 Nutzungsbedingungen: Terms of use: Dieser Text wird unter einer Creative Commons - This document is made available under a creative commons - Namensnennung - Weitergabe unter gleichen Bedingungen 4.0/ Attribution - Share Alike 4.0/ License. For more information see: Lizenz zur Verfügung gestellt. Nähere Auskünfte zu dieser Lizenz https://creativecommons.org/licenses/by-sa/4.0/ finden Sie hier: https://creativecommons.org/licenses/by-sa/4.0/ Describing Data Patterns A general deconstruction of metadata standards Dissertation in support of the degree of Doctor philosophiae (Dr. phil.) by Jakob Voß submitted at January 7th 2013 defended at May 31st 2013 at the Faculty of Philosophy I Humboldt-University Berlin Berlin School of Library and Information Science Reviewer: Prof. Dr. Stefan Gradman Prof. Dr. Felix Sasaki Prof. William Honig, PhD This document is licensed under the terms of the Creative Commons Attribution- ShareAlike license (CC-BY-SA). Feel free to reuse any parts of it as long as attribution is given to Jakob Voß and the result is licensed under CC-BY-SA as well. The full source code of this document, its variants and corrections are available at https://github.com/jakobib/phdthesis2013.
    [Show full text]
  • Geographic Data Structures 1. Introduction
    GEOGRAPHIC DATA STRUCTURES Hakan SARBANOQLU 1. INTRODUCTION The information stored in a conventional database are the attribute values belonging to certain entity occurences. Entity types and required attributes for each of these entity types are carefuly determined by following a systems analysis and design methodology. During the design study, data definition specifications (data type, lenght, validity constraints etc.) are decided as well. For instance: Entity : Employee Attributes: Attribute name Data type Length Validity constraint employee no. character 6 name character 10 surname character 14 sex character I M orF address character 50 city character 20 postal-code character 5 date-of-birth character 11 There are various relationships between the real world entities; likewise these relationships should be represented in the database. Which relations between which entities are determined also during the analysis and design study by taking the users' access path requirements. Entity-relationship (E-R) Diagrams or Logical Data Structures (LDS) are the ways of illustrating these relationships. As an example, the general data structure of a personnel database is shown by theLDS on figure 1. 75 PERSONNEL UNIVERSITY JOB HISTORY one to one relationship __________~(~ one to many relationship ) ( many to many relationship Fiqure-l.: Logical data structure of a.personnel database Each box in this chart represents an entity about which information are stored. Connecting lines show relationships between entities. The relationships are classified into three types according to whether one or many occurences (records) of an entityis/are related to one' or many occurences (records) of the other entity. For example, "there are many employees working in a department but each employee' works only for a department" statement indicates a one to many relationship between department and personnel entities.
    [Show full text]