Using a Metadata Software Layer in Information Systems Integration*

Total Page:16

File Type:pdf, Size:1020Kb

Using a Metadata Software Layer in Information Systems Integration* Using a Metadata Software Layer in Information Systems Integration Mark Roantree1, Jessie B. Kennedy2, and Peter J. Barclay2 1 School of Computer Applications, Dublin City University, Dublin, Ireland [email protected] 2 School of Computing, Napier University, Edinburgh, Scotland Abstract. A Federated Information System requires that multiple (of- ten heterogenous) information systems are integrated to an extent that they can share data. This shared data often takes the form of a federated schema, which is a global view of data taken from distributed sources. One of the issues faced in the engineering of a federated schema is the continuous need to extract metadata from cooperating systems. Where cooperating systems employ an object-oriented common model to inter- act with each other, this requirement can become a problem due to the type and complexity of metadata queries. In this research, we specified and implemented a metadata software layer in the form of a high-level query interface for the ODMG schema repository, in order to simplify the task of integration system engineers. Two clears benefits have emerged: the reduced complexity of metadata queries during system integration (and federated schema construction) and a reduced learning curve for programmers who need to use the ODMG schema repository. 1 Introduction Many database applications require a mechanism by which ‘generic’ applications can determine a database’s structure at runtime, for functions such as graphical browsers, dynamic queries and the specification ofview schemata. This property, often referred to in programming languages as reflection, has been a feature of databases for many years, and the 2.0 specification of the ODMG metamodel [3] has provided a standard API for metadata queries in object-oriented databases. As part ofour research into federateddatabases, we specified and implemented a global view mechanism to facilitate the creation of views for ODMG databases, and the subsequent integration of view schemata to form federated schemata. Please refer to [6,10] for a complete background on federated databases. In this paper we do not concentrate on the topic offederateddatabases but instead focus on the construction of a metadata interface to ODMG information systems. This paper is structured as follows: the remainder of this section provides a brief overview ofthe nature ofour research, the importance ofmetadata to our view mechanism, and the motivation for this research; in Sect. 2 the main concepts in Supported by Forbairt Strategic Research Programme ST/98/014 K.R. Dittrich, A. Geppert, M.C. Norrie (Eds.): CAiSE 2001, LNCS 2068, pp. 299–314, 2001. c Springer-Verlag Berlin Heidelberg 2001 300 Mark Roantree, Jessie B. Kennedy, and Peter J. Barclay this form of research are discussed, together with an informal description of the metadata layer; in Sect. 3 the pragmatics ofthe language are presented through a series ofexamples; in Sect. 4 we present details ofthe implementation; and finally in Sect. 5 we offer some conclusions. In this paper we use the term view (or ODMG view) to refer to an ODMG subschema which may contain multiple classes, and is defined on an ODMG database, or on another view which has been defined on an ODMG database. 1.1 Background and Motivation The main focus of our research was to extend the ODMG 2.0 model to provide views in a federated database environment. This work yielded the specification and implementation ofa global view mechanism, using the ODMG model as the common model for a federation ofdatabases. The concept ofa federationof databases [10] is one where heterogeneous databases (or information systems) can communicate with one another through an interface provided by a com- mon data model. In our case, the common data model is the ODMG model, the standard model for object-oriented databases since 1993 [3]. The most com- mon architecture for these systems is as follows: data resides in many (generally heterogeneous) information systems or databases; the schema of each Informa- tion System (IS) is translated to an O-O format, and this new schema is called the component schema; view schemata are defined as shareable subsets ofthe component schema; the view schemata are exported to a global or federated server where they are integrated to form many global or federated schemata. Our focus was to extend the ODMG model so that it was possible to define the view schemata on top ofeach component schema, and define integration oper- ators which facilitated the construction of federated schemata. This extension provided a layer ofODMG views on top ofthe component schema. However, it was also necessary to provide a mapping language which could bind the compo- nent schema to its local model representation. This facilitates the translation of ODMG queries (to the their local IS equivalent), and enables data transfer to the ODMG database when views are defined. The classes which comprise the database schema are used to model the real world entities which are stored in the database. Additionally, there is a set of metaclass instances which are used to describe the database classes. Thus, we can think ofan ODMG database as having two distinct sets ofclasses: those which reside in the database schema, and the abstract classes (metaclasses) which reside in the schema repository. Whenever we process and store a view defini- tion, a set ofmetaclass instances are stored in the database. Where a view definition involves multiple classes, each with their own extents, this combina- tion ofmeta-objects can become quite complex. Thus, the request to display a view, or extract data from the local IS often requires powerful query facili- ties in order to retrieve the required meta-information. In Fig. 1 the role of the schema repository within a federated database environment is illustrated. Both the Component Database and Federated Database Server are ODMG databases. The schema repository contains a description ofthe database classes, hence the Using a Metadata Software Layer in Information Systems Integration 301 Software Layer Information Systems Layer Descriptions of imported views Metadata and federated view definitions Query (metaclasses) Interface Global Descriptions of Query database classes, Interface S.R. D.S. wrapper definitions to local IS, and view definitions (metaclasses) Federated Database Server Local Vi ew definitions Query Interface ODMG representation S.R. D.S. of local information system (database classes) Component Database mappings S.R. : Schema Repository D.S. : Database Schema Local information system Fig. 1. Metadata architecture within ODMG information systems. arrow towards the database schema. However, in this type ofarchitecture, the schema repository will contain a large amount ofadditional data required by view definitions. This research involved extending the ODMG metamodel in order to construct view schemata. However, due to the complex nature ofboth the base metamodel and extensions, many ofthe OQL queries which were required both to retrieve base and view class metadata would necessitate long expressions. For this reason we specified some language extensions to OQL for the specific purpose of easy retrieval ofmetadata fromthe schema repository. The contribution ofthis work is to provide a software metadata layer which facilitates the easier expression ofODMG metadata queries. In factall metadata queries can be expressed in a single line. In addition, we believe it is possible to improve the performance of metadata queries as some ofour experiments (discussed briefly in Sect. 4) have shown. 2 Metadata Objects and Queries In this section we provide a description ofthe main concepts involved in this research: the metadata objects and the queries used to manipulate metadata. Metadata objects are used to describe both the structural elements ofthe par- ticipating systems (base metadata objects), and the virtual elements which are defined using a view language and are mapped to base metadata objects. Meta- data queries are categorized into groups representing the type ofmetadata in- formation required. 302 Mark Roantree, Jessie B. Kennedy, and Peter J. Barclay d_Scope d_Meta_Object d_Property d_Type d_Constant d_Relationship d_Attribute d_Derived d_Inheritance d_Class d_Ref_Type d_Collection_type d_Primitive_Type Fig. 2. The ODMG metamodel (subset). 2.1 Metadata Elements In Fig. 2 a briefoutline ofthe ODMG schema repository interface(or ODMG C++ metamodel) is illustrated. For integration engineers the main points ofin- terest are classes and their properties, and issues such as scoping and inheritance. The illustration attempts to show a hierarchy ofmetaclasses with the d Scope, d Inheritance and d Meta Object metaclasses at the top ofthe hierarchy. Oth- erwise, all metaclasses derive from d Meta Object (identified by arrows) and multiple inheritance occurs where some metaclasses derive from d Scope.By deriving from a d Scope object, meta-objects can place other meta-objects in- side a (conceptual) container class. For example, a d Class object derives from d Scope, and can use its properties to bind a set of d Attribute objects to it. Specifics ofthe roles ofeach ofthe metaclasses is best described in [5]. In an ODMG database that contains a view mechanism, it is necessary to be able to distinguish between base and virtual classes, and for virtual classes one must be capable ofobtaining the same informationregarding structure as can be obtained for base classes. The view mechanism, its specification language, and implementation
Recommended publications
  • Type-Safe Composition of Object Modules*
    International Conference on Computer Systems and Education I ISc Bangalore Typ esafe Comp osition of Ob ject Mo dules Guruduth Banavar Gary Lindstrom Douglas Orr Department of Computer Science University of Utah Salt LakeCity Utah USA Abstract Intro duction It is widely agreed that strong typing in We describ e a facility that enables routine creases the reliability and eciency of soft typ echecking during the linkage of exter ware However compilers for statically typ ed nal declarations and denitions of separately languages suchasC and C in tradi compiled programs in ANSI C The primary tional nonintegrated programming environ advantage of our serverstyle typ echecked ments guarantee complete typ esafety only linkage facility is the ability to program the within a compilation unit but not across comp osition of ob ject mo dules via a suite of suchunits Longstanding and widely avail strongly typ ed mo dule combination op era able linkers comp ose separately compiled tors Such programmability enables one to units bymatching symb ols purely byname easily incorp orate programmerdened data equivalence with no regard to their typ es format conversion stubs at linktime In ad Such common denominator linkers accom dition our linkage facility is able to automat mo date ob ject mo dules from various source ically generate safe co ercion stubs for com languages by simply ignoring the static se patible encapsulated data mantics of the language Moreover com monly used ob ject le formats are not de signed to incorp orate source language typ e
    [Show full text]
  • 5. Data Types
    IEEE FOR THE FUNCTIONAL VERIFICATION LANGUAGE e Std 1647-2011 5. Data types The e language has a number of predefined data types, including the integer and Boolean scalar types common to most programming languages. In addition, new scalar data types (enumerated types) that are appropriate for programming, modeling hardware, and interfacing with hardware simulators can be created. The e language also provides a powerful mechanism for defining OO hierarchical data structures (structs) and ordered collections of elements of the same type (lists). The following subclauses provide a basic explanation of e data types. 5.1 e data types Most e expressions have an explicit data type, as follows: — Scalar types — Scalar subtypes — Enumerated scalar types — Casting of enumerated types in comparisons — Struct types — Struct subtypes — Referencing fields in when constructs — List types — The set type — The string type — The real type — The external_pointer type — The “untyped” pseudo type Certain expressions, such as HDL objects, have no explicit data type. See 5.2 for information on how these expressions are handled. 5.1.1 Scalar types Scalar types in e are one of the following: numeric, Boolean, or enumerated. Table 17 shows the predefined numeric and Boolean types. Both signed and unsigned integers can be of any size and, thus, of any range. See 5.1.2 for information on how to specify the size and range of a scalar field or variable explicitly. See also Clause 4. 5.1.2 Scalar subtypes A scalar subtype can be named and created by using a scalar modifier to specify the range or bit width of a scalar type.
    [Show full text]
  • Python Programming
    Python Programming Wikibooks.org June 22, 2012 On the 28th of April 2012 the contents of the English as well as German Wikibooks and Wikipedia projects were licensed under Creative Commons Attribution-ShareAlike 3.0 Unported license. An URI to this license is given in the list of figures on page 149. If this document is a derived work from the contents of one of these projects and the content was still licensed by the project under this license at the time of derivation this document has to be licensed under the same, a similar or a compatible license, as stated in section 4b of the license. The list of contributors is included in chapter Contributors on page 143. The licenses GPL, LGPL and GFDL are included in chapter Licenses on page 153, since this book and/or parts of it may or may not be licensed under one or more of these licenses, and thus require inclusion of these licenses. The licenses of the figures are given in the list of figures on page 149. This PDF was generated by the LATEX typesetting software. The LATEX source code is included as an attachment (source.7z.txt) in this PDF file. To extract the source from the PDF file, we recommend the use of http://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/ utility or clicking the paper clip attachment symbol on the lower left of your PDF Viewer, selecting Save Attachment. After extracting it from the PDF file you have to rename it to source.7z. To uncompress the resulting archive we recommend the use of http://www.7-zip.org/.
    [Show full text]
  • Type Conversion ,Type Casting, Operator Precedence and Associativity in C
    Type Conversion ,Type Casting, Operator Precedence and Associativity in C Gaurav Kr. suman 4/30/20 MAT09 The type conversion in C is basically converting one type of data type to other to perform some operation. The conversion is done only between those datatypes wherein the conversion is possible There are two types of type conversion: This type of conversion is usually performed by the compiler when necessary without any commands by the user. Thus it is also called "Automatic Type Conversion". • Done by the compiler on its own, without any external trigger from the user. • Generally takes place when in an expression more than one data type is present. In such condition type conversion (type promotion) takes place to avoid lose of data. • All the data types of the variables are upgraded to the data type of the variable with largest data type. Now, let’s focus on some examples to further understand about type conversions in C. Example 1 int a = 20; double b = 20.5; a + b; Here, first operand is int type and other is of type double. So, as per rule, the variable a will be converted to double. Therefore, the final answer is double a + b = 40.500000. Example 2 char ch='a'; int a =13; a+c; Here, first operand is char type and other is of type int. So, as per rule , the char variable will be converted to int type during the operation and the final answer will be of type int. We know the ASCII value for ch is 97.
    [Show full text]
  • Object Oriented Pogramming with C++
    OBJECT ORIENTED POGRAMMING WITH C++ 1. What is type conversion? Type Conversion is the process of converting one predefined type into another type. When variables of one type are mixed with variables of another type, a type conversion will occur. C++ facilitates the type conversion into the following two forms: Implicit Type Conversion: An implicit type conversion is a conversion performed by the compiler without programmer's intervention. It is applied whenever different data types are intermixed in an expression, so as not to lose information short x=6000; int y; y=x; // data type short variable x is converted to int and is assigned to the integer variable y. Explicit Type Conversion: The explicit conversion of an operand to a specific type is called type casting. An explicit type conversion is user-defined that forces an expression to be of specific type. Syntax: (type) expression #include <iostream.h> void main( ) { int a; float b, c; cout << "Enter the value of a:"; cin >> a; cout << "Enter the value of b:"; cin >> b; c = float(a)+b; cout << "The value of c is:" << c; getch(); } In the above program “a” is declared as integer and “b” and “c” are declared as float. In the type conversion statement namely c = float (a) +b; The variable a of type integer is converted into float type and so the value 10 is converted as 10.0 and then is added with the float variable b with value 12.5 giving a resultant float variable c with value as 22.5 2. Why C++ is called OOP language? M.VIJAY .
    [Show full text]
  • VHDL Type Conversion | Bitweenie
    VHDL Type Conversion | BitWeenie http://www.bitweenie.com/listings/vhdl-type-conversion/ Home About Electrical Engineer Jobs Request Topic Resources Home » VHDL Type Conversion VHDL Type Conversion Posted by Shannon Hilbert in Verilog / VHDL on 2-10-13 Any given VHDL FPGA design may have multiple VHDL types being used. The most common VHDL types used in synthesizable VHDL code are std_logic, std_logic_vector, signed, unsigned, and integer. Because VHDL is a strongly-typed language, most often differing types cannot be used in the same expression. In cases where you can directly combine two types into one expression, you are really leaving it up to the compiler or synthesis tool to determine how the expression should behave, which is a dangerous thing to do. This article will discuss the following concepts: 1. Type casting and conversion functions. 2. The importance of using the appropriate type. 3. Common uses and examples. VHDL Type Cast and Conversion Functions The picture below illustrates how to convert between the most common VHDL types. Type casting is used to move between the std_logic_vector type and the signed and unsigned types. 1 --signal definitions 2 signal slv : std_logic_vector(7 downto 0); 3 signal s : signed(7 downto 0); 4 signal us : unsigned(7 downto 0); 5 6 --FROM std_logic_vector TO signed/unsigned 1 de 5 07/10/2015 14:58 VHDL Type Conversion | BitWeenie http://www.bitweenie.com/listings/vhdl-type-conversion/ 7 sgn <= signed(slv); 8 usgn <= unsigned(slv); 9 10-- FROM signed/unsigned TO std_logic_vector 11svl <= std_logic_vector(sgn); 12svl <= std_logic_vector(usgn); Functions are used to move between signed and unsigned types and the integer type.
    [Show full text]
  • N1592 Explicit Conversion Operators
    Document Number: SC22/WG21/N1592=04-0032 Date: 2004-2-13 Author: Lois Goldthwaite Email: [email protected] Explicit Conversion Operators This paper proposes a small change in C++ grammar to permit the function-specifier 'explicit' to be applied to the definition of a user-defined conversion operator. The semantic effect is to inhibit automatic conversions in situations where they may not have been intended. The Problem One of the design principles of C++ is that the language does not enforce a different syntax for user-defined types and built-in primitive types. A variable of either category can be passed by value (assuming the programmer has not intentionally disabled this), and a variable of any type can be passed by reference. The compiler will perform automatic promotions and conversions, if necessary, when numeric types are used as function parameters or when differing types are combined with an operator (int to long, signed to unsigned, float to double, etc.). Similarly, the programmer can write conversion functions for user-defined types, so that the conversions will take place transparently. This is a feature, and A Good Thing, as it decreases the number of overloaded functions which would otherwise be needed (D&E 3.6.1). In Modern C++ Design, Alexandrescu says, "User-defined conversions in C++ have an interesting history. Back in the 1980s, when user-defined conversions were introduced, most programmers considered them a great invention. User-defined conversions promised a more unified type system, expressive semantics, and the ability to define new types that were indistinguishable from built-in ones.
    [Show full text]
  • Polymorphism
    Polymorphism A closer look at types.... Chap 8 polymorphism º comes from Greek meaning ‘many forms’ In programming: Def: A function or operator is polymorphic if it has at least two possible types. Polymorphism i) OverloaDing Def: An overloaDeD function name or operator is one that has at least two Definitions, all of Different types. Example: In Java the ‘+’ operator is overloaDeD. String s = “abc” + “def”; +: String * String ® String int i = 3 + 5; +: int * int ® int Polymorphism Example: Java allows user DefineD polymorphism with overloaDeD function names. bool f (char a, char b) { return a == b; f : char * char ® bool } bool f (int a, int b) { f : int * int ® bool return a == b; } Note: ML Does not allow function overloaDing Polymorphism ii) Parameter Coercion Def: An implicit type conversion is calleD a coercion. Coercions usually exploit the type-subtype relationship because a wiDening type conversion from subtype to supertype is always DeemeD safe ® a compiler can insert these automatically ® type coercions. Example: type coercion in Java Double x; x = 2; the value 2 is coerceD from int to Double by the compiler Polymorphism Parameter coercion is an implicit type conversion on parameters. Parameter coercion makes writing programs easier – one function can be applieD to many subtypes. Example: Java voiD f (Double a) { ... } int Ì double float Ì double short Ì double all legal types that can be passeD to function ‘f’. byte Ì double char Ì double Note: ML Does not perform type coercion (ML has no notion of subtype). Polymorphism iii) Parametric Polymorphism Def: A function exhibits parametric polymorphism if it has a type that contains one or more type variables.
    [Show full text]
  • Primitive Data, Variables, and Expressions; Simple Conditional Execution
    Unit 2, Part 1 Primitive Data, Variables, and Expressions; Simple Conditional Execution Computer Science S-111 Harvard University David G. Sullivan, Ph.D. Overview of the Programming Process Analysis/Specification Design Implementation Testing/Debugging Example Problem: Adding Up Your Change • Let's say that we have a bunch of coins of various types, and we want to figure out how much money we have. • Let’s begin the process of developing a program that does this. Step 1: Analysis and Specification • Analyze the problem (making sure that you understand it), and specify the problem requirements clearly and unambiguously. • Describe exactly what the program will do, without worrying about how it will do it. Step 2: Design • Determine the necessary algorithms (and possibly other aspects of the program) and sketch out a design for them. • This is where we figure out how the program will solve the problem. • Algorithms are often designed using pseudocode. • more informal than an actual programming language • allows us to avoid worrying about the syntax of the language • example for our change-adder problem: get the number of quarters get the number of dimes get the number of nickels get the number of pennies compute the total value of the coins output the total value Step 3: Implementation • Translate your design into the programming language. pseudocode code • We need to learn more Java before we can do this! • Here's a portion or fragment of a Java program for computing the value of a particular collection of coins: quarters = 10; dimes = 3; nickels = 7; pennies = 6; cents = 25*quarters + 10*dimes + 5*nickels + pennies; System.out.println("Your total in cents is:"); System.out.println(cents); • In a moment, we'll use this fragment to examine some of the fundamental building blocks of a Java program.
    [Show full text]
  • Operator Overloading, Friend Functions/Classes
    Chapter 8 Operator Overloading, Friends, and References Learning Objectives Basic Operator Overloading Unary operators As member functions Friends and Automatic Type Conversion Friend functions, friend classes Constructors for automatic type conversion References and More Overloading << and >> Operators: = , ++, -- Operator Overloading Introduction Operators +, -, %, ==, etc. Really just functions! Simply ‘called’ with different syntax: x + 7 ‘+’ is binary operator with x & 7 as operands We ‘like’ this notation as humans Think of it as: +(x, 7) ‘+’ is the function name x, 7 are the arguments Function ‘+’ returns ‘sum’ of it’s arguments Operator Overloading Perspective Built-in operators e.g.: +, -, = , %, ==, /, * Already work for C++ built-in types In standard ‘binary’ notation We can overload them! To work with OUR types! To add ‘Chair types’, or ‘Money types’ As appropriate for our needs In ‘notation’ we’re comfortable with Always overload with similar ‘actions’! Overloading Basics Overloading operators VERY similar to overloading functions Operator itself is ‘name’ of function Example Declaration: const Money operator +( const Money& amount1, const Money& amount2); Overloads + for operands of type Money Uses constant reference parameters for efficiency Returned value is type Money Allows addition of ‘Money’ objects Overloaded ‘+’ Given previous example: Note: overloaded ‘+’ NOT member function Definition is ‘more involved’ than simple ‘add’ Requires issues of money type addition Must handle
    [Show full text]
  • Type Conversion Type Conversion
    1.3b Type Conversion Type Conversion When we write expressions involved data that involves two different data types , such as multiplying an integer and floating – point number, we need to perform a type conversion…. Two type of conversions mainly Implicit type conversion Explicit type conversion Implicit Type Conversion When the types of the two operands in a binary expression are different,C automatically converts one type to another which is known as Implicit Type Conversion. Some of the simple conversions are : Conversion Rank Conversions in Assignment Expressions Promotion Demotion Conversion in other Binary Expressions Conversion Rank In C , we assign a rank to the integral and floating point arithmetic types Real 9.long double 10.double 11.float Conversion Rank Integer 6.long long 5.long 4. int 3. short Character 2. Char Boolean 1. Bool Conversions in Assignment Expressions A simple assignment involves an assignment operator and two operands Depending on the difference in the rank, C tries to either promote or demote the right expression to make it the same rank as the left variable. Promotion occurs if the right expression has lower rank Demotion occurs if the right expression has a higher rank Promotion The rank of the left expression is elevated to the rank of the left variable The value of the expression is the value of the right expression after the promotion bool b = true; char c = ‘A’; int i = 1234; long double d = 3458.0004 c = b; //Value of c is SOH i = c; //Value of i is 65 d = b; //Value of d is 1.0 d = i; //Value of d is 1234.0 Demotion If the size of the variable at the left side can accommodate the value of the expression, there is no problem An integral or real value can be assigned to a Boolean Type.
    [Show full text]
  • Chapter 6 Data Types
    Chapter 6 Data Types • Introduction • Type Nomenclature • Primitive Types • Type constructors – Ordinal Types – Structured Types • Type Checking • Type conversion CSCI325 Chapter 6 Dr Ahmed Rafea 1 Introduction Evolution of Data Types: FORTRAN I (1956) - INTEGER, REAL, arrays … Ada (1983) - User can create a unique type for every category of variables in the problem space and have the system enforce the types Def: A descriptor is the collection of the attributes of a variable Def: A data type is a set of values, together with a set of operations on those values having certain properties CSCI325 Chapter 6 Dr Ahmed Rafea 2 Type Nomenclature C Types Basic Derived Void Numeric Pointer Array Function struct union Integral Floating Float (signed) enum Double (unsigned) Long double char Int Short int Long int CSCI325 Chapter 6 Dr Ahmed Rafea 3 Primitive Data Types Integer - Almost always an exact reflection of the hardware, so the mapping is trivial - There may be as many as eight different integer types in a language Floating Point - Model real numbers, but only as approximations - Languages for scientific use support at least two floating-point types; sometimes more - Usually exactly like the hardware, but not always; Decimal - For business applications (money) - Store a fixed number of decimal digits (coded) - Advantage: accuracy - Disadvantages: limited range, wastes memory Boolean - Could be implemented as bits, but often as bytes - Advantage: readability CSCI325 Chapter 6 Dr Ahmed Rafea 4 Ordinal Types (user defined) An ordinal type is one in which the range of possible values can be easily associated with the set of positive integers 1.
    [Show full text]