TECHNICAL JOURNAL VOL 20 NO. 3 (Paper 1-7)
Total Page:16
File Type:pdf, Size:1020Kb
Technical Journal, University of Engineering and Technology (UET) Taxila, Pakistan Vol. 20 No. III-2015 Data Mapping for Transformation from RDB Schema to RDF Schema K. R. Malik12, T. Ahmad , M. M. Iqbal3 1,2,3 Computer Science and Engineering Department, UET Lahore, Pakistan [email protected] Abstract-In this paper, we discussed the data mapping Semantic Web is a language to represent information on for transformation from relational database (RDB) the internet in the form of triplets; subject, predicate schema to resource description frame (RDF) Schema. and object [viii]. Which are used among resources During transformation process between these two mapped at hierarchal levels using graph based schemas, weaknesses like compatibility issues, update representations. Data types supported by XML play the query and complexity in relationships are generated. key role in transformation to work properly. Whereas, We proposed an approach to overcome these issues customized data types can also be prepared using XML particularly when data is transformed from RDB to based tags [ix]. This is another reason due to which RDF for semantic web applications. As, for evolving transformation made by different techniques and data keeping changes intact is hard and difficult to algorithm fails to support each other at their full level to sustain. Main focus of this study is to map up common support compatibility of data among systems. So, there features found in both data models of RDB and is a need to look into different capabilities of data types Semantic Web (SW) based schemas using either form supported by XML either by Document Type of XML as an intermediate which will help in Definition (DTD) or XMLS along with their improving transformation results. These data mappings limitations. can further help in gaining better compatibility options for data transformation. II. STATE OF THE ART Keywords-Data Transformation, Extensible Markup The bulk of data and information are found on the Language Schema (XMLS), Document Type Web is stored and retrieved using RDBs. Previous Definition (DTD), Resource Description Framework research shows that Semantic Web collaboration with Schema (RDFS), Relational Database Schema other domains extends its utilization beyond the Web (RDBS), Data Type Mapping [iii]. Many methods and tools were introduced to help by providing the ways to explore relational data for I. INTRODUCTION availability to Semantic Web based systems [x]. Yet, In Web applications, generally data is stored in the there exist problems in clearly gaining results with high form of RDB. The web is semi-structured and performance and compatibility [xi-xiii]. unorganized in a formal way [i]. When it comes to The standard which is known as XML document searching and getting in touch with resources like web provides tags for the web. Where semi-structured and pages, peoples, and other web contents like video, user defined, and predefine tags are stored. Though, the audio etc. search engines work as a tool for web. XML document is written either using XML schema or Despite of advances made to these search engines, size DTD [ix]. So, for transforming a relational schema into of web content beats technology advancements. To RDF schema, we have to do it partially as relational overcome this problem, the Web content representation schema into XML schema and then XML schema into is required to be translated into machine-processable RDF schema or relational into DTD and then DTD into form. To translate such information into machine RDF schema. After that, transforming it back into its understandable form Semantic Web (SW) has been original form of relational schema. introduced [ii]. The need of SW increases due to its Mapping can be done from RDB to RDF either by capability in providing improved methods and direct or indirect methods. Indirect methods like RDB intelligent data seeking mechanisms and became a big to RDF Mapping Language (R2RML) involves table evolution in the next generation of web [iii]. And to mapping from RDB to RDF without intermediate make this possible, old technologies are getting utilization of XML. Other indirect methods are about transformed into new ones. The EXtensible Markup transforming schema from RDB to RDF using XML. Language (XML) based documents are getting XML schema can be either in the DTD or XML [xi- enriched to map-up with Semantic Web [iv-vii]. xiii]. Which is focusing on indirect methods based Resource description framework (RDF) for the mapping. 99 Technical Journal, University of Engineering and Technology (UET) Taxila, Pakistan Vol. 20 No. III-2015 Different researchers have attempted different above mentioned problem situation is possible through approaches to fulfill the required outcome of reaching mapping of differences at schema level, enriching the to a next generation web that is the Semantic Web. DB contents on web based mapping results and There have been attempts made for transforming a identifying grey areas. database schema into the Document Type Definition (DTD) and then into Resource Description Framework III. PROPOSED WORK Schema (RDFS) either partially or completely. DTDs at the end of XML document for the web can be Transformation from one data model into another transformed and mapped into RDF schema. By looking comes with its limitations. In this field of study, first into tags available in the XML document to pick better look into mapping between different data models. Then suitable tags for the satisfying role as a class or property the process of transformation using DTD and XMLS is [v]. On the other hand, XML documents can be updated given. A control experiment presented to show that how to gain capability of being interpreted as RDF. It would transformation works and on the basis of that results are be better if the XML's original structure remains given. unchanged and transformation process better results A. Mapping and coverage of the developed technique [vi, xiv-xvi]. A tabular representation in Table I to show the In depth, the idea of resources linking among corresponding schema entity for the each concept of database, DTD and RDF is by using queries without Database. These entities are further used for proper changing context and meaning of web based data to transformation among RDB and semantic web. In form its semantic existence [xvii-xix]. These Table 1 each concept of a database is mapped with it approaches can be captured to get a single side one of the possible mapped solution in DTD, XML transformation approach. Other than these differences schema, RDFS. Table I shows how each entity of a the major one is that each resource in RDF is assigned a database can be stored in its corresponding mapped unique Uniform resource identifier (URI) for field and command of other technologies like DTD, identification. So, information on both of these XML and RDF. Furthermore, some of the cases are semantic based expressions is given a shape of triples discussed with the help of W3C standards for all of [xx]. these technologies after careful study. The Table I can This research can help to reduce complexity and play a crucial role in transforming from database to compatibility issues for the process of transformation. semantic web and back to the database. As complexity and compatibility issues arise due to database schemas and ontologies are evolving at a TABLE I constant speed to map with application and user SHOWING ALTERNATIVE SCHEMA ELEMENTS FOR requirements. Therefore, instead of mapping being EACH CASE OF RELATIONAL SCHEMA VALUE redefined from scratch it should evolve on top [xiii]. Relational XML Concepts DTD RDFS The update statement concerning RDF stores is still Schema Schema under progress and its semantics are not yet well Table Table_Name !ELEMENT Complex type Class defined, and uncertainty remains concerned about the Table_Name* element transformation of few SPARQL (SPARQL Protocol ATTLIST and RDF Query Language) Update statements. Only !ELEMENT Field Field_Name !ELEMENT Element Rdf: elementary (attribute-to-property & relation-to-class) Field_Name Property mappings have been studied up to now. The problem of Cardinality Field(>=0) !ELEMENT Restriction modifying relational data using SPARQL Update is the Field_Name* (Pattern) same as to view, update problem the classic database Cardinality Field(>0) !ELEMENT Restriction [xiii, xxi, xxii]. Field_Name+ (Pattern) It is thought that one cause for the delay in the Referencing Field !ELEMENT Simple type Domain recognition of the Semantic Web is the deficiency of Ref Key element Ref (Property) application and tools showing benefits of semantic web Field_Name* technology [xxv]. The success depends of large amount IDREF #REQUIRED of data concerning semantic web of these tools [xxiv]. Primary Key Field !ELEMENT Key Attribute Range Because relational databases are considered as highly Field_Name? Use=”required” used medium for storage of data on web. The solution ID was to automatic manage mass of data in SW form as #REQUIRED RDF. Composite key !ELEMENT Attribute Rdfs:sub It is studied that the transformation model from Second Key Use=”required” Property relational DB to RDF and storage mechanism of RDF Field_Name? Of stores in RDB. It is important to map existing relational ID #REQUIRED representation among relational schema, DTD, XML Data type Field PCDATA or Type Type schema, and RDFS [ix, xx, xxiii]. Proposed study of the CDATA 100 Technical Journal, University of Engineering and Technology (UET) Taxila, Pakistan Vol. 20 No. III-2015 In Table I concept of database covered are transformation, first let's look at the RDBS of tabulated, field, cardinality, referencing, data types and considered example of relational schema of the entity keys. The elements are the basic entity used for both named “Organization” shown in Fig. 2. There are four XML and DTD which are used for table and fields. tables Dependent, Employee, Manager and Each element can further contain a list of attributes Department.