A Data Warehouse Conceptual Data Model for Multidimensional Aggregation

Total Page:16

File Type:pdf, Size:1020Kb

A Data Warehouse Conceptual Data Model for Multidimensional Aggregation A Data Warehouse Conceptual Data Model for Multidimensional Aggregation Enrico Franconi Ulrike Sattler Dept. of Computer Science LuFG Theoretical Computer Science Univ. of Manchester RWTH Aachen Manchester M13 9PL, UK D-52074 Aachen, Germany [email protected] [email protected] the fact that (1) experiences in the field of databases have proved that conceptual modelling is crucial for the design, Abstract evolution, and optimisation of a database, (2) a great va- riety of data warehouse system are on the market, most This paper presents a proposal for a Data Ware- of them providing some implementation of multidimen- house Conceptual Data (CDWDM) Model which sional aggregation, and (3) query optimisation with aggre- allows for the description of both the relevant ag- gated queries [Nutt et al., 1998; Cohen et al., 1999] is even gregated entities of the domain—together with more crucial for data warehouses than it is for databases— their properties and their relationships with other which makes semantic query optimisation using a concep- relevant entities—and the relevant dimensions in- tual model even more important. As a consequence of volved in building the aggregated entities. The the absence of a such an extended modelling formalism, proposed CDWDM is able to capture the database a comparison of different systems or language extensions schemata expressed in an extended version of the for query optimisation is difficult: a common framework in Entity-Relationship Data Model; it is able to in- which to translate and compare these extensions is missing, troduce complex descriptions of the structure of new query optimisation techniques developed for extended aggregated entities and multiply hierarchically or- schema and/or query languages cannot be compared appro- ganised dimensions; it is based on Description priately. Logics, a class of formalisms for which it is possi- In order to address these questions, a formal framework ble to study the expressivity in relation with decid- must be developed that encompasses the abstract principles ability of reasoning problems and completeness of of the data warehouse related extensions of traditional rep- algorithms. resentation formalisms. In this paper, we present some pre- liminary outcome from the research done within the “Foun- 1 Introduction dations of Data Warehouse Quality” (DWQ) long term re- search project, funded by the European Commission (n. Data Warehouse—and especially OLAP—applications ask 22469) under the ESPRIT Programme. With respect to for the vital extension of the expressive power and func- the global picture, the role of our research within DWQ tionality of traditional conceptual modelling formalisms in is to study a formal framework at the conceptual level (see order to cope with aggregation. Still, there have been few Figure 1). The conceptual data model we are investigat- attempts [Catarci et al., 1995; Cabibbo and Torlone, 1998] ing should be able to abstract and describe the entities and to provide such an extended modelling formalism, despite relations which are relevant both in the whole enterprise, and in the user analysis of such information. In the follow- The copyright of this paper belongs to the paper’s authors. Permission to ing, we will refer to this formalism as the Data Warehouse copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage. Conceptual Data Model (DWCDM). Proceedings of the International Workshop on Design and Management of Data Warehouses (DMDW’99) 1.1 A Data Warehouse Conceptual Data Model Heidelberg, Germany, 14. - 15.6. 1999 A DWCDM must provide means for the representation of a (S. Gatziu, M. Jeusfeld, M. Staudt, Y. Vassiliou, eds.) multidimensional conceptual view of data. More precisely, http://sunsite.informatik.rwth-aachen.de/Publications/CEUR-WS/Vol-19/ a DWCDM provides the language for defining multidimen- E. Franconi, U. Sattler 13-1 Client Client REGION W Client S model schema Data N Store Juice 10 Cola 13 Data Soap Enterprise Warehouse Data model schema Warehouse Store PRODUCT Source Source Source model schema Data Store JanFeb MONTH Conceptual Logical Physical level level level Figure 2: Sales volume as a function of product, time, lo- cation. Figure 1: The role played by the Data Warehouse Concep- tual Data Model with respect to the DWQ architecture. For instance, a spatial dimension might have a hierarchy with levels such as country, region, city, office. sional information within a conceptual model in the data Measures (which are also known as variables or warehouse global information base. As stated above, the metrics)—like Sales in the example, or budget, revenue, model is of support for the conceptual design of a data inventory, etc.—in a multidimensional array correspond to warehouse, for query and view management, and for up- columns in a relational database table whose values func- date propagation: it serves as a reference meta-model for tionally depend on the values of other columns. Values deriving the inter-relations among entities, relations, aggre- within a table column correspond to values for that mea- gations, and for providing the integrity constraints neces- sure in a multidimensional array: measures associate val- sary to reduce the design and maintenance costs of a data ues with points in the multi-dimensional world. For ex- warehouse. Hence a DWCDM must be expressive enough ample, the measure of the sales of the product Cola, in to describe both the abstract business domain concerned the northern region, in January, is 13,000. Thus, a dimen- with the specific application (Enterprise model)—just like sion acts as an index for identifying values within a multi- a conceptual schema in the traditional database world—and dimensional array. If one member of the dimension is se- the possible views of the enterprise information a specific lected, then the remaining dimensions in which a range of user may want to analyse (Client model)—with particular members (or all members) are selected defines a sub-cube. emphasis on the aggregated views, which are peculiar to a If all but two dimensions have a single member selected, data warehouse architecture (see Figure 1). A multidimen- the remaining two dimensions define a spreadsheet (or a sional modelling object in the logical perspective—e.g., a slice or a page). If all dimensions have a single member materialised view, a query, or a cube—should always be selected, then a single cell is defined. Dimensions offer a related with some (possibly aggregated) entity in the con- very concise, intuitive way of organising and selecting data ceptual schema. for retrieval, exploration and analysis. Usual pre-defined In the following, we will briefly introduce the ideas be- or user-defined dimension levels (or Roll-Ups ) for aggre- hind a multidimensional data model (see, e.g., [Agrawal gating data in DW are: temporal (e.g., year vs. month), et al., 1995; Cabibbo and Torlone, 1998]) and compare geographical/spatial (e.g., Rome vs. Italy), organisational it with a traditional relational data model. A more com- (meaning the hierarchical breakdowns of your organisa- prehensive introduction has been done in the forthcoming tion, e.g., Institute vs. Department), and physical (e.g., Car book “Fundamentals of Data Warehousing” [Baader et al., vs. Engine). 1999], Chapter 4 on Multidimensional Aggregation. A value in a single cell may represent an aggre- Relational database tables contain records (or rows). gated measure computed from more specific data at some Each record consists of fields (or columns). In a normal re- lower level of the same dimensions. Aggregation in- lational database, a number of fields in each record (keys) volves computing aggregation functions—according to may uniquely identify each record. In contrast, a multidi- the attribute hierarchy within dimensions or to cross- mensional database contains n-dimensional arrays (some- dimensional formulas—for one or more dimensions. For times called hypercubes or cubes), where each dimension example, the value 13,000 for the sales in January, may has an associated hierarchy of levels of consolidated data. have been consolidated as the sum of the disaggregated val- E. Franconi, U. Sattler 13-2 Calls (av. duration) Date (Day) 1/1/99 2/1/99 3/1/99 4/1/99 5/1/99 Cell E Land Line 1 Direct Data Source (Point Type) PABX Calls (av. duration) Date (Week Day) Mon Tue Wed Thu Fri Sat Sun E Consumer 2 Source (Customer Type) Business Figure 3: The cubes reporting the average duration of calls by dates in days and sources in point types, and by dates at the level of week days and sources at the level of customer types. ues of the weekly (or day-by-day) sales. Another example model for both the conceptual and the logical levels; our introducing an aggregation grounded on a different dimen- proposal is compatible with the DWCDM presented in sion is the cost of a product—e.g., a car—as being the sum [Calvanese et al., 1998c]. of the costs of all of its components. The paper is organised as follows. Section 2 infor- In order to provide an adequate conceptualisation of mally introduces an extended ER formalism which allows multidimensional information, a DWCDM should provide for the description of the explicit structure of multidimen- the possibility of explicitly modelling the relevant aggre- sional aggregations; the section briefly describes the se- gations and dimensions. According to a conservative point mantics of the conceptual data model in terms of a logi- of view, a desirable DWCDM should extend some standard cal representation of multidimensional databases, as pro- modelling formalism (such as Entity-Relationship) to al- posed by [Cabibbo and Torlone, 1998]. Section 3 will low for the description of both aggregated entities of the propose a basic modelling language—based on Description domain—together with their properties and their relation- Logics—which is expressive enough to capture the Entity- ships with other relevant entities—and the dimensions in- Relationship Data Model.
Recommended publications
  • Data Warehouse: an Integrated Decision Support Database Whose Content Is Derived from the Various Operational Databases
    1 www.onlineeducation.bharatsevaksamaj.net www.bssskillmission.in DATABASE MANAGEMENT Topic Objective: At the end of this topic student will be able to: Understand the Contrasting basic concepts Understand the Database Server and Database Specified Understand the USER Clause Definition/Overview: Data: Stored representations of objects and events that have meaning and importance in the users environment. Information: Data that have been processed in such a way that they can increase the knowledge of the person who uses it. Metadata: Data that describes the properties or characteristics of end-user data and the context of that data. Database application: An application program (or set of related programs) that is used to perform a series of database activities (create, read, update, and delete) on behalf of database users. WWW.BSSVE.IN Data warehouse: An integrated decision support database whose content is derived from the various operational databases. Constraint: A rule that cannot be violated by database users. Database: An organized collection of logically related data. Entity: A person, place, object, event, or concept in the user environment about which the organization wishes to maintain data. Database management system: A software system that is used to create, maintain, and provide controlled access to user databases. www.bsscommunitycollege.in www.bssnewgeneration.in www.bsslifeskillscollege.in 2 www.onlineeducation.bharatsevaksamaj.net www.bssskillmission.in Data dependence; data independence: With data dependence, data descriptions are included with the application programs that use the data, while with data independence the data descriptions are separated from the application programs. Data warehouse; data mining: A data warehouse is an integrated decision support database, while data mining (described in the topic introduction) is the process of extracting useful information from databases.
    [Show full text]
  • Chapter 2: Database System Concepts and Architecture Define
    Chapter 2: Database System Concepts and Architecture define: data model - set of concepts that can be used to describe the structure of a database data types, relationships and constraints set of basic operations - retrievals and updates specify behavior - set of valid user-defined operations categories: high-level (conceptual data model) - provides concepts the way a user perceives data - entity - real world object or concept to be represented in db - attribute - some property of the entity - relationship - represents and interaction among entities representational (implementation data model) - hide some details of how data is stored, but can be implemented directly - record-based models like relational are representational low-level (physical data model) - provides details of how data is stored - record formats - record orderings - access path (for efficient search) schemas and instances: database schema - description of the data (meta-data) defined at design time each object in schema is a schema construct EX: look at TOY example - top notation represents schema schema constructs: cust ID; order #; etc. database state - the data in the database at any particular time - also called set of instances an instance of data is filled when database is populated/updated EX: cust name is a schema construct; George Grant is an instance of cust name difference between schema and state - at design time, schema is defined and state is the empty state - state changes each time data is inserted or updated, schema remains the same Three-schema architecture
    [Show full text]
  • Data Models for Home Services
    __________________________________________PROCEEDING OF THE 13TH CONFERENCE OF FRUCT ASSOCIATION Data Models for Home Services Vadym Kramar, Markku Korhonen, Yury Sergeev Oulu University of Applied Sciences, School of Engineering Raahe, Finland {vadym.kramar, markku.korhonen, yury.sergeev}@oamk.fi Abstract An ultimate penetration of communication technologies allowing web access has enriched a conception of smart homes with new paradigms of home services. Modern home services range far beyond such notions as Home Automation or Use of Internet. The services expose their ubiquitous nature by being integrated into smart environments, and provisioned through a variety of end-user devices. Computational intelligence require a use of knowledge technologies, and within a given domain, such requirement as a compliance with modern web architecture is essential. This is where Semantic Web technologies excel. A given work presents an overview of important terms, vocabularies, and data models that may be utilised in data and knowledge engineering with respect to home services. Index Terms: Context, Data engineering, Data models, Knowledge engineering, Semantic Web, Smart homes, Ubiquitous computing. I. INTRODUCTION In recent years, a use of Semantic Web technologies to build a giant information space has shown certain benefits. Rapid development of Web 3.0 and a use of its principle in web applications is the best evidence of such benefits. A traditional database design in still and will be widely used in web applications. One of the most important reason for that is a vast number of databases developed over years and used in a variety of applications varying from simple web services to enterprise portals. In accordance to Forrester Research though a growing number of document, or knowledge bases, such as NoSQL is not a hype anymore [1].
    [Show full text]
  • Ontology Transformation: from Requirements to Conceptual Model*
    Justas Trinkunas, Olegas Vasilecas Ontology Transformation: from Requirements to .. SCIeNTIfIC PAPeRS, UNIVeRSITy of Latvia, 2009. Vol. 751 COMPUTER SCIENCE AND Information TECHNOLOGIES 52–64 P. Ontology Transformation: from Requirements to Conceptual Model* Justas Trinkunas1 and Olegas Vasilecas2, 3 1 Information Systems Research Laboratory, Faculty of Fundamental Sciences, Vilnius Gediminas Technical University, Sauletekio al. 11, LT-10223 Vilnius-40, Lithuania, [email protected] 2 Information Systems Research Laboratory, Faculty of Fundamental Sciences, Vilnius Gediminas Technical University, Sauletekio al. 11, LT-10223 Vilnius-40, Lithuania, [email protected] 3 Department of Computer Science, Faculty of Natural Sciences, Klaipeda University, Herkaus Manto 84, LT-92294 Klaipeda, [email protected] Information systems are increasingly complex, especially in the enormous growth of the volume of data, different structures, different technologies, and the evolving requirements of the users. Consequently, current applications require an enormous effort of design and development. The fast-changing requirements are the main problem in creating and/or modifying conceptual data models. To improve this process, we proposed to reuse already existing knowledge for conceptual modelling. In the paper, we have analysed reusable knowledge models. We present our method for creating conceptual models from various knowledge models. Keywords: transformation, ontology, data modelling. 1 Introduction Most of information systems analysts have at least once considered how many times they have to analyse the same problems, create and design the same things. Most of them ask, is it possible to reuse already existing knowledge? Our answer is yes. We believe that knowledge can and should be reused for information systems development.
    [Show full text]
  • Enterprise Data Modeling Using the Entity-Relationship Model
    Database Systems Session 3 – Main Theme Enterprise Data Modeling Using The Entity/Relationship Model Dr. Jean-Claude Franchitti New York University Computer Science Department Courant Institute of Mathematical Sciences Presentation material partially based on textbook slides Fundamentals of Database Systems (6th Edition) by Ramez Elmasri and Shamkant Navathe Slides copyright © 2011 1 Agenda 11 SessionSession OverviewOverview 22 EnterpriseEnterprise DataData ModelingModeling UsingUsing thethe ERER ModelModel 33 UsingUsing thethe EnhancedEnhanced Entity-RelationshipEntity-Relationship ModelModel 44 CaseCase StudyStudy 55 SummarySummary andand ConclusionConclusion 2 Session Agenda Session Overview Enterprise Data Modeling Using the ER Model Using the Extended ER Model Case Study Summary & Conclusion 3 What is the class about? Course description and syllabus: » http://www.nyu.edu/classes/jcf/CSCI-GA.2433-001 » http://cs.nyu.edu/courses/fall11/CSCI-GA.2433-001/ Textbooks: » Fundamentals of Database Systems (6th Edition) Ramez Elmasri and Shamkant Navathe Addition Wesley ISBN-10: 0-1360-8620-9, ISBN-13: 978-0136086208 6th Edition (04/10) 4 Icons / Metaphors Information Common Realization Knowledge/Competency Pattern Governance Alignment Solution Approach 55 Agenda 11 SessionSession OverviewOverview 22 EnterpriseEnterprise DataData ModelingModeling UsingUsing thethe ERER ModelModel 33 UsingUsing thethe EnhancedEnhanced Entity-RelationshipEntity-Relationship ModelModel 44 CaseCase StudyStudy 55 SummarySummary andand ConclusionConclusion
    [Show full text]
  • Mapping Conceptual Models to Database Schemas
    Chapter 4 Mapping Conceptual Models to Database Schemas David W. Embley and Wai Yin Mok 4.1 Introduction The mapping of a conceptual-model instance to a database schema is fun- damentally the same for all conceptual models. A conceptual-model instance describes the relationships and constraints among the various data items. Given the relationships and constraints, the mappings group data items to- gether into flat relational schemas for relational databases and into nested relational schemas for object-based and XML storage structures. Although we are particularly interested in the basic principles behind the mappings, we take the approach of first presenting them in Sections 4.2 and 4.3 in terms of mapping an ER model to a relational database. In these sections, for the ER constructs involved, we (1) provide examples of the constructs and say what they mean, (2) give rules for mapping the constructs to a relational schema and illustrate the rules with examples, and (3) state the underlying principles. We then take these principles and show in Section 4.4 how to apply them to UML. This section on UML also serves as a guide to applying the principles to other conceptual models. In Section 4.5 we ask and answer the question about the circumstances under which the mappings yield normalized relational database schemas. In Section 4.6 we extend the mappings beyond flat relations for relational databases and show how to map conceptual-model instances to object-based and XML storage structures. We provide pointers to additional readings in Section 4.7. Throughout, we use an application in which we assume that we are designing a database for a bed and breakfast service.
    [Show full text]
  • Define Conceptual Schema in Dbms
    Define Conceptual Schema In Dbms Earthshaking Garv hydroplane early while Esteban always tub his liquidations oversimplified sorrily, he theologised so industrially. Devolution or tyrannical, Rinaldo never bottle-feed any stereochromy! Visitorial Barney sometimes politicises his cannibal dreadfully and landscapes so ago! In knowing the same data projects, while control module or schema conceptual. Implementation data model is used at local level. One badge to calculate cardinality constraints. Its functionality is its limit the number and entity occurrences associated in a relationship. Parallel database systems are used in the applications that chapter to query extremely large databases or a have people process an extremely large orchard of transactions per second. SALARY stretch a number. Data compression and data encryption techniques. Dynamic properties, for example, operations or rules defining new database states based on applied state changes. The scheduler is illicit for ensuring that concurrent operations on the database software without conflicting with me another. It also allows any differences in entity names, attributes names, attributes order, data types, and manual on, to pay determined. Thus, by inspecting the articulation set therefore a decomposition hypergraph, one immediately infers that certain MDs hold in on original relation. This is off very strong within which floor have developed our decomposition algorithm. Elementary FDs and multiple elementary MDs have and simple structure appealing to intuition and certain formal properties ACM Transactions on Database Systems. DOB is then date. Guarino is arguing for. We use cookies to hum the best possible experience equip our website. The information about the mapping requests among various schema levels are included in quality system catalog of DBMS.
    [Show full text]
  • Managing Data Models Is Based on a “Where-Used” Model Versus the Traditional “Transformation-Mapping” Model
    Data Model Management Whitemarsh Information Systems Corporation 2008 Althea Lane Bowie, Maryland 20716 Tele: 301-249-1142 Email: [email protected] Web: www.wiscorp.com Data Model Management Table of Contents 1. Objective ..............................................................1 2. Topics Covered .........................................................1 3. Architectures of Data Models ..............................................1 4. Specified Data Model Architecture..........................................3 5. Implemented Data Model Architecture.......................................7 6. Operational Data Model Architecture.......................................10 7. Conclusions...........................................................10 8. References............................................................11 Copyright 2006, Whitemarsh Information Systems Corporation Proprietary Data, All Rights Reserved ii Data Model Management 1. Objective The objective of this Whitemarsh Short Paper is to present an approach to the creation and management of data models that are founded on policy-based concepts that exist across the enterprise. Whitemarsh Short Papers are available from the Whitemarsh website at: http://www.wiscorp.com/short_paper_series.html Whitemarsh’s approach to managing data models is based on a “where-used” model versus the traditional “transformation-mapping” model. This difference brings a very great benefit to data model management. 2. Topics Covered The topics is this paper include: ! Architectures of Data
    [Show full text]
  • Entity Chapter 7: Entity-Relationship Model
    Chapter 7: Entity --Relationship Model These slides are a modified version of the slides provided with the book: Database System Concepts, 6th Ed . ©Silberschatz, Korth and Sudarshan See www.db-book.com for conditions on re-use The original version of the slides is available at: https://www.db-book.com/ Chapter 7: EntityEntity--RelationshipRelationship Model Design Process Modeling Constraints E-R Diagram Design Issues Weak Entity Sets Extended E-R Features Design of the Bank Database Reduction to Relation Schemas Database Design UML Database System Concepts 7.2 ©Silberschatz, Korth and Sudarshan Design Phases Requirements specification and analysis: to characterize and organize fully the data needs of the prospective database users Conceptual design: the designer translates the requirements into a conceptual schema of the database ( Entity-Relationship model, ER schema – these slides topic) A fully developed conceptual schema also indicates the functional requirements of the enterprise. In a “specification of functional requirements”, users describe the kinds of operations (or transactions) that will be performed on the data Logical Design: deciding on the DBMS schema and thus on the data model (relational data model ). Conceptual schema is “mechanically ” translated into a logical schema ( relational schema ) Database design requires finding a “good” collection of relation schemas Physical Design: some decisions on the physical layout of the database Database System Concepts 7.3 ©Silberschatz, Korth and Sudarshan Outline
    [Show full text]
  • Conceptual Analyses and Design Steps
    Mustafa Jarrar: Lecture Notes on Conceptual Analyses and Design Steps. University of Birzeit, Palestine, 2018 Version 4 Conceptual Analyses Conceptual Schema Design Steps (Chapter 3) Mustafa Jarrar Birzeit University [email protected] www.jarrar.info Jarrar © 2018 1 Watch this lecture and download the slides Course Page: http://www.jarrar.info/courses/ORM/Jarrar.LectureNotes.ConceptualAnalyses.pdf Online Courses : http://www.jarrar.info/courses/ Some diagrams in this lecture are based on [1] Jarrar © 2018 2 Mustafa Jarrar: Lecture Notes on Conceptual Analyses and design Steps. University of Birzeit, Palestine, 2018 Conceptual Analyses Conceptual Schema Design Steps § Part 1: Conceptual Analyses Steps § Part 2: Basic ORM Constructs and Syntax § Part 3: Use case (ID Card) § Part 4: Use case (University Programs) Jarrar © 2018 3 Conceptual Analyses Given an application domain, e.g. hospital, and three information modelers, what steps do you suggest them to start with, to build the hospital’s conceptual model? There is no strict or perfect modeling process or procedure! You may start with any step you think suitable, taking into account the complexity of the domain, available resources, modelers’ prior knowledge about the domain, etc. It is recommended that you modularize the domain into sub-domains, build a conceptual schema for each sub-domain , then integrate all sub- schemes into one conceptual schema. The following procedure (7 steps) is to help you model a sub-domain, but you don’t have to strictly follow these steps. Jarrar © 2018 4 Conceptual Schema Design Steps 1. From examples to elementary facts 2. Draw fact types and apply population check 3.
    [Show full text]
  • KDI EER: the Extended ER Model
    KDI EER: The Extended ER Model Fausto Giunchiglia and Mattia Fumagallli University of Trento 0/61 Extended Entity Relationship Model The Extended Entity-Relationship (EER) model is a conceptual (or semantic) data model, capable of describing the data requirements for a new information system in a direct and easy to understand graphical notation. Data requirements for a database are described in terms of a conceptual schema, using the EER model. EER schemata are comparable to UML class diagrams. Actually, what we will be discussing is an extension of Peter Chen’s proposal (hence “extended” ER). 1/61 The Constructs of the EER Model 2/61 Entities These represent classes of objects (facts, things, people,...) that have properties in common and an autonomous existence. City, Department, Employee, Purchase and Sale are examples of entities for a commercial organization. An instance of an entity represents an object in the class represented by the entity. Stockholm, Helsinki, are examples of instances of the entity City, and the employees Peterson and Johanson are examples of instances of the Employee entity. The EER model is very different from the relational model in a number of ways; for example, in EER it is not possible to represent an object without knowing its properties, but in the relational model you need to know its key attributes. 3/61 Example of Entities 4/61 Relationship They represent logical links between two or more entities. Residence is an example of a relationship that can exist between the entities City and Employee; Exam is an example of a relationship that can exist between the entities Student and Course.
    [Show full text]
  • The Case of the COVID-19 Pandemic
    information Article A Conceptual Model for Geo-Online Exploratory Data Visualization: The Case of the COVID-19 Pandemic Anna Bernasconi 1,* and Silvia Grandi 2 1 Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, 20133 Milan, Italy 2 Dipartimento di Scienze Statistiche “P. Fortunati”, Alma Mater Studiorum Università di Bologna, 40126 Bologna, Italy; [email protected] * Correspondence: [email protected] Abstract: Responding to the recent COVID-19 outbreak, several organizations and private citizens considered the opportunity to design and publish online explanatory data visualization tools for the communication of disease data supported by a spatial dimension. They responded to the need of receiving instant information arising from the broad research community, the public health authorities, and the general public. In addition, the growing maturity of information and mapping technologies, as well as of social networks, has greatly supported the diffusion of web-based dashboards and infographics, blending geographical, graphical, and statistical representation approaches. We propose a broad conceptualization of Web visualization tools for geo-spatial information, exceptionally employed to communicate the current pandemic; to this end, we study a significant number of publicly available platforms that track, visualize, and communicate indicators related to COVID-19. Our methodology is based on (i) a preliminary systematization of actors, data types, providers, and visualization tools, and on (ii) the creation of a rich collection of relevant sites clustered according to significant parameters. Ultimately, the contribution of this work includes a critical analysis of collected evidence and an extensive modeling effort of Geo-Online Exploratory Data Visualization (Geo-OEDV) tools, synthesized in terms of an Entity-Relationship schema.
    [Show full text]