Database Repairing and Consistent Query Answering

BERTOSSI SeriesSeriesSeries ISSN: ISSN:ISSN: 2153-5418 2153-54182153-5418 BERTOSSI BERTOSSI MMM MorganMorganMorgan & & & ClaypoolClaypoolClaypool PublishersPublishersPublishers SSSYNTHESISYNTHESISYNTHESIS L LLECTURESECTURESECTURES ON ONON D DDATAATAATA M MMANAGEMENTANAGEMENTANAGEMENT &&&CCC SeriesSeriesSeries Editor: Editor:Editor: M. M.M. Tamer TamerTamer Özsu, Özsu,Özsu, University UniversityUniversity of ofof Waterloo WaterlooWaterloo DATABASE REPAIRING AND CONSISTENT QUERYDATABASE ANSWERING DATABASE REPAIRING AND CONSISTENT QUERYDATABASE ANSWERING REPAIRING AND CONSISTENT QUERYDATABASE ANSWERING DatabaseDatabaseDatabase Repairing RepairingRepairing and andand Consistent ConsistentConsistent DatabaseDatabaseDatabase RepairingRepairingRepairing QueryQueryQuery Answering AnsweringAnswering LeopoldoLeopoldoLeopoldo Bertossi, Bertossi,Bertossi, Carleton CarletonCarleton University, University,University, Ottowa, Ottowa,Ottowa, Canada CanadaCanada andandand ConsistentConsistentConsistent QueryQueryQuery IntegrityIntegrityIntegrity constraints constraintsconstraints are areare semantic semanticsemantic conditions conditionsconditions that thatthat a a adatabase databasedatabase should shouldshould satisfy satisfysatisfy in inin order orderorder to toto be bebe an anan appropriate appropriateappropriate model modelmodel ofofof external externalexternal reality. reality.reality. In InIn practice, practice,practice, and andand for forfor many manymany reasons, reasons,reasons, a a adatabase databasedatabase may maymay not notnot satisfy satisfysatisfy those thosethose integrity integrityintegrity constraints, constraints,constraints, AnsweringAnsweringAnswering andandand for forfor that thatthat reason reasonreason it itit is isis said saidsaid to toto be bebe inconsistent inconsistentinconsistent. .However,. However,However, and andand most mostmost likely, likely,likely, a a alarge largelarge portion portionportion of ofof the thethe database databasedatabase is isis still stillstill semanticallysemanticallysemantically correct, correct,correct, in inin a a asense sensesense that thatthat has hashas to toto be bebe made mademade precise preciseprecise. .After. AfterAfter having havinghaving provided providedprovided a a aformal formalformal characterization characterizationcharacterization ofofof consistent consistentconsistent data datadata in inin an anan inconsistent inconsistentinconsistent database, database,database, the thethe natural naturalnatural problem problemproblem emerges emergesemerges of ofof extracting extractingextracting that thatthat semantically semanticallysemantically correctcorrectcorrect data, data,data, as asas query queryquery answers. answers.answers. TheTheThe consistent consistentconsistent data datadata in inin an anan inconsistent inconsistentinconsistent database databasedatabase is isis usually usuallyusually characterized characterizedcharacterized as asas the thethe data datadata that thatthat persists persistspersists across acrossacross all allall thethethe database databasedatabase instances instancesinstances that thatthat are areare consistent consistentconsistent and andand minimally minimallyminimally differ differdiffer from fromfrom the thethe inconsistent inconsistentinconsistent instance. instance.instance. Those ThoseThose are areare the thethe so-calledso-calledso-called repairs repairsrepairs of ofof the thethe database. database.database. In InIn particular, particular,particular, the thethe consistent consistentconsistent answers answersanswers to toto a a aquery queryquery posed posedposed to toto the thethe inconsistent inconsistentinconsistent databasedatabasedatabase are areare those thosethose answers answersanswers that thatthat can cancan be bebe simultaneously simultaneouslysimultaneously obtained obtainedobtained from fromfrom all allall the thethe database databasedatabase repairs. repairs.repairs. LeopoldoLeopoldoLeopoldo Bertossi BertossiBertossi AsAsAs expected, expected,expected, the thethe notion notionnotion of ofof repair repairrepair requires requiresrequires an anan adequate adequateadequate notion notionnotion of ofof distance distancedistance that thatthat allows allowsallows for forfor the thethe comparison comparisoncomparison ofofof databases databasesdatabases with withwith respect respectrespect to toto how howhow much muchmuch they theythey differ differdiffer from fromfrom the thethe inconsistent inconsistentinconsistent instancse. instancse.instancse. On OnOn this thisthis basis, basis,basis, the thethe minimality minimalityminimality conditionconditioncondition on onon repairs repairsrepairs can cancan be bebe properly properlyproperly formulated. formulated.formulated. InInIn this thisthis monograph monographmonograph we wewe present presentpresent and andand discuss discussdiscuss these thesethese fundamental fundamentalfundamental concepts, concepts,concepts, different differentdifferent repair repairrepair semantics, semantics,semantics, algorithms algorithmsalgorithms forforfor computing computingcomputing consistent consistentconsistent answers answersanswers to toto queries, queries,queries, and andand also alsoalso complexity-theoretic complexity-theoreticcomplexity-theoretic results resultsresults related relatedrelated to toto the thethe computation computationcomputation ofofof repairs repairsrepairs and andand doing doingdoing consistent consistentconsistent query queryquery answering. answering.answering. AboutAboutAbout SYNTHESIs SYNTHESIsSYNTHESIs MORGAN ThisThisThis volume volumevolume is isis a a aprinted printedprinted version versionversion of ofof a a awork workwork that thatthat appears appearsappears in inin the thethe Synthesis SynthesisSynthesis MORGANMORGAN DigitalDigitalDigital Library LibraryLibrary of ofof Engineering EngineeringEngineering and andand Computer ComputerComputer Science. Science.Science. Synthesis SynthesisSynthesis Lectures LecturesLectures provideprovideprovide concise, concise,concise, original originaloriginal presentations presentationspresentations of ofof important importantimportant research researchresearch and andand development developmentdevelopment topics,topics,topics, published publishedpublished quickly, quickly,quickly, in inin digital digitaldigital and andand print printprint formats. formats.formats. For ForFor more moremore information informationinformation & visitvisitvisit www.morganclaypool.com www.morganclaypool.comwww.morganclaypool.com & & CLAYPOOL CLAYPOOLCLAYPOOL ISBN:ISBN:ISBN: 978-1-60845-762-5978-1-60845-762-5978-1-60845-762-5 MorganMorganMorgan & & & Claypool ClaypoolClaypool Publishers PublishersPublishers 999000000000000 SSSYNTHESISYNTHESISYNTHESIS L LLECTURESECTURESECTURES ON ONON D DDATAATAATA M MMANAGEMENTANAGEMENTANAGEMENT www.morganclaypool.comwww.morganclaypool.comwww.morganclaypool.com 999777888111666000888444555777666222555 M.M.M. Tamer TamerTamer Özsu, Özsu,Özsu, Series SeriesSeries Editor EditorEditor Database Repairing and Consistent Query Answering Synthesis Lectures on Data Management Editor M.Tamer Özsu, University of Waterloo Synthesis Lectures on Data Management is edited by Tamer Özsu of the University of Waterloo. The series will publish 50- to 125-page publications on topics pertaining to data management. The scope will largely follow the purview of premier information and computer science conferences, such as ACM SIGMOD, VLDB, ICDE, PODS, ICDT, and ACM KDD. Potential topics include, but not are limited to: query languages, database system architectures, transaction management, data warehousing, XML and databases, data stream systems, wide scale data distribution, multimedia data management, data mining, and related subjects. Database Repairing and Consistent Query Answering Leopoldo Bertossi 2011 Managing Event Information: Modeling, Retrieval, and Applications Amarnath Gupta and Ramesh Jain 2011 Fundamentals of Physical Design and Query Compilation David Toman and Grant Weddell 2011 Methods for Mining and Summarizing Text Conversations Giuseppe Carenini, Gabriel Murray, and Raymond Ng 2011 Probabilistic Databases Dan Suciu, Dan Olteanu, Christopher Ré, and Christoph Koch 2011 Peer-to-Peer Data Management Karl Aberer 2011 iv Probabilistic Ranking Techniques in Relational Databases Ihab F. Ilyas and Mohamed A. Soliman 2011 Uncertain Schema Matching Avigdor Gal 2011 Fundamentals of Object Databases: Object-Oriented and Object-Relational Design Suzanne W. Dietrich and Susan D. Urban 2010 Advanced Metasearch Engine Technology Weiyi Meng and Clement T. Yu 2010 Web Page Recommendation Models: Theory and Algorithms Sule Gündüz-Ögüdücü 2010 Multidimensional Databases and Data Warehousing Christian S. Jensen, Torben Bach Pedersen, and Christian Thomsen 2010 Database Replication Bettina Kemme, Ricardo Jimenez Peris, and Marta Patino-Martinez 2010 Relational and XML Data Exchange Marcelo Arenas, Pablo Barcelo, Leonid Libkin, and Filip Murlak 2010 User-Centered Data Management Tiziana Catarci, Alan Dix, Stephen Kimani, and Giuseppe Santucci 2010 Data Stream Management Lukasz Golab and M. Tamer Özsu 2010 Access Control in Data Management Systems Elena Ferrari 2010 v An Introduction to Duplicate Detection Felix Naumann and Melanie Herschel 2010 Privacy-Preserving Data Publishing: An Overview Raymond Chi-Wing Wong and Ada Wai-Chee Fu 2010 Keyword Search in Databases Jeffrey Xu Yu, Lu Qin, and Lijun Chang 2009 Copyright © 2011 by Morgan & Claypool All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted

Database Repairing and Consistent Query Answering

SAQE: Practical Privacy-Preserving Approximate Query Processing for Data Federations

WIN: an Efficient Data Placement Strategy for Parallel XML Databases

Tepzz¥ 67¥¥Za T

Conjunctive Query Answering in the Description Logic EL Using a Relational Database System

Query Rewriting Using Monolingual Statistical Machine Translation

On Using an Automatic, Autonomous and Non-Intrusive Approach for Rewriting SQL Queries

Towards Practical Privacy-Preserving Data Analytics by Noah Michael

Data Management 09 Transaction Processing

Distributed Complex Event Processing with Query Rewriting

Probabilistic Query Rewriting for Efficient and Effective Keyword

Agent Based System for Realtime Database Audit Monitoring

Negative Database for Data Security