SIGMOD/PODS 2003 Final Program San Diego, California June 8-13, 2003

Answer Queries, Rada Chirkova, Chen Li Streams, Abhinandan Das, Johannes Sunday, June 8th Gehrke, Mirek Riedewald 8:30-17:30 The Impact of the Constant Complement DEBS Workshop (Windsor) Approach Towards View Updating, Jens SIGMOD Research Session 3: OLAP PKC50 Workshop (Hampton) Lechtenboerger (Garden Salon 1) MPDS Workshop (Sheffield) Chair: Latha Colby View-Based Query Containment, Diego 19:30-21:30 Calvanese, Giuseppe De Giacomo, Maurizio Best Paper: Spreadsheets in RDBMS for SIGMOD/PODS Welcoming Reception Lenzerini, Moshe Y. Vardi OLAP, Abhinav Gupta, Andy Witkowski, (Garden Salon 1) Gregory Dorman, Srikanth Bellamkonda, The View Selection Problem for XML Tolga Bozkaya, Nathan Folkert, Lei Sheng, Content Based Routing, Ashish Kumar Sankar Subramanian Monday, June 9th Gupta, Alon Y. Halevy, Dan Suciu 8:00-8:45 QC-Trees: An Efficient Summary Structure Continental Breakfast (Atlas Foyer) 19:00-22:00 for Semantic OLAP, Laks Lakshmanan, Jian SIGMOD New DB Faculty Symposium Pei, Yan Zhao 8:45-10:00 (Windsor & Sheffield) PODS Session 1: Invited Talk SIGMOD Industrial Session 1: Web (Hampton & Sheffield) 21:00-23:00 Services (Sheffield) Chair: Tova Milo PODS Business Meeting Chair: Surajit Chaudhuri (Garden Salon 1) E-services: A Look Behind the Curtain, Rick The Future of Web Services - I, Adam Hull Bosworth (BEA Systems) Tuesday, June 10th

10:00-10:15 8:00-8:45 The Future of Web Services - II, Felipe Morning Coffee Break (Atlas Foyer) Continental Breakfast (Atlas Foyer) Cabrera (Microsoft)

10:15-11:15 8:45-9:45 SIGMOD Demos: Group A (Crescent) PODS Session 2: Award Papers SIGMOD Keynote (Regency Ballroom) (Hampton & Sheffield) Chair: Alon Halevy A System for Watermarking Relational Chair: Catriel Beeri Databases, Rakesh Agrawal, Jerry Kiernan The Database Course Qua Poster Child For Best Paper: An Information-Theoretic More Efficient Education, Jeffrey D. Ullman Query by Humming - in Action with its Approach to Normal Forms for Relational () Technology Revealed, Yunyue Zhu, Dennis and XML Data, Marcelo Arenas, Leonid Shasha Libkin 9:45-10:15 Morning Coffee Break (Atlas Foyer) PeerDB: Peering into Personal Databases, Best Newcomer Paper: for Beng Chin Ooi, Kian-Lee Tan, Aoying Zhou, Data Migration with Cloning, Samir Khuller, 10:15-11:15 Chin Hong Goh,Yingguang Li, Chu Yee Yoo-Ah Kim, Yung-Chun (Justin) Wan PODS Session 5: Data Integration Liau, Bo Ling, Wee Siong Ng,Yanfeng Shu, (Garden Salon 2) Xiaoyu Wang, Ming Zhang 11:15-11:30 Chair: Foto Afrati Break GridDB: A Database Interface to the Grid, Computing Full Disjunctions, Yaron Kanza, David T. Liu, Michael J. Franklin 11:30-12:30 Yehoshua Sagiv FCRC Plenary (Grand Ballroom) CMVF: A Novel Dimension Reduction Data Exchange: Getting to the Core, Scheme for Efficient Indexing in A Large Provably Unbreakable Encryption: Theory , Phokion G. Kolaitis, Lucian Image Database, Jialie Shen, Anne H.H. and Implementation, Michael Rabin Popa Ngu, John Shepherd, Du Q. Huynh, Quan (Harvard University) Z. Sheng SIGMOD Research Session 1: XML and 12:30-14:00 Text (Windsor) PLASTIC: Reducing the Cost of Query Lunch (Royal Palm Court) Chair: Divesh Srivastava Optimization through Query Clustering, Vibhuti S. Sengar, Jayant R. Haritsa 14:00-15:30 Querying Structured Text in an XML PODS Session 3: Invited Tutorial Database, Shurug Al-Khalifa, Cong Yu, H. BIRN-M: A Semantic Mediator for Solving (Hampton & Sheffield) V. Jagadish Real-World Neuroscience Problems, Chair: Surajit Chadhuri Amarnath Gupta, Bertram Ludascher, XRANK: Ranked Keyword Search over XML Maryann E. Martone, Arcot Rajasekar, Privacy in Data Systems, Rakesh Agrawal Documents, Lin Guo, Feng Shao, Chavdar Edward Ross, Xufei Qian, Simone Santini, Botev, Jayavel Shanmugasundaram Haiyun He, Ilya Zaslavsky 15:30-16:00 11:15-11:30 Afternoon Coffee Break (Atlas Foyer) SIGMOD Research Session 2: Stream Query Processing I (Hampton) Break 16:00-18:00 Chair: Minos Garofalakis 11:30-12:30 PODS Session 4: Views (Hampton & Sheffield) Distributed Top-K Monitoring, Brian FCRC Plenary (Grand Ballroom) Chair: Marie-Christine Rousset Babcock, Chris Olston Computer Architecture and Technology: Materializing Views with Minimal Size to Approximate Join Processing Over Data Some Thoughts on the Road Ahead, Michael Flynn (Stanford University) Databases (Garden Salon 1) Hongjun Lu, Jeffrey Xu Yu

12:30-14:00 Moderators: Nick Koudas, Divesh Efficient Processing of Joins on Set-valued Lunch (Royal Palm Court) Srivastava Attributes, Nikos Mamoulis Panelists: Daniela Florescu, Hector Garcia- 14:00-15:30 Molina, Tim Griffin, Alon Halevy, Tomasz SIGMOD Research Session 7: Temporal PODS Session 6: Optimization Imielinski, Prabhakar Raghavan Queries (Garden Salon 2) (Hampton 17:00-18:00) Chair: Victor Vianu SIGMOD Demos: Group B (Crescent) Chair: John Cho

Soft Stratification for Magic Set Based STREAM: The Stanford Stream Data Temporal Coalescing with Now, Query Evaluation in Deductive Databases, Manager, Arvind Arasu, Brian Babcock, Granularity, and Incomplete Information, Andreas Behrend Shivnath Babu, Mayur Datar, Keith Ito, Curtis Dyreson Justin Rosenstein, and Jennifer Widom Query Containment and Rewriting Using Query by Humming: a Time Series Views for Regular Path Queries under Aurora: A Data Stream Management Database Approach, Yunyue Zhu, Dennis Constraints, Gosta Grahne, Alex Thomo System, D. Abadi, D. Carney, U. Shasha Cetintemel, M. Cherniack, C. Convey, C. Concise Descriptions of Subsets of Erwin, E. Galvez, M. Hatoun,A. Maskey, A. SIGMOD Research Session 8: Meta- Structured Sets, Alberto O. Mendelzon, Rasin, A. Singer, M. Stonebraker, N. Data Management Ken Q. Pu Tatbul, Y. Xing, R. Yan, S. Zdonik (Windsor 16:00-17:30) Chair: Zack Ives On Producing Join Results Early, Jens-Peter IrisNet: Internet-scale Resource-Intensive Dittrich, Bernhard Seeger, David Scot Sensor Services, Amol Deshpande, Suman Rondo: A Programming Platform for Taylor, Peter Widmayer Nath, Phillip B. Gibbons, Srinivasan Seshan Generic Model Management, Sergey Melnik, Erhard Rahm, Phil Bernstein SIGMOD Research Session 4: Data TelegraphCQ: Continuous Dataflow Security and Protection (Hampton) Processing, The TelegraphCQ Team On Schema Matching with Opaque Column Chair: Sharad Mehrotra Names and Data Values, Jaewoo Kang, Transparent Mid-Tier Database Caching in Jeffrey Naughton Winnowing: Local Algorithms for Document SQL Server, Per-Åke Larson, Jonathan Fingerprinting, Saul Schleimer, Daniel Goldstein Statistical Schema Integration across the Wilkerson, Alex Aiken Deep Web, Bin He, Kevin Chen-Chuan DBCache: Middle-tier Database Caching for Chang Information Sharing Across Private Highly Scalable e-Business Architectures, Databases, Rakesh Agrawal, Alexandre Mehmet Altinel, Christof Bornhövd, Sailesh SIGMOD Research Session 9: Statistics Evfimievski, Ramakrishnan Srikant Krishnamurthy, C. Mohan, Hamid Pirahesh, (Sheffield 16:00-17:00) Berthold Reinwald Chair: Phil Gibbons Rights Protection for Relational Data, Radu Sion, Mikhail Atallah, Sunil Prabhakar IPSOFACTO:A Visual Correlation Tool for Extended Wavelets for Multiple Measures, Aggregate Network Traffic Data, Flip Korn, Antonios Deligiannakis, Nick Roussopoulos SIGMOD Research Session 5: XML S. Muthukrishnan, Yunyue Zhu Indexing and Compression (Windsor) Spectral Bloom Filters, Saar Cohen, Yossi Chair: Hank Korth SOCQET: Semantic OLAP with Compressed Matias Cube and Summarization, Laks V.S. ViST: A Dynamic Index Method for Lakshmanan, Jian Pei, Yan Zhao SIGMOD Industrial Session 3: Querying XML Data by Tree Structures, Database Applications Haixun Wang, Sanghyun Park, Wei Fan, 15:30-16:00 (Sheffield 17:00-18:00) Philip Yu Afternoon Coffee Break (Atlas Foyer) Chair: Michael Carey

XPRESS: A Queriable Compression for XML 16:00-18:00 Integration of Electronic Tickets and Data, Jun-Ki Min, Myung-Jae Park, Chin- PODS Session 7: XML (Garden Salon 2) Personal Guide System for Public Transport Wan Chung Chair: Gerhard Weikum using Mobile Terminals, Koichi Goto (RTRI), Yahiko Kambayashi (Kyoto D(k)-Index: An Adaptive Structural Correlating XML Data Streams Using Tree- University) Summary for Graph-Structured Data, Qun Edit Distance Embeddings, Minos Chen, Andrew Lim, Kian Win Ong Garofalakis, Amit Kumar Gigascope: A Stream Database for Network Applications, Theodore Johnson SIGMOD Industrial Session 2: Server Numerical Document Queries, Anca (AT&T), Chuck Cranor (AT&T), Oliver Technology (Sheffield) Muscholl, Thomas Schwentick, Helmut Spatscheck (AT&T), Vladislav Shkapenyuk Chair: James Hamilton Seidl (CMU)

Oracle RAC: Architecture and Performance, Typing and Querying XML Documents: SIGMOD Tutorial 1 (Garden Salon 1) Angelo Pruscino (Oracle) Some Complexity Bounds, Luc Segoufin Data Quality and Data Cleaning: An Oracle XML DB Repository, Viswanathan The Complexity of XPath Query Evaluation, Overview, Theodore Johnson, Tamraparni Krishnamurthy (Oracle) Georg Gottlob, Christoph Koch, Reinhard Dasu (AT&T Labs Research) Pichler Multi-Dimensional Clustering: A New Data SIGMOD Demos: Group C (Crescent) Layout Scheme in DB2, Sriram SIGMOD Research Session 6: Join Padmanabhan, Bishwaranjan Algorithms (Hampton 16:00-17:00) QXtract: A Building Block for Efficient Bhattacharjee, Timothy Malkemus, Leslie Chair: Alfons Kemper Information Extraction from Plain-Text Cranston, Matthew Huras (IBM) Databases, Eugene Agichtein, Luis Gravano Containment Join Size Estimation: Models SIGMOD Panel 1: Querying Network and Methods, Wei Wang, Haifeng Jiang, PIX: Exact and Approximate Phrase Matching in XML, Sihem Amer-Yahia, Mary Chair: Mariano Consens Chair: Phil Bernstein Fernandez, Divesh Srivastava, Yu Xu Capturing both Types and Constraints in Mapping Data in Peer-to-Peer Systems: LockX: A System for Efficiently Querying Data Integration, Wenfei Fan, Michael Semantics and Algorithmic Issues, Secure XML, SungRan Cho, Sihem Amer- Benedikt, Chee-Yong Chan, Juliana Freire, Anastasios Kementsietsidis, Marcelo Yahia, Laks V.S. Lakshmanan, Divesh Rajeev Rastogi Arenas, Renée J Miller Srivastava Exchanging Intensional XML Data, Tova Extracting Structured Data from Web TREX: DTD-Conforming XML to XML Milo, Serge Abiteboul, Bernd Amann, Omar Pages, Arvind Arasu, Hector Garcia-Molina Transformations, Aoying Zhou, Qing Wang, Benjelloun, Frederic Dang Ngoc Zhimao Guo, Xueqing Gong, Shihui Zheng, Scientific Data Repositories: Designing for Hongwei Wu, Jianchang Xiao, Kun SIGMOD Research Session 12: a Moving Target, Etzard Stolte, Gustavo Yue,Wenfei Fan Similarity Queries I (Hampton) Alonso, Christoph Praun, Thomas Gross Chair: Hans-Peter Kriegel Rainbow: Multi-XQuery Optimization Using SIGMOD Research Session 14: Query Materialized XML Views, Xin Zhang, Efficient Similarity Search and Processing (Hampton) Bradford Pielech, Luping Ding, Brian Classification via Rank Aggregation, Ronald Chair: Ken Ross Murphy, Ling Wang, Katica Dimitrova, Fagin, Ravi Kumar, D Sivakumar Maged El Sayed, Elke A. Rundensteiner Factorizing Complex Predicates in Queries Robust and Efficient Fuzzy Match for Online to Exploit Indexes, Surajit Chaudhuri, TIMBER: A Native System for Querying Data Cleaning, Surajit Chaudhuri, Kris Prasanna Ganesan, Sunita Sarawagi XML, Stelios Paparizos, Shurug Al-Khalifa, Ganjam, Venkatesh Ganti, Adriane Chapman, H. V. Jagadish, Laks V. Estimating Compilation Time of a Query S. Lakshmanan, Andrew Nierman, Jignesh SIGMOD Tutorial 2 (Part 1) Optimizer, Ihab Ilyas, Jun Rao, Guy M. Patel, Divesh Srivastava, Nuwee (Garden Salon 1) Lohman, Dengfeng Gao, Eileen Lin Wiwatwattana, Yuqing Wu, Cong Yu XQuery: A Query Language for XML, Don A Characterization of the Sensitivity of ROLEX: Relational On-Line Exchange with Chamberlin (IBM Almaden Research Query Optimization to Storage Access Cost XML, Philip Bohannon, Xin (Luna) Dong, Center) Parameters, Frederick Reiss, Tapas Sumit Ganguly, Henry F. Korth, Chengkai Kanungo Li, P.P.S. Narayan, Pradeep Shenoy SIGMOD Demos: Group C (Crescent) SIGMOD Industrial Session 4: 19:30-21:30 See Monday’s Listing Information Retrieval & Data SIGMOD/PODS Reception Visualization (Sheffield) (Terrace Salon 1-3) 9:45-10:15 Chair: Luis Gravano Sponsored by: Morning Coffee Break (Atlas Foyer) San Diego Supercomputer Center in a Box - Building the Google 10:15-11:15 Search Appliance, Narayanan Shivakumar

th SIGMOD Awards and Business Meeting (Google) Wednesday, June 11 (Regency Ballroom) 8:00-8:45 Extracting and Exploiting Structure in Text Continental Breakfast (Atlas Foyer) 11:15-11:30 Search, Prabhakar Raghavan (Verity) Break 8:45-9:45 Visionary: A Next Generation Visualization PODS Session 8: Security and Privacy 11:30-12:30 System for Databases, Michael (Garden Salon 2 8:45-10:00) FCRC Plenary (Grand Ballroom) Stonebraker (MIT and Rocket Software) Chair: Dan Suciu Issues in Peer-to-Peer Computing, Barbara SIGMOD Tutorial 2 (Part 2) Query-preserving Watermarking of Liskov (M.I.T.) (Garden Salon 1) Relational Databases and XML Documents, David Gross-Amblard 12:30-14:00 XQuery: A Query Language for XML, Don Lunch (Royal Palm Court) Chamberlin (IBM Almaden Research Revealing Information while Preserving Center) Privacy, , 14:00-15:30 PODS Session 9: Streams and SIGMOD Demos: Group B (Crescent) Limiting Privacy Breaches in Privacy Indexing (Garden Salon 2) Preserving Data Mining, Alexandre Chair: Nick Koudas See Monday’s Listing Evfimievski, Johannes Gehrke, Ramakrishnan Srikant Maintaining Time-Decaying Stream 15:30-16:00 Aggregates, Edith Cohen, Martin Strauss Afternoon Coffee Break (Atlas Foyer) SIGMOD Research Session 10: Stream Query Processing II (Sheffield) Maintaining Variance and k--Medians over 16:00-18:00 Chair: Data Stream Windows, Brian Babcock, PODS Session 10: Integration and Mayur Datar, Rajeev Motwani, Liadan Mining (Garden Salon 2) Chain: Operator Scheduling for Memory O`Callaghan Chair: Jan Van Den Bussche Minimization in Data Stream Systems, Brian Babcock, Shivnath Babu, Mayur Optimal Indexing Using Near-Minimal On the Decidability and Complexity of Datar, Rajeev Motwani Space, C. Heeren, H. V. Jagadish, L. Pitt Query Answering over Inconsistent and Incomplete Databases, Andrea Calì, Processing Set Expressions over On Nearest Neighbor Indexing of Non- Domenico Lembo, Riccardo Rosati Continuous Update Streams, Sumit Linear Trajectories, Charu Aggarwal, Ganguly, Minos Garofalakis, Rajeev Rastogi Dakshi Agrawal How to Quickly Find a Witness, Daniel Kifer, Johannes Gehrke, Cristian Bucila, SIGMOD Research Session 11: Data SIGMOD Research Session 13: Data Walker White Integration and Sharing I (Windsor) Integration and Sharing II (Windsor) Feasible Itemset Distributions in Data The Design of an Acquisitional Query Qcluster: Relevance Feedback Using Mining: Theory and Application, Ganesh Processor For Sensor Networks, Samuel Adaptive Clustering for Content-Based Ramesh, William A. Maniatty, Mohammed Madden, Michael Franklin, Joseph Image Retrieval, Deok-Hwan Kim, Chin- J. Zaki Hellerstein, Wei Hong Wan Chung

What’s Hot and What’s Not: Tracking Most Cache-and-Query for Wide Area Sensor SIGMOD Research Session 23: XML Frequent Items Dynamically, Graham Databases, Amol Deshpande, Suman Nath, Query Processing II (Garden Salon 2) Cormode, S. Muthukrishnan Phillip Gibbons, Srinivasan Seshan Chair: Jai Shanmugasundaram

SIGMOD Research Session 15: Formal SIGMOD Research Session 19: XML On Relational Support for XML Publishing: Foundations (Windsor 16:00-17:00) Query Processing I (Hampton) Beyond Sorting and Tagging, Surajit Chair: Wenfei Fan Chair: Jignesh Patel Chaudhuri, Raghav Kaushik, Jeffrey Naughton A Theory of Redo Recovery, David Lomet, Composing XSL Transformations with XML Mark Tuttle Publishing Views, Chengkai Li, Philip A Comprehensive XQuery to SQL Bohannon, Hank Korth, PPS Narayan Translation using Dynamic Interval Formal Semantics and Analysis of Object Encoding, David DeHaan, David Toman, Queries, Gavin Bierman Dynamic XML Documents with Distribution Mariano Consens, M. Tamer Ozsu and Replication, Serge Abiteboul, Angela SIGMOD Research Session 16: Bonifati, Gregory Cobena, Ioana SIGMOD Industrial Session 6: CRM and Streaming XML (Windsor 17:00-18:00) Manolescu, Tova Milo Query Optimization (Sheffield) Chair: Frank Tompa Chair: Michael Stonebraker SIGMOD Research Session 20: Stream Processing of XPath Queries with Approximate Querying Data Management Challenges in CRM, Predicates, Ashish Gupta, Dan Suciu (Garden Salon 1) George Colliat (Siebel Systems) Chair: Jiawei Han XPath Queries on Streaming Data, Feng WinMagic: Subquery Elimination Using Peng, Sudarshan Chawathe Dynamic Sample Selection for Approximate Window Aggregation, Calisto Zuzarte, Query Processing, Brian Babcock, Surajit Wenbin Ma, Hamid Pirahesh, Qi Cheng, SIGMOD Research Session 17: Spatial Chaudhuri, Gautam Das Linqi Liu, Kwai Wong (IBM) and Nearest Neighbor Queries (Hampton) Evaluating Probabilistic Queries over SIGMOD Demos: Potpourri (Crescent) Chair: Bongki Moon Imprecise Data, Reynold Cheng, Dmitri V. Kalashnikov, Sunil Prabhakar See Monday’s Listing Location-based Spatial Queries, Dimitris Papadias, Jun Zhang, Manli Zhu, Yufei Tao, SIGMOD Industrial Session 5: 11:15-11:30 Dik Lee Subscription Services (Sheffield) End Of SIGMOD Chair: Raghu Ramakrishnan Hardware Acceleration for Spatial 11:30-12:30 Selections and Joins, Chengyu Sun, Building Notification Services with FCRC Plenary (Town & Country) Divyakant Agrawal, Amr El Abbadi Microsoft SQLServer, Praveen Seshadri (Microsoft) Building a Web Warehouse, Hector Garcia- An Optimal and Progressive for Molina (Stanford University) Skyline Queries, Dimitris Papadias, Yufei NonStop SQL/MX Publish/Subscribe: Tao, Greg Fu, Bernhard Seeger Continuous Data Streams in Transaction 14:00-18:00 Processing, Hansjorg Zeller (Hewlett WebDB Workshop (Windsor) Contorting High Dimensional Data for Packard)

Efficient Main Memory Processing, Bin Cui, th Beng Chin Ooi, Kian-Lee Tan, Jianwen Su SIGMOD Demos: Potpourri (Crescent) Friday, June 13 8:30-15:30 SIGMOD Panel 2: Report on the Fourth See Monday’s Listing WebDB Workshop (Windsor) "Where Should We Be Going" Meeting (Sheffield) 9:45-10:15 8:30-17:30 Morning Coffee Break (Atlas Foyer) DMKD Workshop (Sheffield) Moderators: Michael Stonebraker, Jim Gray, Hans Schek, Jeff Ullman 10:15-11:15 Acknowledgments SIGMOD Research Session 21: SIGMOD Demos: Group A (Crescent) Monitoring Data Streams (Hampton) The SIGMOD/PODS 2003 Conference Chair: Anastassia Ailamaki gratefully acknowledges the contributions See Monday’s Listing of the following institutions. Adaptive Filters for Continuous Queries 19:00-23:00 over Distributed Data Streams, Chris Sponsors SIGMOD/PODS Banquet Olston, Jing Jiang, Jennifer Widom (Town & Country) A Framework for Change Diagnosis of Data th Streams, Charu Aggarwal Thursday, June 12

Contributors 8:00-8:45 SIGMOD Research Session 22: Continental Breakfast (Atlas Foyer) Similarity Queries II (Windsor)

Chair: Jonathan Goldstein 8:45-9:45 SIGMOD Research Session 18: Sensor Using Sets of Feature Vectors for Similarity Databases (Windsor) Search on Voxelized CAD Objects, Hans Chair: Guy Lohman Peter Kriegel, Peer Kröger, Martin Pfeifle, Matthias Schubert, Stefan Brecheisen