Conference Program
Total Page:16
File Type:pdf, Size:1020Kb
The Third International Conference on Knowledge Discovery and Data Mining August 14–17 1997, Newport Beach, California Sponsored by the American Association for Artificial Intelligence Cosponsored by General Motors Research, The National Institute for Statistical Sciences, and NCR Corporation In cooperation with The American Statistical Association Welcome to KDD-97! KDD-97 Organization General Conference Chair William Eddy, Carnegie Mellon Sally Morton, Rand Corporation University Richard Muntz, University of California Ramasamy Uthurusamy, General Motors Charles Elkan, University of California at at Los Angeles Corporation San Diego Raymond Ng, University of British Usama Fayyad, Microsoft Research Columbia Program Cochairs Ronen Feldman, Bar-Ilan University, Steve Omohundro, NEC Research David Heckerman, Microsoft Research Israel Gregory Piatetsky-Shapiro, Heikki Mannila, University of Helsinki, Jerry Friedman, Stanford University GTE Laboratories Finland Dan Geiger, Technion, Israel Daryl Pregibon, Bell Laboratories Daryl Pregibon, AT&T Laboratories Clark Glymour, Carnegie Mellon Raghu Ramakrishnan, University of University Wisconsin, Madison Publicity Chair Moises Goldszmidt, SRI International Patricia Riddle, Boeing Computer Georges Grinstein, University of Services Paul Stolorz, Jet Propulsion Laboratory Massachusetts, Lowell Ted Senator, NASD Regulation Inc. Tutorial Chair Jiawei Han, Simon Fraser University, Jude Shavlik, University of Wisconsin at Canada Madison Padhraic Smyth, University of California, David Hand, Open University, UK Wei-Min Shen, University of Southern Irvine Trevor Hastie, Stanford University California David Heckerman, Microsoft Corpora- Arno Siebes, CWI, Netherlands Demo and tion Avi Silberschatz, Bell Laboratories Poster Sessions Chair Haym Hirsh, Rutgers University Evangelos Simoudis, IBM Almaden Jim Hodges, University of Minnesota Research Center Tej Anand, NCR Corporation Se June Hong, IBM T .J. Watson Research Andrzej Skowron, University of Warsaw Awards Chair Center Padhraic Smyth, University of California Tomasz Imielinski, Rutgers University at Irvine Gregory Piatetsky-Shapiro, Yannis Ioannidis, University of Wisconsin Ramakrishnan Srikant, IBM Almaden GTE Laboratories Lawrence D. Jackel, AT&T Laboratories Research Center David Jensen, University of Massachusetts John Stasko, Georgia Institute of Technol- Program Committee Members Michael Jordan, Massachusetts Institute ogy Helena Ahonen, University of Helsinki of Technology Salvatore J. Stolfo, Columbia University Tej Anand, NCR Daniel Keim, University of Munich Paul Stolorz, Jet Propulsion Laboratory Ron Brachman, AT&T Laboratories Willi Kloesgen, GMD, Germany Hanna Toivonen, University of Helsinki Carla Brodley, Purdue University Ronny Kohavi, Silicon Graphics Alexander Tuzhilin, New York University, Dan Carr, George Mason University David D. Lewis, AT&T Laboratories – Stern School Peter Cheeseman, Research Ramasamy Uthurusamy, General Motors NASA AMES Research Center David Madigan, University of R&D Center David Cheung, University of Hong Kong Washington Graham Wills, Bell Laboratories Wesley Chu, University of California at Heikki Mannila, University of Helsinki David Wolpert, IBM Almaden Research Los Angeles Brij Masand, GTE Laboratories Center Gregory Cooper, University of Pittsburgh Gary McDonald, General Motors Jan Zytkow, Wichita State University Robert Cowell, City University, UK Research Bruce Croft, University of Massachusetts Eric Mjolsness, University of California at Amherst at San Diego 2 KDD–97 Thursday, August 14th 8:00AM–6:00 PM 1:30–3:30 PM 4:00–6:00 PM Tutorials Tutorial 4 Tutorial 7 Exploratory Data Analysis Using Statistical Models for Categorical 8:00–10:00 AM (SINGLE SESSION) Interactive Dynamic Graphics Response Data Tutorial 1 Deborah Swayne, Bell Communications Re- William DuMouchel, AT&T Research Data Mining and KDD: search and Diane Cook, Iowa State University An Overview 6:30–7:30 PM Usama Fayyad, Microsoft Research and Opening Reception Evangelos Simoudis, IBM 1:30–3:30 PM Tutorial 5 California Ballroom, Newport Beach Mar- 10:00–10:30 AM OLAP and Data Warehousing riott Hotel and Tennis Club Coffee Break Surajit Chaudhuri, Microsoft Research and Umesh Dayal, Hewlett Packard Laboratories 10:30 AM–12:30 PM Tutorial 2 3:30–4:00 PM Modeling Data and Discovering Coffee Break Knowledge David Hand, Open University, UK 4:00–6:00 PM Tutorial 6 10:30 AM–12:30 PM Visual Techniques for Exploring Tutorial 3 Databases Text Mining - Theory and Practice Daniel Keim, University of Munich, Ger- Ronen Feldman, Bar-Ilan University, many Israel 12:30–1:30 PM Lunch KDD-97 Tutorial Program All tutorials will be presented on Thursday, August 14, 1997 in sections of the Pacific Ballroom. Admission to the tutorials is included in your conference registration fee. Registrants can attend up to four consecutive tutorials, including four tutorial syllabi. TIME SESSION 1 SESSION 2 8:00 to 10:00 AM T1 – Fayyad and Simoudis (single session) 10:30 AM to 12:30 PM T2 – Hand T3 – Feldman 1:30 to 3:30 PM T4 – Swayne and Cook T5 – Chaudhuri and Dayal 4:00 to 6:00 PM T6 – Keim T7 – DuMouchel KDD–97 3 Friday, August 15th All sessions are plenary sessions and will be 10:30–12:15 PM 11:15 AM–12:15 PM held in the Pacific Ballroom, Newport Beach Keynote Address and Poster Previews Session Marriott Hotel and Tennis Club Poster Previews Session (Authors A-J) 12:15–2:00 PM 8:30–10:00 AM 10:30–11:15 AM Keynote Address and Keynote Address: Searching Lunch for Causes and Predicting Invited Talk Session Interventions (Program Committee Luncheon, New- Clark Glymour, University of California port North Room, lobby level, Newport 8:30 –8:45 AM San Diego and Carnegie Mellon University Beach Marriott Hotel and Tennis Club) Welcome and Introduction 2:00–3:00 PM KDD-97 General Conference Chair: The problem in many data mining tasks Poster Previews Session Ramasamy Uthurusamy and KDD-97 Pro- is to predict features of a new, unob- gram Cochairs: David Heckerman, Heikki (Authors K-Z) Mannila, and Daryl Pregibon served sample from a recorded sample, assuming the sampling distribution 3:00–7:00 PM does not change. In causal data mining Exhibits, Demos, Poster Sessions 8:45–9:30 AM the task is to predict the features of new Keynote Address: From Large to unobserved samples from a recorded California Ballroom, Newport Beach Huge: A Statistician’s Reactions to sample, assuming any unobserved sam- Marriott Hotel and Tennis Club KDD & DM ple results from an intervention which 3:00 –4:20 PM Peter J. Huber, University of Bayreuth, Ger- directly alters the underlying probabili- many Poster Session 1 ty distribution of some variables in a known way, and alters others indirectly Authors A-G The statistics and AI communities are through the influences of the directly confronted by the same challenge—the 4:20 –5:40 PM manipulated variables. Data mining in onslaught of ever larger data collec- the sciences and for business, military Poster Session 2 tions—but the two communities have or public policy is often concerned with Authors H-O reacted independently and differently. discovering causal structure, although What could they learn from each other that focus is sometimes obscured by the 5:40 –7:00 PM if they looked over the fence? What is belief that causation is especially myste- Poster Session 3 amiss on either side? rious. Causation is not probability, but Authors P-Z 9:30–10:00 AM it is no more (and no less) mysterious. (Coffee and soft drinks will be available Invited Talk: Machine Learning and This lecture will illustrate why data in the foyer from 3:00–7:00 PM.) KDD: New Developments mining for causes using techniques de- Thomas Dietterich, University of Oregon signed for recognition or classification is unwise, and will illustrate principles Machine learning research is pursuing of causal data mining with several in- many directions relevant to KDD. This teresting cases that use both Bayesian talk will review two exciting lines of re- and constraint based methods. search: learning with ensembles (com- mittees, bagging, boosting, etc.) and learning with stochastic models. I will also briefly mention current research in reinforcement learning and methods for scaling up machine learning algo- rithms. 10:00–10:30 AM Coffee Break 4 KDD–97 Saturday, August 16th 8:30–9:45 AM 11:00–11:30 AM Propulsion Laboratory, California Institute Paper Session Invited Talk: Highlights from the of Technology; Andrew Fraser, Portland State Database Methodology Joint Statistical Meetings University Jim Hodges, University of Minnesota 8:30–8:55 AM 2:30–3:00 PM Computing Optimized Rectilinear Statisticians and data-miners have a lot in Automated Discovery of Active Regions for Association Rules common but their paths tend not to cross. Motifs in Three Dimensional Kunikazu Yoda, Takeshi Fukuda, Yasuhiko Thus, for example, many statisticians are Molecules missing out on computing and other ideas Morimoto, Shinichi Morishita, and Takeshi Xiong Wang and Jason T.L. Wang, New Jer- familiar to data-miners, and data-miners Tokuyama, IBM Tokyo Research Laboratory, sey Institute of Technology; Dennis Shasha, sometimes reinvent things that are well- Japan New York University; Bruce Shapiro, Nation- known to statisticians. This talk is intended al Cancer Institute; Sitaram Dikshitulu, New to toss a small bridge across the gap, by Jersey Institute of Technology; Isidore Rigout- 8:55–9:20 AM summarizing some talks and trends from sos, IBM T. J. Watson Research Center; Density-Connected Sets and their the 1997