Lecture Notes in Artificial Intelligence 6913

Subseries of Lecture Notes in Computer Science

LNAI Series Editors Randy Goebel University of Alberta, Edmonton, Canada Yuzuru Tanaka Hokkaido University, Sapporo, Japan Wolfgang Wahlster DFKI and Saarland University, Saarbrücken,

LNAI Founding Series Editor Joerg Siekmann DFKI and Saarland University, Saarbrücken, Germany Dimitrios Gunopulos Thomas Hofmann Donato Malerba Michalis Vazirgiannis (Eds.)

Machine Learning and Knowledge Discovery in Databases

European Conference, ECML PKDD 2011 Athens, , September 5-9, 2011 Proceedings, Part III

13 Series Editors

Randy Goebel, University of Alberta, Edmonton, Canada Jörg Siekmann, University of Saarland, Saarbrücken, Germany Wolfgang Wahlster, DFKI and University of Saarland, Saarbrücken, Germany

Volume Editors

Dimitrios Gunopulos University of Athens, Greece E-mail: [email protected] Thomas Hofmann Google Switzerland GmbH, Zurich, Switzerland E-mail: [email protected] Donato Malerba University of Bari “Aldo Moro”, Bari, E-mail: [email protected] Michalis Vazirgiannis Athens University of Economics and Business, Greece E-mail: [email protected]

ISSN 0302-9743 e-ISSN 1611-3349 ISBN 978-3-642-23807-9 e-ISBN 978-3-642-23808-6 DOI 10.1007/978-3-642-23808-6 Springer Heidelberg Dordrecht London New York

Library of Congress Control Number: Applied for

CR Subject Classification (1998): I.2, H.2.8, H.2, H.3, G.3, J.1, I.7, F.2.2, F.4.1

LNCS Sublibrary: SL 7 Ð Artificial Intelligence

© Springer-Verlag Berlin Heidelberg 2011 This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer. Violations are liable to prosecution under the German Copyright Law. The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. Typesetting: Camera-ready by author, data conversion by Scientific Publishing Services, Chennai, India Printed on acid-free paper Springer is part of Springer Science+Business Media (www.springer.com) Welcome to ECML PKDD 2011

Welcome to the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2011) held in Athens, Greece, during September 5–9, 2011. ECML PKDD is an annual confer- ence that provides an international forum for the discussion of the latest high- quality research results in all areas related to machine learning and knowledge discovery in databases as well as other innovative application domains. Over the years it has evolved as one of the largest and most selective international conferences in machine learning and data mining, the only one that provides a common forum for these two closely related fields. ECML PKDD 2011 included all the scientific events and activities of big con- ferences. The scientific program consisted of technical presentations of accepted papers, plenary talks by distinguished keynote speakers, workshops and tutorials, a discovery challenge track, as well as demo and industrial tracks. Moreover, two co-located workshops were organized on related research topics. We expect that all those scientific activities provide opportunities for knowledge dissemination, fruitful discussions and exchange of ideas among people both from academia and industry. Moreover, we hope that this conference will continue to offer a unique forum that stimulates and encourages close interaction among researchers work- ing on machine learning and data mining. We were very happy to have the conference back in Greece after 1995 when ECML was successfully organized in Heraklion, Crete. However, this was the first time that the joint ECML PKDD event was organized in Greece and, more specifically, in Athens, with the conference venue boasting a superb location under the Acropolis and in front of the Temple of Zeus. Besides the scientific activities, the conference offered delegates an attractive range of social activities, such as a welcome reception on the roof garden of the conference venue directly facing the Acropolis hill, a poster session at “Technopolis” Gazi industrial park, the conference banquet, and a farewell party at the new Acropolis Museum, one of the most impressive archaeological museums worldwide, which included a guided tour of the museum exhibits. Several people worked hard together as a superb dedicated team to ensure the successful organization of this conference. First, we would like to express our thanks and deep gratitude to the PC Chairs Dimitrios Gunopulos, Thomas Hof- mann, Donato Malerba and Michalis Vazirgiannis. They efficiently carried out the enormous task of coordinating the rigorous hierarchical double-blind review process that resulted in a rich, while at the same time, very selective scientific program. Their contribution was crucial and essential in all phases and aspects of the conference organization and it was by no means restricted only to the paper review process. We would also like to thank the Area Chairs and Program Com- mittee members for the valuable assistance they offered to the PC Chairs in their VI Welcome to ECML PKDD 2011 timely completion of the review process under strict deadlines. Special thanks should also be given to the Workshop Co-chairs, Bart Goethals and Katharina Morik, the Tutorial Co-chairs, Fosca Giannotti and Maguelonne Teisseire, the Discovery Challenge Co-chairs, Alexandros Kalousis and Vassilis Plachouras, the Industrial Session Co-chairs, Alexandros Ntoulas and Michail Vlachos, the Demo Track Co-chairs, Michelangelo Ceci and Spiros Papadimitriou, and the Best Pa- per Award Co-chairs, Sunita Sarawagi and Mich`ele Sebag. We further thank the keynote speakers, workshop organizers, the tutorial presenters and the organizers of the discovery challenge. Furthermore, we are indebted to the Publicity Co-chairs, Annalisa Appice and Grigorios Tsoumakas, who developed and implemented an effective dissem- ination plan and supported the Program Chairs in the production of the pro- ceedings, and also to Margarita Karkali for the development, support and timely update of the conference website. We further thank the members of the ECML PKDD Steering Committee for their valuable help and guidance. The conference was financially supported by the following generous sponsors who are worthy of special acknowledgment: Google, Pascal2 Network, Xerox, Ya- hoo Labs, COST-MOVE Action, Rapid-I, FP7-MODAP Project, Athena RIC / Institute for the Management of Information Systems, Hellenic Artificial Intel- ligence Society, Marathon Data Systems, and Transinsight. Additional support was generously provided by Sony, Springer, and the UNESCO Privacy Chair Pro- gram. This support has given us the opportunity to specify low registration rates, provide video-recording services and support students through travel grants for attending the conference. The substantial effort of the Sponsorship Co-chairs, Ina Lauth and Ioannis Kopanakis, was crucial in order to attract these spon- sorships, and therefore, they deserve our special thanks. Special thanks should also be given to the five organizing institutions, namely, University of Bari “Aldo Moro”, Athens University of Economics and Business, University of Athens, Uni- versity of Ioannina, and University of Piraeus for supporting in multiple ways our task. We would like to especially acknowledge the members of the Local Organiza- tion team, Maria Halkidi, Despoina Kopanaki and Nikos Pelekis, for making all necessary local arrangements and Triaena Tours & Congress S.A. for efficiently handling finance and registrations. The essential contribution of the student vol- unteers also deserves special acknowledgment. Finally, we are indebted to all researchers who considered this conference as a forum for presenting and disseminating their research work, as well as to all conference participants, hoping that this event will stimulate further expansion of research and industrial activities in machine learning and data mining.

July 2011 Aristidis Likas Yannis Theodoridis Preface

The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2011) took place in Athens, Greece, during September 5–9, 2011. This year we have completed the first decade since the junction between the European Conference on Machine Learn- ing and the Principles and Practice of Knowledge Discovery in Data Bases con- ferences, which as independent conferences date back to 1986 and 1997, respec- tively. During this decade there has been an increasing integration of the two fields, as reflected by the rising number of submissions of top-quality research re- sults. In 2008 a single ECML PKDD Steering Committee was established, which gathered senior members of both communities. The ECML PKDD conference is a highly selective conference in both areas, the leading forum where researchers in machine learning and data mining can meet, present their work, exchange ideas, gain new knowledge and perspectives, and motivate the development of new interesting research results. Although tra- ditionally based in Europe, ECML PKDD is also a truly international conference with rich and diverse participation. In 2011, as in previous years, ECML PKDD followed a full week sched- ule, from Monday to Friday. It featured six plenary invited talks by Rakesh Agrawal, Albert-L´aszl´o Barab´asi, Christofer Bishop, Andrei Broder, Marco Gori and Heikki Mannila. Monday and Friday were devoted to workshops selected by Katharina Morik and Bart Goethals, and tutorials, organized and selected by Fosca Giannotti and Maguelonne Teisseire. There was also an interesting in- dustrial session, managed by Alexandros Ntoulas and Michalis Vlachos, which welcomed distinguished speakers from the ML and DM industry: Vasilis Agge- lis, Radu Jurca, Neel Sundaresan and Olivier Verscheure. The 2011 discovery challenge was organized by Alexandros Kalousis and Vassilis Plachouras. The main conference program unfolded from Tuesday to Thursday, where 121 papers selected among 599 full-paper submissions were presented in the technical parallel sessions and in a poster session open to all accepted papers. The acceptance rate of 20% supports the traditionally high standards of the joint conference. The selection process was assisted by 35 Area Chairs, each supervising the reviews and discussions of about 17 papers, and by 270 members of the Program Committee, with the help of 197 additional reviewers. While the selection process was made particularly intense due to the very high number of submissions, we are grateful and heartily thank all Area Chairs, members of the Program Committee, and additional reviewers for their commitment and hard work during the tight reviewing period. The composition of the paper topics covered a wide spectrum of issues. A sig- nificant portion of the accepted papers dealt with core issues such as supervised VIII Preface and unsupervised learning with some innovative contributions in fundamental issues such as cluster-ability of a dataset. Other fundamental issues tackled by accepted papers include dimensionality reduction, distance and similarity learning, model learning and matrix/tensor analysis. In addition, there was a significant cluster of papers with valuable con- tributions on graph mining, graphical models, hidden Markov models, kernel methods, active and ensemble learning, semi-supervised and transductive learn- ing, mining sparse representations, model learning, inductive logic programming, and statistical learning. A significant part of the program covered novel and timely applications of data mining and machine learning in industrial domains, including: privacy- preserving and discrimination-aware mining, spatiotemporal data mining, text mining, topic modeling, learning from environmental and scientific data, Web mining and Web search, link analysis, bio/medical data, data Streams and sensor data, ontology-based data, relational data mining, learning from time series data, time series data. In the past three editions of the joint conference, the two Springer journals Machine Learning and Data Mining and Knowledge Discovery published the top papers in two special issues printed in advance of the conference. These papers were not included in the conference proceedings, so there was no double publication of the same work. A novelty introduced this year was the post- conference publication of the special issues in order to guarantee the expected high-standard reviews for top-quality journals. Therefore, authors of selected machine learning and data mining papers were invited to submit a significantly extended version of their paper to the special issues. The selection was made by Program Chairs on the basis of their exceptional scientific quality and high impact on the field, as indicated by conference reviewers. Following an earlier tradition, the Best Paper Chairs Sunita Sarawagi and Mich`ele Sebag contributed to the selection of papers deserving the Best Pa- per Awards and Best Student Paper Awards in Machine Learning and in Data Mining, sponsored by Springer. As ECML PKDD completes 10 years of joint organization, the PC chairs, together with the steering committee, initiated a 10-year Awards series. This award is established for the author(s), whose paper appeared in the ECML PKDD conference 10 years ago, and had the most im- pact on the machine learning and data mining research since then. This year’s, first award, committee consisted of three PC co-chairs (Dimitrios Gunopulos, Do- nato Malerba and Michalis Vazirgiannis) and three Steering Committee members (Wray Buntine, Bart Goethals and Mich`ele Sebag). The conference also featured a demo track, managed by Michelangelo Ceci and Spiros Papadimitriou; 11 demos out of 21 submitted were selected by a Demo Track Program Committee, presenting prototype systems that illustrate the high impact of machine learning and data mining application in technology. The demo descriptions are included in the proceedings. We further thank the members of the Demo Track Program Committee for their efforts in timely reviewing submitted demos. Preface IX

Finally, we would like to thank the General Chairs, Aristidis Likas and Yannis Theodoridis, for their critical role in the success of the conference, the Tutorial, Workshop, Demo, Industrial Session, Discovery Challenge, Best Paper, and Local Chairs, the Area Chairs and all reviewers, for their voluntary, highly dedicated and exceptional work, and the ECML PKDD Steering Committee for their help and support. Our last and warmest thanks go to all the invited speakers, the speakers, all the attendees, and especially to the authors who chose to submit their work to the ECML PKDD conference and thus enabled us to build up this memorable scientific event.

July 2011 Dimitrios Gunopulos Thomas Hofmann Donato Malerba Michalis Vazirgiannis Organization

ECML/PKDD 2011 was organized by the Department of Computer Science— University of Ioannina, Department of Informatics—University of Piraeus, De- partment of Informatics and Telecommunications—University of Athens, Google Inc. (Zurich), Dipartimento di Informatica—Universit`a degli Studi di Bari “Aldo Moro,” and Department of Informatics—Athens University of Economics & Business.

General Chairs Aristidis Likas University of Ioannina, Greece Yannis Theodoridis University of Piraeus, Greece

Program Chairs

Dimitrios Gunopulos University of Athens, Greece Thomas Hofmann Google Inc., Zurich, Switzerland Donato Malerba University of Bari “Aldo Moro,” Italy Michalis Vazirgiannis Athens University of Economics and Business, Greece

Workshop Chairs

Katharina Morik University of Dortmund, Germany Bart Goethals University of Antwerp,

Tutorial Chairs Fosca Giannotti Knowledge Discovery and Delivery Lab, Italy Maguelonne Teisseire TETIS Lab. Department of Information System and LIRMM Lab. Department of Computer Science,

Best Paper Award Chairs

Sunita Sarawagi Computer Science and Engineering, IIT Bombay, India Mich`ele Sebag University of Paris-Sud, France XII Organization

Industrial Session Chairs Alexandros Ntoulas Microsoft Research, USA Michail Vlachos IBM Zurich Research Laboratory, Switzerland

Demo Track Chairs Michelangelo Ceci University of Bari “Aldo Moro,” Italy Spiros Papadimitriou Google Research

Discovery Challenge Chairs

Alexandros Kalousis University of Geneva, Switzerland Vassilis Plachouras Athens University of Economics and Business, Greece

Publicity Chairs

Annalisa Appice University of Bari “Aldo Moro,” Italy Grigorios Tsoumakas Aristotle University of Thessaloniki, Greece

Sponsorship Chairs

Ina Lauth IAIS Fraunhofer, Germany Ioannis Kopanakis Technological Educational Institute of Crete, Greece

Organizing Committee

Maria Halkidi University of Pireaus, Greece Despina Kopanaki University of Piraeus, Greece Nikos Pelekis University of Piraeus, Greece

Steering Committee

Jos´eBalc´azar Bart Goethals Francesco Bonchi Katharina Morik Wray Buntine Dunja Mladenic Walter Daelemans John Shawe-Taylor Aristides Gionis Mich`ele Sebag Organization XIII

Area Chairs

Elena Baralis Michael May Hendrik Blockeel Taneli Mielikainen Francesco Bonchi Yucel¨ Saygin Gautam Das Arno Siebes Janez Demsar Jian Pei Amol Deshpande Myra Spiliopoulou Carlotta Domeniconi Jie Tang Tapio Elomaa Evimaria Terzi Floriana Esposito Bhavani M. Thuraisingham Fazel Famili Hannu Toivonen Wei Fan Luis Torgo Peter Flach Ioannis Tsamardinos Johannes Furnkranz Panayiotis Tsaparas Aristides Gionis Ioannis P. Vlahavas George Karypis Haixun Wang Ravi Kumar Stefan Wrobel James Kwok Xindong Wu Stan Matwin

Program Committee

Foto Afrati Konstantinos Blekas Aijun An Mario Boley Aris Anagnostopoulos Zoran Bosnic Gennady Andrienko Marco Botta Ion Androutsopoulos Jean-Francois Boulicaut Annalisa Appice Pavel Bradzil Marta Arias Ulf Brefeld Ira Assent Paula Brito Vassilis Athitsos Wray Buntine Martin Atzmueller Toon Calders Jose Luis Balcazar Rui Camacho Daniel Barbara Longbing Cao Sugato Basu Michelangelo Ceci Roberto Bayardo Tania Cerquitelli Klaus Berberich Sharma Chakravarthy Bettina Berendt Keith Chan Michele Berlingerio Vineet Chaoji Michael Berthold Keke Chen Indrajit Bhattacharya Ling Chen Marenglen Biba Xue-wen Chen Albert Bifet Weiwei Cheng Enrico Blanzieri Yun Chi XIV Organization

Silvia Chiusano Maxim Gurevich Vassilis Christophides Maria Halkidi Frans Coenen Mohammad Hasan James Cussens Jose Hernandez-Orallo Alfredo Cuzzocrea Eyke Hullermeier¨ Maria Damiani Vasant Honavar Atish Das Sarma Andreas Hotho Tijl De Bie Xiaohua Hu Jeroen De Knijf Ming Hua Colin de la Higuera Minlie Huang Antonios Deligiannakis Marcus Hutter Krzysztof Dembczynski Nathalie Japkowicz Anne Denton Szymon Jaroszewicz Christian Desrosiers Daxin Jiang Wei Ding Alipio Jorge Ying Ding Theodore Kalamboukis Debora Donato Alexandros Kalousis Kurt Driessens Panagiotis Karras Chris Drummond Samuel Kaski Pierre Dupont Ioannis Katakis Saso Dzeroski John Keane Tina Eliassi-Rad Kristian Kersting Roberto Esposito Latifur Khan Nicola Fanizzi Joost Kok Fabio Fassetti Christian Konig Ad Feelders Irena Koprinska Hakan Ferhatosmanoglou Walter Kosters Stefano Ferilli Georgia Koutrika Cesar Ferri Stefan Kramer Daan Fierens Raghuram Krishnapuram Eibe Frank Marzena Kryszkiewicz Enrique Frias-Martinez Nicolas Lachiche Elisa Fromont Nada Lˇavrac Efstratios Gallopoulos Wang-Chien Lee Byron Gao Feifei Li Jing Gao Jiuyong Li Paolo Garza Juanzi Li Ricard Gavalda Tao Li Floris Geerts Chih-Jen Lin Pierre Geurts Hsuan-Tien Lin Aris Gkoulalas-Divanis Jessica Lin Bart Goethals Shou-de Lin Vivekanand Gopalkrishnan Song Lin Marco Gori Helger Lipmaa Henrik Grosskreutz Bing Liu Organization XV

Huan Liu Kunal Punera Yan Liu Chedy Raissi Corrado Loglisci Jan Ramon Chang-Tien Lu Huzefa Rangwala Ping Luo Zbigniew Ras Panagis Magdalinos Ann Ratanamahatana Giuseppe Manco Jan Rauch Yannis Manolopoulos Matthias Renz Simone Marinai Christophe Rigotti Dimitrios Mavroeidis Fabrizio Riguzzi Ernestina Menasalvas Celine Robardet Rosa Meo Marko Robnik-Sikonja Pauli Miettinen Pedro Rodrigues Dunja Mladenic Fabrice Rossi Marie-Francine Moens Juho Rousu Katharina Morik Celine Rouveirol Mirco Nanni Ulrich Ruckert¨ Alexandros Nanopoulos Salvatore Ruggieri Benjamin Nguyen Stefan Ruping Frank Nielsen Lorenza Saitta Siegfried Nijssen Ansaf Salleb-Aouissi Richard Nock Claudio Sartori Kjetil Norvag Lars Schmidt-Thieme Irene Ntoutsi Matthias Schubert Salvatore Orlando Mich`ele Sebag Gerhard Paass Thomas Seidl George Paliouras Prithviraj Sen Apostolos Papadopoulos Andrzej Skowron Panagiotis Papapetrou Carlos Soares Stelios Paparizos Yangqiu Song Dimitris Pappas Alessandro Sperduti Ioannis Partalas Jerzy Stefanowski Srinivasan Parthasarathy Jean-Marc Steyaert Andrea Passerini Alberto Suarez Vladimir Pavlovic Johan Suykens Dino Pedreschi Einoshin Suzuki Nikos Pelekis Panagiotis Symeonidis Jing Peng Marcin Szczuka Ruggero Pensa Andrea Tagarelli Bernhard Pfahringer Domenico Talia Fabio Pinelli Pang-Ning Tan Enric Plaza Letizia Tanca George Potamias Lei Tang Michalis Potamias Dacheng Tao Doina Precup Nikolaj Tatti XVI Organization

Martin Theobald Jieping Ye Dimitrios Thilikos Jeffrey Yu Jilei Tian Philip Yu Ivor Tsang Bianca Zadrozny Grigorios Tsoumakas Gerson Zaverucha Theodoros Tzouramanis Demetris Zeinalipour Antti Ukkonen Filip Zelezny Takeaki Uno Changshui Zhang Athina Vakali Kai Zhang Giorgio Valentini Kun Zhang Maarten van Someren Min-Ling Zhang Iraklis Varlamis Nan Zhang Julien Velcin Shichao Zhang Celine Vens Zhongfei Zhang Jean-Philippe Vert Junping Zhang Vassilios Verykios Ying Zhao Herna Viktor Bin Zhou Jilles Vreeken Zhi-Hua Zhou Willem Waegeman Kenny Zhu Jianyong Wang Xingquan Zhu Wei Wang Djamel Zighed Xuanhui Wang Indre Zliobaite Hui Xiong Blaz Zupan

Additional Reviewers

Pedro Abreu Brigitte Boden Raman Adaikkalavan Samuel Branders Artur Aiguzhinov Janez Brank Darko Aleksovski Agn`es Braud Tristan Allard Giulia Bruno Alessia Amelio Luca Cagliero Panayiotis Andreou Yuanzhe Cai Fabrizio Angiulli Ercan Canhas Josephine Antoniou Eugenio Cesario Andrea Argentini George Chatzimilioudis Krisztian Balog Xi Chen Teresa M.A. Basile Anna Ciampi Christian Beecks Marek Ciglan Antonio Bella Tyler Clemons Aurelien Bellet Joseph Cohen Dominik Benz Carmela Comito Thomas Bernecker Andreas Constantinides Alberto Bertoni Michael Corsello Jerzy Blaszczynski Michele Coscia Organization XVII

Gianni Costa Faisal Kamiran Claudia D’Amato Michael Kamp Mayank Daswani Mehdi Kargar Gerben de Vries Mehdi Kaytoue Nicoletta Del Buono Jyrki Kivinen Elnaz Delpisheh Dragi Kocev Elnaz Delpisheh Domen Koˇsir Engin Demir Iordanis Koutsopoulos Nicola Di Mauro Hardy Kremer Ivica Dimitrovski Anastasia Krithara Martin Dimkovski Artus Krohn-Grimberghe Huyen Do Onur Kucuktunc Lucas Drumond Tor Lattimore Ana Luisa Duboc Florian Lemmerich Tobias Emrich Jun Li Saeed Faisal Rong-Hua Li Zheng Fang Yingming Li Wei Feng Siyi Liu C`esar Ferri Stefano Lodi Alessandro Fiori Claudio Lucchese Nuno A. Fonseca Gjorgji Madjarov Marco Frasca M. M. Hassan Mahmud Christoph Freudenthaler Fernando Mart´ınez-Plumed Sergej Fries Elio Masciari Natalja Friesen Michael Mathioudakis David Fuhry Ida Mele Dimitrios Galanis Corrado Mencar Zeno Gantner Glauber Menezes Bo Gao Pasquale Minervini Juan Carlos Gomez Ieva Mitasiunaite-Besson Alberto Grand Folke Mitzlaff Francesco Gullo Anna Monreale Tias Guns Gianluca Moro Marwan Hassani Alessandro Moschitti Zhouzhou He Yang Mu Daniel Hern`andez-Lobato Ricardo Nanculef˜ Tomas Horvath Franco Maria Nardini Dino Ienco Robert Neumayer Elena Ikonomovska Phuong Nguyen Anca Ivanescu Stefan Novak Francois Jacquenet Tim O’Keefe Baptiste Jeudy Riccardo Ortale Dustin Jiang Aline Paes Lili Jiang George Pallis Maria Jose Luca Pappalardo XVIII Organization

J´erˆome Paul Anˇze Staric Daniel Paurat Erik Strumbelj Sergios Petridis Ilija Subasic Darko Pevec Peter Sunehag Jean-Philippe Peyrache Izabela Szczech Anja Pilz Frank Takes Marc Plantevit Aditya Telang Matija Polajnar Eleftherios Tiakas Giovanni Ponti Gabriele Tolomei Adriana Prado Marko Toplak Xu Pu Daniel Trabold ZhongAng Qi Roberto Trasarti Ariadna Quattoni Paolo Trunfio Alessandra Raffaet`a George Tsatsaronis Maria Jose Ram´ırez-Quintana Guy Van den Broeck Hongda Ren Antoine Veillard Salvatore Rinzivillo Mathias Verbeke Ettore Ritacco Jan Verwaeren Matthew Robards Alessia Visconti Christophe Rodrigues Dimitrios Vogiatzis Giulio Rossetti Dawei Wang Andr´e Rossi Jun Wang Cl´audio S´a Louis Wehenkel Venu Satuluri Adam Woznica Leander Schietgat Ming Yang Marijn Schraagen Qingyan Yang Wen Shao Xintian Yang Wei Shen Lan Zagar A. Pedro Duarte Silva Jure Zbontarˇ Claudio Silvestri Bernard Zenko Ivica Slavkov Pin Zhao Henry Soldano Yuanyuan Zhu Yi Song Albrecht Zimmermann Miha Stajdohar Andreas Zufle¨

Sponsoring Institutions We wish to express our gratitude to the sponsors of ECML PKDD 2011 for their essential contribution to the conference: Google, Pascal2 Network, Xerox, Yahoo Lab, COST/MOVE (Knowledge Discovery from Movin, Objects), FP7/MODAP (Mobility, Data Mining, and Privacy), Rapid-I (report the future), Athena/IMIS (Institute for the Management of Information Systems), EETN (Hellenic Arti- ficial Intelligence Society), MDS Marathon Data Systems, Transinsight, SONY, UNESCO Privacy Chair, Springer, Universit`a degli Studi di Bari “Aldo Moro,” Athens University of Economics and Business, University of Ioannina, National and Kapodistrian University of Athens, and the University of Piraeus. Table of Contents – Part III

Regular Papers

Sparse Kernel-SARSA(λ) with an Eligibility Trace ...... 1 Matthew Robards, Peter Sunehag, Scott Sanner, and Bhaskara Marthi

Influence and Passivity in Social Media ...... 18 Daniel M. Romero, Wojciech Galuba, Sitaram Asur, and Bernardo A. Huberman

Preference Elicitation and Inverse Reinforcement Learning ...... 34 Constantin A. Rothkopf and Christos Dimitrakakis

A Novel Framework for Locating Software Faults Using Latent Divergences ...... 49 Shounak Roychowdhury and Sarfraz Khurshid

Transfer Learning with Adaptive Regularizers ...... 65 Ulrich Ruckert¨ and Marius Kloft

Multimodal Nonlinear Filtering Using Gauss-Hermite Quadrature ...... 81 Hannes P. Saal, Nicolas M.O. Heess, and Sethu Vijayakumar

Active Supervised Domain Adaptation ...... 97 Avishek Saha, Piyush Rai, Hal Daum´e III, Suresh Venkatasubramanian, and Scott L. DuVall

Efficiently Approximating Markov Tree Bagging for High-Dimensional Density Estimation ...... 113 Fran¸cois Schnitzler, Sourour Ammar, Philippe Leray, Pierre Geurts, and Louis Wehenkel

Resource-Aware On-line RFID Localization Using Proximity Data ..... 129 Christoph Scholz, Stephan Doerfel, Martin Atzmueller, Andreas Hotho, and Gerd Stumme

On the Stratification of Multi-label Data ...... 145 Konstantinos Sechidis, Grigorios Tsoumakas, and Ioannis Vlahavas

Learning First-Order Definite Theories via Object-Based Queries ...... 159 Joseph Selman and Alan Fern

Fast Support Vector Machines for Structural Kernels ...... 175 Aliaksei Severyn and Alessandro Moschitti XX Table of Contents – Part III

Generalized Agreement Statistics over Fixed Group of Experts ...... 191 Mohak Shah

Compact Coding for Hyperplane Classifiers in Heterogeneous Environment ...... 207 Hao Shao, Bin Tong, and Einoshin Suzuki

Multi-label Ensemble Learning...... 223 Chuan Shi, Xiangnan Kong, Philip S. Yu, and Bai Wang

Rule-Based Active Sampling for Learning to Rank ...... 240 Rodrigo Silva, Marcos A. Gon¸calves, and Adriano Veloso

Parallel Structural Graph Clustering ...... 256 Madeleine Seeland, Simon A. Berger, Alexandros Stamatakis, and Stefan Kramer

Aspects of Semi-supervised and Active Learning in Conditional Random Fields ...... 273 Nataliya Sokolovska

Comparing Probabilistic Models for Melodic Sequences ...... 289 Athina Spiliopoulou and Amos Storkey

Fast Projections Onto 1,q-Norm Balls for Grouped Feature Selection ... 305 Suvrit Sra

Generalized Dictionary Learning for Symmetric Positive Definite Matrices with Application to Nearest Neighbor Retrieval ...... 318 Suvrit Sra and Anoop Cherian

Network Regression with Predictive Clustering Trees ...... 333 Daniela Stojanova, Michelangelo Ceci, Annalisa Appice, and Saˇso Dˇzeroski

Learning from Label Proportions by Optimizing Cluster Model Selection ...... 349 Marco Stolpe and Katharina Morik

The Minimum Code Length for Clustering Using the Gray Code ...... 365 Mahito Sugiyama and Akihiro Yamamoto

Learning to Infer Social Ties in Large Networks ...... 381 Wenbin Tang, Honglei Zhuang, and Jie Tang

Comparing Apples and Oranges: Measuring Differences between Data Mining Results ...... 398 Nikolaj Tatti and Jilles Vreeken Table of Contents – Part III XXI

Learning Monotone Nonlinear Models Using the Choquet Integral ...... 414 AliFallahTehrani,WeiweiCheng,KrzysztofDembczy´nski, and Eyke Hullermeier¨

Feature Selection for Transfer Learning ...... 430 Selen Uguroglu and Jaime Carbonell

Multiview Semi-supervised Learning for Ranking Multilingual Documents ...... 443 Nicolas Usunier, Massih-Reza Amini, and Cyril Goutte

Non-redundant Subgroup Discovery in Large and Complex Data ...... 459 Matthijs van Leeuwen and Arno Knobbe

Larger Residuals, Less Work: Active Document Scheduling for Latent Dirichlet Allocation ...... 475 Mirwaes Wahabzada and Kristian Kersting

A Community-Based Pseudolikelihood Approach for Relationship Labeling in Social Networks ...... 491 Huaiyu Wan, Youfang Lin, Zhihao Wu, and Houkuan Huang

Correcting Bias in Statistical Tests for Network Classifier Evaluation ... 506 Tao Wang, Jennifer Neville, Brian Gallagher, and Tina Eliassi-Rad

Differentiating Code from Data in x86 Binaries ...... 522 Richard Wartell, Yan Zhou, Kevin W. Hamlen, Murat Kantarcioglu, and Bhavani Thuraisingham

Bayesian Matrix Co-Factorization: Variational Algorithm and Cram´er-Rao Bound ...... 537 Jiho Yoo and Seungjin Choi

Learning from Inconsistent and Unreliable Annotators by a Gaussian Mixture Model and Bayesian Information Criterion ...... 553 Ping Zhang and Zoran Obradovic iDVS: An Interactive Multi-document Visual Summarization System ... 569 Yi Zhang, Dingding Wang, and Tao Li

Discriminative Experimental Design ...... 585 Yu Zhang and Dit-Yan Yeung

Active Learning with Evolving Streaming Data ...... 597 Indr˙e Zliobait˙ˇ e, Albert Bifet, Bernhard Pfahringer, and Geoff Holmes XXII Table of Contents – Part III

Demo Papers

Celebrity Watch: Browsing News Content by Exploiting Social Intelligence ...... 613 Omar Ali, Ilias Flaounas, Tijl De Bie, and Nello Cristianini

MOA: A Real-Time Analytics Open Source Framework ...... 617 Albert Bifet, Geoff Holmes, Bernhard Pfahringer, Jesse Read, Philipp Kranen, Hardy Kremer, Timm Jansen, and Thomas Seidl

L-SME: A System for Mining Loosely Structured Motifs ...... 621 Fabio Fassetti, Gianluigi Greco, and Giorgio Terracina

The MASH Project ...... 626 Fran¸cois Fleuret, Philip Abbet, Charles Dubout, and Leonidas Lefakis

Activity Recognition With Mobile Phones ...... 630 Jordan Frank, Shie Mannor, and Doina Precup

MIME: A Framework for Interactive Visual Pattern Mining ...... 634 Bart Goethals, Sandy Moens, and Jilles Vreeken

InFeRno – An Intelligent Framework for Recognizing Pornographic Web Pages ...... 638 Sotiris Karavarsamis, Nikos Ntarmos, and Konstantinos Blekas

MetaData Retrieval: A Software Prototype for the Annotation of Maps with Social Metadata ...... 642 Rosa Meo, Elena Roglia, and Enrico Ponassi

TRUMIT: A Tool to Support Large-Scale Mining of Text Association Rules ...... 646 Robert Neumayer, George Tsatsaronis, and Kjetil Nørv˚ag

Traffic Jams Detection Using Flock Mining ...... 650 Rebecca Ong, Fabio Pinelli, Roberto Trasarti, Mirco Nanni, Chiara Renso, Salvatore Rinzivillo, and Fosca Giannotti

Exploring City Structure from Georeferenced Photos Using Graph Centrality Measures ...... 654 Katerina Vrotsou, Natalia Andrienko, Gennady Andrienko, and Piotr Jankowski

Author Index ...... 659