Lecture Notes in Computer Science 4976 Commenced Publication in 1973 Founding and Former Series Editors: Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen

Editorial Board David Hutchison Lancaster University, UK Takeo Kanade Carnegie Mellon University, Pittsburgh, PA, USA Josef Kittler University of Surrey, Guildford, UK Jon M. Kleinberg Cornell University, Ithaca, NY, USA Alfred Kobsa University of California, Irvine, CA, USA Friedemann Mattern ETH Zurich, Switzerland John C. Mitchell Stanford University, CA, USA Moni Naor Weizmann Institute of Science, Rehovot, Israel Oscar Nierstrasz University of Bern, Switzerland C. Pandu Rangan Indian Institute of Technology, Madras, India Bernhard Steffen University of Dortmund, Germany Madhu Sudan Massachusetts Institute of Technology, MA, USA Demetri Terzopoulos University of California, , CA, USA Doug Tygar University of California, Berkeley, CA, USA Gerhard Weikum Max-Planck Institute of Computer Science, Saarbruecken, Germany Yanchun Ge Yu Elisa Bertino Guandong Xu (Eds.)

Progress in WWW Research and Development

10th Asia-Pacific Web Conference, APWeb 2008 Shenyang, , April 26-28, 2008 Proceedings

1 3 Volume Editors

Yanchun Zhang Guandong Xu Victoria University, School of Computer Science and Mathematics Melbourne, VIC 8001, Australia E-mail: [email protected], [email protected]

Ge Yu Northeastern University, Department of Computer Science and Engineering Shenyang 110004, China E-mail: [email protected]

Elisa Bertino Purdue University, Department of Computer Science West Lafayette, IN 47907, USA E-mail: [email protected]

Library of Congress Control Number: 2008924178

CR Subject Classification (1998): H.2-5, C.2, D.2, I.2, K.4, J.1

LNCS Sublibrary: SL 3 – Information Systems and Application, incl. Internet/Web and HCI

ISSN 0302-9743 ISBN-10 3-540-78848-4 Springer Berlin Heidelberg New York ISBN-13 978-3-540-78848-5 Springer Berlin Heidelberg New York

This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer. Violations are liable to prosecution under the German Copyright Law. Springer is a part of Springer Science+Business Media springer.com © Springer-Verlag Berlin Heidelberg 2008 Printed in Germany Typesetting: Camera-ready by author, data conversion by Scientific Publishing Services, Chennai, India Printed on acid-free paper SPIN: 12247257 06/3180 5 4 3 2 1 0 Message from the Conference Co-chairs

This volume is the published record of the 10th Asia Pacific Conference on Web Technology (APWeb 2008), held in Shenyang, China, April 26-28, 2008. APWeb has been a premier forum in the Asia-Pacific region on theoretical and practical aspects of Web technologies, database systems, information man- agement and software engineering. Previous APWeb conferences were held in (1998), Hong Kong (1999), Xi’an (2000), Changsha (2001), Xi’an (2003), Hangzhou (2004), Shanghai (2005), Harbin (2006), and Huang Shan (Yellow Mountain) (2007). It was our great pleasure to have hosted this year’s APWeb conference in Shenyang, which is the center for communication, commerce, sci- ence and culture in the northeastern part of China. APWeb 2008 attracted more than 160 papers from 19 countries and regions. This made the work of the Program Committee rather challenging. We are grate- ful to the Program Co-chairs, namely, Yanchun Zhang, Ge Yu, and Elisa Bertino, who worked very hard to ensure the quality of the paper review process. Spe- cial thanks are due to Yanchun Zhang for taking care of many organizational issues. Without him the conference would not have been so successful. We would like to express our gratitude to Ling Feng and Keun Ho Ryu, Tutorial/Panel Co-chairs, and Haixun Wang, Industrial Chair, for their devotion in designing an attractive overall conference program. Moreover, we are thankful to Masaru Kitsuregawa, Wei-Ying Ma, Xuemin Lin, and Yong Shi for their keynote/invited lectures, which were the highlights of this year’s conference. Alongside the main conference, three high-quality workshops were arranged by Yoshiharu Ishikawa and Jing He, the Workshop Co-chairs. The Workshop on Information-explosion and Next Generation Search was organized by Katsumi Tanaka; the Workshop on Business Intelligence and Data Mining was run by Yong Shi, Guangyan Huang, and Jing He; and the Workshop on Health Data Management was offered by Chaoyi Pang and Qing Zhang. Moreover, a Doctoral Consortium on Data Engineering and Web Technology Research was organized to promote the research of doctoral students. All of them attracted many par- ticipants. In addition to the afore-mentioned program officers, the success of APWeb 2008 is attributed to many other people. Especially, we would like to thank Guoren Wang and Bin Zhang, Local Arrangements Co-chairs; Guandong Xu, Publication Chair, and Toshiyuki Amagasa and Hua Wang, Publicity Co-chairs. We also thank the APWeb Steering Committee and WISE Society for their continuous support. We hope that you will find the papers in this volume intellectually stimulat- ing. We believe APWeb 2008 will lead to further developments in advanced Web VI Preface engineering technologies and research activities, not only in the Asia-Pacific region but also in the international research arena.

April 2008 Hiroyuki Kitagawa Kam Fai Wong Preface

The rapid development of Web applications and the flux of Web information require new technologies for the design, implementation and management of information infrastructure on the Web. This volume contains papers selected for presentation at the 10th Asia Pacific Conference on Web Technology (APWeb 2008), which was held in Shenyang, China during April 26-28, 2008. APWeb is an international conference series on WWW technologies and is the primary forum for researchers, practitioners, developers and users from both academia and industry to exchange cutting-edge ideas, results, experience, techniques and tools on WWW-related technologies and new advanced applications. APWeb 2008 received 169 submissions from 19 countries and regions world- wide, including USA, Australia, Japan, Korea, China, Hong Kong, Taiwan, UK, Germany, India, France, Turkey, Pakistan, Switzerland, New Zealand, Iran, Macao, Malaysia, and Tunisia. After a thorough review process in which each paper was reviewed and recommended by at least three Program Committe (PC) members or external reviewers, the APWeb 2008 PC selected 48 regular research papers (acceptance ratio 28%) and 15 short papers (acceptance ratio 9%). This volume also includes four invited/keynote papers. The keynote lec- tures were given by Masaru Kitsuregawa (University of ), Wei-Ying Ma (Microsoft Research Asia), Xuemin Lin (University of New South Wales) and Yong Shi (China Academy of Sciences). The abstracts of two tutorials presented by Xiaofang Zhou (University of Queensland) and Souvav Bhowmick (Nanyang Technological University, Singapore) are also included in these proceedings. The conference was co-organized by Northeastern University, China, and Vic- toria University, Australia, and it was also financially sponsored by the National Natural Science Foundation of China and the Science and Technology Innovation Platform of Information Infrastructure Key Techniques of the 985 Program. We wish to present our gratitude to the APWeb Steering Committee, the APWeb 2008 Organizing Committee, and the PC members and many external reviewers for their dedication in promoting the conference and for their expertise in selecting papers. We also wish to thank all authors for submitting high-quality work to the conference. We would like to thank the General Chairs Hiroyuki Kitagawa and Kam- Fai Wong for the leadership; the Workshop Chairs Yoshiharu Ishikawa and Jing He for coordinating and organizing a high-quality workshop program together with the workshop organizers; the Tutorial Chairs Ling Feng and Keun Ho Ryu for the soliciting and selecting two excellent tutorials; and the Publicity Chairs Toshiyuki Amagasa and Hua Wang for actively promoting the event. Last but not least, we would like to thank the local Arrangements Com- mittee, led by Guoren Wang, and many colleagues and volunteers significantly contributed to the preparation of the conference for their enormous help. In VIII Preface particular, we thank Zhibin Zhao, Xiangguo Zhao and Donghong Han for their support and help in registration, accommodation and local arrangements, and Yanan Hao for his great efforts in maintaining the paper review system and communication with authors.

February 2008 Yanchun Zhang Ge Yu Elisa Bertino Guandong Xu Conference Organization

Conference Co-chairs

Hiroyuki Kitagawa, University of Tsukuba, Japan Kam-Fai Wong, Chinese University of Hong Kong

Program Committee Co-chairs

Yanchun Zhang, Victoria University, Australia Ge Yu, Northeastern University, China Elisa Bertino, Purdue University, USA

Workshop Co-chairs

Yoshiharu Ishikawa, Nagoya University, Japan Jing He, Chinese Academy of Sciences, China

Tutorial/Panel Co-chairs

Ling Feng, Tsinghua University, China Keun Ho Ryu, Chungbuk National University, Korea

Industrial Chair

Haixun Wang, IBM T.J. Watson Research Center, USA

Publication Chair

Guandong Xu, Victoria University, Australia

Publicity Co-chairs

Toshiyuki Amagasa, University of Tsukuba, Japan Hua Wang, University of Southern Queensland, Australia

Local Arrangements Co-chairs Guoren Wang, Northeastern University, China Bin Zhang, Northeastern University, China X Organization

APWeb Steering Committee

Xiaoming Li (Peking University) Xuemin Lin (University of New South Wales) Maria Orlowska (Ministry of Sciences and Higher Education, Poland) Kyu-Young Whang (KAIST) Jeffrey Yu (Chinese University of Hong Kong) Yanchun Zhang (Victoria University) Xiaofang Zhou (University of Queensland) Chair

APWeb Steering Committee Liaison

Xiaofang Zhou, University of Queensland, Australia

WISE Society Liaison

Qing Li, City University of Hong Kong

Program Committee

James Bailey, Australia Guido Governatori, Australia Rafae Bhatti, USA Stephane Grumbach, France Sourav Bhowmick, Singapore Giovanna Guerrini, Italy Haiyun Bian, USA Jingfeng Guo, China Klemens Boehm, Germany Mohand-Said Hacid, France Athman Bouguettaya, USA Michael Houle, Japan Stephane Bressan, Singapore Ela Hunt, Switzerland Ji-Won Byun, USA Yoshiharu Ishikawa, Japan Akmal Chaudhri, USA Renato Iannella, Australia Lei Chen, Hong Kong Yan Jia, China Reynold Cheng, Hong Kong Panagiotis Kalnis, Singapore Byron Choi, Singapore Murat Kantarcioglu, USA Gao Cong, UK Markus Kirchberg, New Zealand Bin Cui, China Flip Korn, USA Alfredo Cuzzocrea, Italy Manolis Koubarakis, Greece Xiaoyong Du, China Chiang Lee, Taiwan Yaokai Feng, Japan Chen Li, USA Ling Feng, China Qing Li, Hong Kong Jianhua Feng, China Zhanhuai Li, China Eduardo Fernandez, USA Xuemin Lin, Australia Gabriel Fung, Hong Kong Tieyan Liu, China Hong Gao, China Qing Liu, Australia Zhiguo Gong, Macao Mengchi Liu, Canada Organization XI

Hongyan Liu, China Nan Tang, Hong Kong Weiyi Liu, China Changjie Tang, China Jiaheng Lu, USA David Taniar, Australia Emil Lupu, UK Jianyong Wang, China Liping Ma, Australia Tengjiao Wang, China Lorenzo Martino, USA Guoren Wang, China Weiyi Meng, USA Junhu Wang, Australia Xiaofeng Meng, China Lizhen Wang, China Mukesh Mohania, India Wei Wang, Australia Miyuki Nakano, Japan Daling Wang, China Federica Paci, Italy Haixun Wang, USA Vasile Palade, UK Hua Wang, Australia Chaoyi Pang, Australia Min Wang, USA Jian Pei, Canada Wei Wang, China Zhiyong Peng, China Jitian Xiao, Australia Evaggelia Pitoura, Greece Jianliang Xu, Hong Kong Weining Qian, China Jun Yan, Australia Wenyu Qu, Japan Dongqing Yang, China Cartic Ramakrishnan, USA Jian Yang, Australia KeunHo Ryu, Korea Xiaochun Yang, China Monica Scannapieco, Italy Jian Yin, China Albrecht Schmidt, Denmark Cui Yu, USA Markus Schneider, USA Jeffrey Yu, Hong Kong HengTao Shen, Australia Lihua Yue, China Jialie Shen, Singapore Rui Zhang, Australia Derong Shen, China Xiuzhen Zhang, Australia Timothy Shih, Taiwan Qing Zhang, Australia Anna Squicciarini, USA Peixiang Zhao, Hong Kong Peter Stanchev, USA Aoying Zhou, China Kian-Lee Tan, Singapore Qiang Zhu, USA Table of Contents

Keynote Papers Socio-Sense: A System for Analysing the Societal Behavior from Long Term Web Archive ...... 1 Masaru Kitsuregawa, Takayuki Tamura, Masashi Toyoda, and Nobuhiro Kaji Building Web-Scale Data Mining Infrastructure for Search ...... 9 Wei-Ying Ma Aggregate Computation over Data Streams ...... 10 Xuemin Lin and Ying Zhang A Family of Optimization Based Data Mining Methods ...... 26 Yong Shi, Rong Liu, Nian Yan, and Zhenxing Chen

Tutorials Web Evolution Management: Detection, Monitoring, and Mining ...... 39 Sourav S. Bhowmick and Sanjay Madria Research and Practice in Data Quality...... 41 Shazia Sadiq, Xiaofang Zhou, and Ke Deng

Data Mining and Knowledge Discoverty Detecting Overlapping Community Structures in Networks with Global Partition and Local Expansion ...... 43 Fang Wei, Chen Wang, Li Ma, and Aoying Zhou High Confidence Fragment-Based Classification Rule Mining for Imbalanced HIV Data ...... 56 Bing Lv, Jianyong Wang, and Lizhu Zhou Source Credibility Model for Neighbor Selection in Collaborative Web Content Recommendation ...... 68 JinHyung Cho and Kwiseok Kwon The Layered World of Scientific Conferences ...... 81 Michael Kuhn and Roger Wattenhofer Mining Maximal Frequent Subtrees with Lists-Based Pattern-Growth Method ...... 93 Juryon Paik, Junghyun Nam, Jaegak Hwang, and Ung Mo Kim XIV Table of Contents

Mining the Web for Hyponymy Relations Based on Property Inheritance ...... 99 Shun Hattori, Hiroaki Ohshima, Satoshi Oyama, and Katsumi Tanaka

Detecting Outliers in Categorical Record Databases Based on Attribute Associations ...... 111 Kazuyo Narita and Hiroyuki Kitagawa

Wireless, Sensor Networks and Grid

An Energy-Efficient Multi-agent Based Architecture in Wireless Sensor Network ...... 124 Yi-Ying Zhang, Wen-Cheng Yang, Kee-Bum Kim, Min-Yu Cui, Ming Xue, and Myong-Soon Park

Task Migration Enabling Grid Workflow Application Rescheduling ..... 130 Xianwen Hao, Yu Dai, Bin Zhang, and Tingwei Chen

A Dependent Tasks Scheduling Model in Grid ...... 136 Tingwei Chen, Bin Zhang, and Xianwen Hao

A Hierarchical Replica Location Approach Based on Cache Mechanism and Load Balancing in Data Grid ...... 148 Baoyan Song, Yanying Mao, Yan Wang, and Derong Shen

Wireless Video-Based Sensor Networks for Surveillance of Residential Districts ...... 154 Guangyan Huang, Jing He, and Zhiming Ding

Trust Maintenance Toward Virtual Computing Environment in the Grid Service ...... 166 Dongbo Wang and Ai-min Wang

An Association Model of Sensor Properties for Event Diffusion Spotting Sensor Networks ...... 178 Xiaoning Cui, Qing Li, and Baohua Zhao

CROWNBench: A Grid Performance Testing System Using Customizable Synthetic Workload ...... 190 Xing Yang, Xiang Li, Yipeng Ji, and Mo Sha

XML and Query Processing and Optimization

A Decision Procedure for XPath Satisfiability in the Presence of DTD Containing Choice ...... 202 Yu Zhang, Yihua Cao, and Xunhao Li Table of Contents XV

Performance Analysis and Improvement for Transformation Operators in XML Data Integration ...... 214 Jiashen Tian, Jixue Liu, Weidong Pan, Millist Vincent, and Chengfei Liu

Similarity Computation for XML Documents by XML Element Sequence Patterns ...... 227 Haiwei Zhang, Xiaojie Yuan, Na Yang, and Zhongqi Liu

Evolving Weighting Functions for Query Expansion Based on Relevance Feedback ...... 233 A. Borji and M.Z. Jahromi

Ontology-Based Mobile Information Service Platform ...... 239 Hongwei Qi, Qiangze Feng, Bangyong Liang, and Toshikazu Fukushima

On Safety, Computability and Local Property of Web Queries (Extended Abstract) ...... 251 Hong-Cheu Liu and Jeffrey Xu Yu

Privacy, and Security

Privacy Inference Attacking and Prevention on Multiple Relative K-Anonymized Microdata Sets...... 263 Yalong Dong, Zude Li, and Xiaojun Ye

Exposing Homograph Obfuscation Intentions by Coloring Unicode Strings ...... 275 Liu Wenyin, Anthony Y. Fu, and Xiaotie Deng

On the Complexity of Restricted k-anonymity Problem ...... 287 Xiaoxun Sun, Hua Wang, and Jiuyong Li

An Efficient Electronic Marketplace Bidding Auction Protocol with Bid Privacy ...... 297 Wenbo Shi, Injoo Jang, and Hyeong Seon Yoo

A Provable Secure Authentication Protocol Given Forward Secure Session Key ...... 309 Wenbo Shi, Injoo Jang, and Hyeong Seon Yoo

A Secure Multi-dimensional Partition Based Index in DAS ...... 319 Jieping Wang and Xiaoyong Du

Multilateral Approaches to the Mobile RFID Security Problem Using Web Service ...... 331 Namje Park, Youjin Song, Dongho Won, and Howon Kim XVI Table of Contents

Classifying Security Patterns ...... 342 Eduardo B. Fernandez, Hironori Washizaki, Nobukazu Yoshioka, Atsuto Kubo, and Yoshiaki Fukazawa

Enhanced Mutual Authentication and Key Exchange Protocol for Wireless Communications ...... 348 Yi-jun He, Moon-Chuen Lee, and Jie Li

Verification of the Security Against Inference Attacks on XML Databases ...... 359 Kenji Hashimoto, Fumikazu Takasuka, Kimihide Sakano, Yasunori Ishihara, and Toru Fujiwara

Information Extraction, Presentation and Retrieval

Mining, Ranking, and Using Acronym Patterns ...... 371 Xiaonan Ji, Gu Xu, James Bailey, and Hang Li

A Method for Web Information Extraction ...... 383 Man I. Lam, Zhiguo Gong, and Maybin Muyeba

Information Presentation on Mobile Devices: Techniques and Practices ...... 395 Lin Qiao, Ling Feng, and Lizhu Zhou

Pattern-Based Extraction of Addresses from Web Page Content ...... 407 Saeid Asadi, Guowei Yang, Xiaofang Zhou, Yuan Shi, Boxuan Zhai, and Wendy Wen-Rong Jiang

An Effective Method Supporting Data Extraction and Schema Recognition on Deep Web ...... 419 Wei Liu, Derong Shen, and Tiezheng Nie

The Experiments with the Linear Combination Data Fusion Method in Information Retrieval ...... 432 Shengli Wu, Yaxin Bi, Xiaoqin Zeng, and Lixin Han

Squeezing Long Sequence Data for Efficient Similarity Search ...... 438 Guojie Song, Bin Cui, Baihua Zheng, Kunqing Xie, and Dongqing Yang

P2P, Agent Systems

An Architecture for Distributed Controllable Networks and Manageable Node Based on Network Processor ...... 450 Tao Liu, Depei Qian, Huang Yongxiang, and Rui Wang Table of Contents XVII

Building a Scalable P2P Network with Small Routing Delay ...... 456 Shiping Chen, Yuan Li, Kaihua Rao, Lei Zhao, Tao Li, and Shigang Chen

ERASP: An Efficient and Robust Adaptive Superpeer Overlay Network ...... 468 Wenjun Liu, Jiguo Yu, Jingjing Song, Xiaoqing Lan, and Baoxiang Cao

Traceable P2P Record Exchange Based on Database Technologies ...... 475 Fengrong Li and Yoshiharu Ishikawa

Ontology, Semantic Web and Web Applications

ACORN: Towards Automating Domain Specific Ontology Construction Process ...... 487 Eric Bae, Bintu G. Vasudevan, and Rajesh Balakrishnan

Ontological Knowledge Management Through Hybrid Unsupervised Clustering Techniques ...... 499 Ching-Chieh Kiu and Chien-Sing Lee

Semantic-Enabled Organization of Web Services ...... 511 Khanh Nguyen, Jinli Cao, and Chengfei Liu

Rule Mining for Automatic Ontology Based Data Cleaning ...... 522 Stefan Br¨uggemann

Towards Automatic Verification of Web-Based SOA Applications ...... 528 Xiangping Chen, Gang Huang, and Hong

A Novel and Effective Method for Web System Tuning Based on Feature Selection ...... 537 Shi Feng, Yan Liu, Daling Wang, and Derong Shen

Effective Data Distribution and Reallocation Strategies for Fast Query Response in Distributed Query-Intensive Data Environments ...... 548 Tengjiao Wang, Bishan Yang, Jun Gao, and Dongqing Yang

Data Streams, Time Series Analysis and Data Mining

A Novel Chi2 Algorithm for Discretization of Continuous Attributes .... 560 Wenyu Qu, Deqian Yan, Yu Sang, Hongxia Liang, Masaru Kitsuregawa, and Keqiu Li

Mining Multiple Time Series Co-movements ...... 572 Di Wu, Gabriel Pui Cheong Fung, Jeffrey Xu Yu, and Zheng Liu XVIII Table of Contents

Effective Spatio-temporal Analysis of Remote Sensing Data ...... 584 Zhongnan Zhang, Weili Wu, and Yaochun Huang

Supporting Top-K Aggregate Queries over Unequal Synopsis on Internet Traffic Streams...... 590 Ling Wang, Yang Koo Lee, and Keun Ho Ryu

Web Mining and Web Search

ONOMATOPEDIA: Onomatopoeia Online Example Dictionary System Extracted from Data on the Web ...... 601 Chisato Asaga, Yusuf Mukarramah, and Chiemi Watanabe

Connectivity of the Thai Web Graph ...... 613 Kulwadee Somboonviwat, Shinji Suzuki, and Masaru Kitsuregawa

On the Trustworthiness and Transparency of a Web Search Site Examined Using “Gender-equal” as a Search Keyword ...... 625 Naoko Oyama and Yoshifumi Masunaga

SemSearch: A Scalable Semantic Searching Algorithm for Unstructured P2P Network ...... 631 Wei Song, Ruixuan Li, Zhengding Lu, and Mudar Sarem

Web Image Annotation Based on Automatically Obtained Noisy Training Set ...... 637 Mei Wang, Xiangdong Zhou, and Hongtao Xu

An Effective Query Relaxation Solution for the Deep Web...... 649 Ye Ma, Derong Shen, Yue Kou, and Wei Liu

Workflow and Middleware

A Framework for Query Capabilities and Interface Design of Mediators on the Gulf of Mexico Data Sources ...... 660 Longzhuang Li, John Fernandez, and Hongyu Guo

Process Mediation Based on Triple Space Computing ...... 672 Zhangbing Zhou, Brahmananda Sapkota, Emilia Cimpian, Doug Foxvog, Laurentiu Vasiliu, Manfred Hauswirth, and Peng Yu

An Efficient Approach for Supporting Dynamic Evolutionary Change of Adaptive Workflow ...... 684 Daoye Zhang, Dahai Cao, Lijie Wen, and Jianmin Wang

Author Index ...... 697