Khalid Belhajjame

School of Phone: +44 7 72 53 69 128 Fax: +44 161 275 6204 Office 2.104, Kilburn Building [email protected] Oxford Road, M13 9PL, UK http://www.cs.man.ac.uk/˜khalidb

Current Position

• Since November 2004: Research associate, Information Management Group, University of Manchester, Manchester, UK.

Research Interests

• Data mapping/integration, knowledge management, semantic web services, semantic annotation, service oriented computing, scientific data provenance acquisition and ex- ploitation.

Education

• 2000 - 2004: Ph.D. in Computer Science, Department of Computer Science, University of Grenoble, France. - Dissertation: Defining and Orchestrating Open Services for Building Distributed Information Systems. - Supervisors: Prof. Christine Collet and Dr. Genoveva Vargas-Solar - External examiners: Pr. Mokrane Bouzeghoub, Pr. Claude Godart, and Pr. Mohand- Sad Hacid. - Scholarship: MENRT (by the French Ministry of Higher Education and Research).

• 1999 - 2000: M.Sc. in Computer Science, University of Grenoble, France - Dissertation: Active Services for Automating Business Processes. - Scholarship: AUF, Agence Universitaire de la Francophonie. • 1996 - 1999: Engineering Degree in Computer Science, Ecole Nationale Supérieure d’Informatique et d’Analyse des Systèmes, Rabat, Morocco.

Current and Recent Projects

• On-Demand Data Integration: Dataspaces by Refinement.

1 • QuASAR, Quality Assurance of Semantic Annotations of web seRvices. • FuGE, Functional Genomics Experiment. • iSPIDER, In Silico Proteome Integrated Data Environment Resource.

• myGrid, a UK e-Science pilot project.

Professional Experience

• 2005 - present: Teaching assistant, School of Computer Science, University of Manch- ester.

- Course: Software Engineering. - Lab Exercises: Advanced Systems (Querying Semi-Structured Data, Data Integration and Data Mining). • 2006 - 2007: Teaching assistant, Distance Learning, School of Computer Science, Uni- versity of Manchester.

- Course: Introduction to . • 2000 - 2004: Teaching assistant, Engineering School of Computer Science (ENSIMAG), University of Grenoble, France. - Courses: Principles of Database Management Systems, Advanced Databases, and Introduction to Workflow Management Systems. • 1999 - 2000: Research Engineer, University of Grenoble/Electricit de France (EDF). - Topic investigated: The use of active rules for managing sensor-based applications.

Grants

• On-Demand Data Integration: Dataspaces by Refinement, University of Manchester. (PI) and Khalid Belhajjame (Researcher Co-investigator). Funded by the EPSRC. From 01 July 2008 until 30 June 2011. Value £582,679.

Community and Standardisation Activities

• Member of the IEEE Services Computing Community. • Member of the W3C Semantic Web Health Care and Life Sciences Interest Group. • Member of .org.

• Member of the Web Service Modelling Ontology group.

2 • Member of Service Oriented Software Research Network (SOSoRNet). • Member of Workflow Systems in eScience.

Publications

Journals 1. Andrew T. Jones, Allyson L. Lister, Leandro Hermida, Peter Wilkinson, Martin Eise- nacher, Khalid Belhajjame, Frank Gibson, Phil Lord, Matthew Pocock, Heiko Rosenfelder, Javier Santoyo, Anil Wipat, and Norman W. Paton. Modelling and Managing Experimental Data Using FuGE. Omics: Journal of Integrative Biology, 13(3): 239-251, 2009. 2. Andrew Gibson, Matthew Gamble, Katy Wolstencroft, Tom Oinn, Carole A. Goble, Khalid Belhajjame, and Paolo Missier: The Data Playground: An Intuitive Workflow Specification Environment. Future Generation Comp. Syst., 25(4), 2009. 3. Khalid Belhajjame, Suzanne M. Embury, Norman W. Paton, Robert Stevens, and Carole A. Goble. Automatic Annotation of Web Services Based on Workflow Definitions. ACM Transactions on the Web, 2(2), 2008. 4. Khalid Belhajjame, Andrew R. Jones, and Norman W. Paton. A Toolkit for Capturing and Sharing FuGE Experiments. Bioinformatics, 24(22), 2008. 5. Jennifer A. Siepen, Khalid Belhajjame, Julian N. Selley, Suzanne M. Embury, Norman W. Paton, Carole A. Goble, Stephen G. Oliver, Robert Stevens, Lucas Zamboulis, Nigel Martin, Alexandra Poulovassillis, Philip Jones, Richard Ct, Henning Hermjakob, Melissa M. Pentony, David T. Jones, Christine A. Orengo, and Simon J. Hubbard. ISPIDER Central: An Integrated Database Web-Server for Proteomics. Nucleic Acids Research, volume 36, Web Server Issue, 1-6, 2008. 6. Khalid Belhajjame, Genoveva Vargas-Solar, and Christine Collet: Intégration de Ser- vices: Une Analyse Structurée. Ingénierie des Systèmes d’Information, 10(3): 91-110, 2005. (In French)

Book Chapters 7. Cornelia Hedeler, Khalid Belhajjame, Norman W. Paton, Alessandro Campi, Alvaro A.A. Fernandes, and Suzanne M. Embury. Dataspaces. A chapter in S. Ceri and M. Brambilla, editors, Search Computing: Challenges and Directions. Springer, 114-134, 2010. 8. Khalid Belhajjame, Paolo Missier, and Carole A. Goble. Characterising Data Provenance in Scientific Workflows. A chapter in Handbook of Research on Computational Grid Tech- nologies for Life Sciences, Biomedicine and Healthcare, 2009. 9. , Katy Wolstencroft, Antoon Goderis, Duncan Hull, Jun Zhao, Pinar Alper, Philip Lord, Chris Wroe, Khalid Belhajjame, Daniele Turi, Robert Stevens, Tom Oinn, and David D. Roure. Knowledge Discovery for Biology with Taverna: Producing and Consuming Semantics in the Web of Science. In C. J. Baker and K.-H. Cheung, editors, Revolutionizing Knowledge Discovery in the Life Sciences. Springer-Verlag, 355-396, 2007.

3 Conferences 10. Khalid Belhajjame, Norman W. Paton, Suzanne M. Embury, Alvaro A. A. Fernandes, and Cornelia Hedeler. Feedback-Based Annotation, Selection and Refinement of Schema Mappings for Dataspaces. Proceedings of the13th International Conference on Extending Database Technology (EDBT 2010), ACM, 2010. 11. Paolo Missier, Norman Paton, and Khalid Belhajjame. Fine-Grained and Efficient Lin- eage Querying of Collection-Based Workflow Provenance. Proceedings of the13th Interna- tional Conference on Extending Database Technology (EDBT 2010), ACM, 2010. 12. Cornelia Hedeler, Khalid Belhajjame, Alvaro A.A. Fernandes, Suzanne Embury and Norman W. Paton. Dimensions of Dataspaces. Proceedings of the 26th British National Conference on Databases, Springer, 55-66, 2009. 13. Lu Mao, Khalid Belhajjame, Norman W. Paton, and Alvaro A. A. Fernandes. Defining and Using Schematic Correspondences for Automatically Generating Schema Mappings. Proceedings of the 21st International Conference on Advanced Information Systems Engi- neering (CAiSE 2009), Springer, 79-93, 2009. 14. Khalid Belhajjame, Katy Wolstencroft, Oscar Corcho, Tom Oinn, Franck Tanoh, Alan William, and Carole A. Goble. Metadata Management in the Taverna Workflow System. Proceedings of the International Conference on Computational Science (4), 651-656, 2008. 15. Khalid Belhajjame, Addressing the Issue of Service Volatility in Scientific Workflows, Proceedings of the 5th International Conference of Service Oriented Computing, Vienna, Springer, 377-382, 2007. 16. Khalid Belhajjame. Semantic Replaceability of eScience Web Services. Proceedings of the IEEE International Conference on e-Science and Grid Computing, IEEE CS, 449-456, 2007. 17. Khalid Belhajjame, Suzanne M. Embury, Norman W. Paton, Robert Stevens, and Car- ole Goble, Automatic Annotation of Web Services Based on Workflow Definitions, Proceed- ings of the 5th International Semantic Web Conference, Springer, 116-129, 2006. 18. Khalid Belhajjame, Genoveva Vargas-Solar, and Christine Collet, Building Information Systems by Orchestrating Open Services, Proceedings of the Ninth International Database Engineering and Applications Symposium (IDEAS 2005), IEEE CS, 27-36,2005. 19. The MediaGrid project, Towards a Mediation System Framework for Transparent Ac- cess to Largely Distributed Sources, Proceedings of the 1st International Conference on Se- mantics of a Networked World (ICSNW’04), Springer, 65-78, 2004. 20. Genoveva Vargas-Solar, Khalid Belhajjame, Esau E. Castillo-Contrares, and Karla J. Peredo-Marquez, Terra-Acqua: Adaptable Definition and Execution of Workflows, Proceed- ings of the 4th International Conference on Web Information System Engineering (WISE’03), IEEE CS, 299-302, 2003. 21. Khalid Belhajjame, Genoveva Vargas-Solar, and Christine Collet, Defining and Coor- dinating Services Using Workflows, Proceedings of the Tenth International Conference on Cooperative Information Systems, (CoopIS’03). Springer, 110-128,2003.

4 22. Khalid Belhajjame, Genoveva Vargas-Solar, and Christine Collet, A Flexible Workflow Model for Process-Oriented Opplications, Proceedings of the 2nd International Conference on Web Information Systems Engineering, (WISE’01), IEEE CS, 72-80, 2001. 23. Khalid Belhajjame, Genoveva Vargas-Solar, and Christine Collet, Towards an Adapt- able Workflow Management System, Proceedings of the 17mes Journes Bases de Donnes Avances, BDA’2001, 2001. 24. Genoveva Vargas-Solar, Khalid Belhajjame, and Christine Collet, A Parametric Model for Adaptable Workflow Management, Proceedings of the 3rd Mexican International Confer- ence on Computer Science (ENC’01), 2001.

Workshops

25. Paolo Missier, Katy Wolstencroft, Franck Tanoh, Peter Li, Sean Bechhofer, Khalid Bel- hajjame and Carole Goble. Functional Units: Abstractions for Web Service Annotations. Proceedings of the Fourth International Workshop on Scientific Workflows (SWF 2010), IEEE CS, 2010.

26. Cornelia Hedeler, Khalid Belhajjame, Lu Mao, Norman W. Paton, Alvaro A. A. Fernan- des, Chenjuan Guo and Suzanne M. Embury. Flexible Dataspace Management Through Model Management, Proceedings of the 2010 EDBT/ICDT Workshops, ACM Conference Pro- ceeding Series; Vol. 426, 2010.

27. Khalid Belhajjame and Marco Brambilla. Ontology-Based Description and Discovery fo Business Processes. Proceedings of the 10th Workshop on Business Process Modeling, Development, and Support (BPMDS) at CAiSE 2009, Springer LNBIP, vol. 29, pp. 85-98, 2009. 28. Paolo Missier, Khalid Belhajjame, Jun Zhao, and Carole Goble, Data Lineage Model for Taverna Workflows with Lightweight Annotation Requirements. Proceedings of the Second International Provenance and Annotation Workshop (IPAW’08), Springer, 17-30, 2008. 29. Khalid Belhajjame, Suzanne M. Embury, Norman W. Paton. On Characterising and Identifying Mismatches in Scientific Workflows. Proceedings of the 3rd International Work- shop on Data Integration in the Life Sciences (DILS’06), Springer, 240-247, 2006.

30. Lucas Zamboulis, Hao Fan, Khalid Belhajjame, Jennifer A. Siepen, Andrew C. Jones, Nigel J. Martin, Alexandra Poulovassilis, Simon J. Hubbard, Suzanne M. Embury, and Norman W. Paton. Data Access and Integration in the ISPIDER Proteomics Grid. In 3rd International Workshop on Data Integration in the Life Sciences (DILS 06), Springer, 3-18, 2006.

31. Khalid Belhajjame, Genoveva Vargas-Solar, and Christine Collet, Integrating Services Using Workflows, Workshop organised by the ECI working group, Lyon, 1-18, 2004.

Invited Talks

32. Dipartimento di Elettronica e Informazione, Politecnico di Milano, On Inferring, Using and Verifying Semantic Web Services, Milan-Italy, November 2007.

5 33. Third Franco-Mexican Summer School on Distributed Systems, Service Oriented Com- puting: A New Paradigm for Building Large and Distributed Web-Based Applications, Puebla-Mexico, August 2004.

Professional Activities

Editorial Activities

• Co-chair of the First International Workshop on Semantic Metadata Management and Ap- plications (SeMMA) colocated with the Fifth European Semantic Web Conference ESWC2008, June 2, 2008 Teneriffe, Spain

Reviewer for Journals

• ACM Computing Surveys (2010). • ACM Transactions on the Web (2008).

• Data and Knowledge Engineering (2010). • Future Generation Computer Systems (2008 and 2009). • IEEE Internet computing (2008). • IEEE Transactions on Automation Science and Engineering (2008 and 2009).

• Information Systems (2008). • Journal of Systems and Software (2009 and 2010). • Journal of Web Semantics (2008).

Program Committee Membership

• Data and Services Management in Mobile Environments (DSMME 2008), colocated with ICDE 2008. • International Conference on Web Engineering (ICWE 2010).

• Mexican International Conference on Computer Science (ENC 2008).

External Reviewing

• ACM SIGMOD International Conference (SIGMOD 2004). • British National Conference on Databases (BNCOD 2005).

• First International Workshop on REsource Discovery (RED 2008). • French Database Conference (BDA 2001/2006)

6 • International Conference on Extending Database Technology (EDBT 2002/04/08/09/10). • International Conference on Web Information Systems Engineering (WISE 2001/02). • International Meeting High Performance Computing for Computational Science (2006). • International Provenance and Annotation Workshop (IPAW 2008). • International Semantic Web Conference (ISWC 2008). • International Workshop on Scientific Workflows and Business Workflow Standards in e-Science (SWBES 2008). • International Workshop on Semantic Matchmaking and Resource Retrieval (SMR 2006). • International World Wide Web Conference (WWW 2009). • Scientific and Statistical Database Management Conference (SSDBM 2010). • Very Large Data Bases Conference (VLDB 2005/2010).

Softwares The results of my research formed the basis of several tools. Below is a list of the most popular ones. The symbol ? indicates that I was the sole designer and developer of the tool in question. • QuASAR?, a tool for verifying semantic annotations of web services. • FuGE?, a toolkit for capturing and sharing proteomics experiments. • Taverna, a workflow system for specifying and enacting scientific workflows. • Biocatalogue, a curated catalogue for bioinformatics web services.

Students Co-Supervision

PhD Students • 2007 - Present: Lu Mao, Query Evaluation on Schematic Correspondences in the Context of Dataspaces. Supervised together with Norman W. Paton and Alvaro A. A. Fernandes. • 2007 - Present: Chenjuan Guo, Matching Schemas for Dataspaces. Supervised together with Norman W. Paton and Alvaro A. A. Fernandes. • 2008 - Present: Ravinder Singh, A Framework for Web Service Annotations. Supervised together with Suzanne M. Embury and Sean Bechhofer.

Masters Students • 2007 - 2008: Ahmad Golzar, Using Source Code Analysis Techniques to Assist the Se- mantic Annotation of Web Services. Supervised together with Suzanne M. Embury.

7 • 2006 - 2007: Sihao Chen, Automatic Generation of Web Service Interfaces Based on Semantic Annotations of Web Services. Supervised together with Suzanne M. Embury. • 2005 - 2006: Sinan Huang, On Generating Web Interfaces for Software Programs. Su- pervised together with Suzanne M. Embury.

• 2004 - 2005: Pedro Lozada, Developing a Portal for Web Services. Supervised together with Genoveva Vargas-Solar. • 2003 - 2004: Esa E. Castillo Contreras, Developing an Adabtable Workflow Enactment Engine. Supervised together with Geneoveva Varags-Solar.

• 2003 - 2004: Karla Joana Peredo Mrquez, A workbench for Designing Business Work- flows. Supervised together with Genoveva Vargas-Solar.

Undergraduate Students

• 2004: Aurélie Cambera, Building a Query Processor Based on Active Rules. Supervised together with Christine Collet.

8