The Semantic Grid: Enabling E-Science

Total Page:16

File Type:pdf, Size:1020Kb

The Semantic Grid: Enabling E-Science The Semantic Grid: Enabling e-Science David De Roure University of Southampton www.semanticgrid.org PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Outline 1. The ambition 2. Enabling Technologies n Grid n Semantic Web 3. Semantic Grid 4. Are we there yet? 5. The Future May 2006 ISGC 2 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Outline 1. The ambition 2. Enabling Technologies n Grid n Semantic Web 3. Semantic Grid 4. Are we there yet? 5. The Future May 2006 ISGC 3 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Vision: e-Science e-Science is about global collaboration in key areas of science and the next generation of [computing] infrastructure that will enable it. e-Science will change the dynamic of the way science is undertaken. John Taylor, Director General of UK Research Councils in 2001 May 2006 ISGC 4 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Vision: Grid Grid computing has emerged as an important new field, distinguished from conventional distributed computing by its focus on large-scale resource sharing, innovative applications, and, in some cases, high- performance orientation...we [define] the "Grid problem”…as flexible, secure, coordinated resource sharing among dynamic collections of individuals, institutions, and resources -what we refer to as virtual organizations From "The Anatomy of the Grid: Enabling Scalable Virtual Organizations" by Foster, Kesselmanand Tuecke May 2006 ISGC 5 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Requirements These visions require an infrastructure for flexible, coordinated resource sharing – they are fundamentally about joining resources up, automatically, in order to do things that weren’t possible before Instruments Computation Services Data People Devices May 2006 ISGC 6 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Challenges: virtual orgs n Resource configurations are n Scale of data and compute transient, dynamic and resources is large volatile as services (databases, n Quality of Service and sensors, compute servers) performance criteria are switched in and out severe n They are ad-hoc as service n Platform must be scalable, consortia have no central able to evolve, fault-tolerant, location or control, and no robust, persistent and existing trust relationships reliable n They may be large, with n It should work seamlessly, hundreds of services and transparently – the user orchestrated at any time might not know or care n They may be long-lived, for where their calculation is example a protein folding done using how many simulation could take weeks machines, or where data is actually held May 2006 ISGC 7 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Challenges: integration May 2006 ISGC 8 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com myGrid n Wish to reuse n Data n Services n Knowledge n SofCtowmbaerechem n Practice n Anticipated use n Unanticipated use May 2006 ISGC 9 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Outline 1. The ambition 2. Enabling Technologies n Grid n Semantic Web 3. Semantic Grid 4. Are we there yet? 5. The Future May 2006 ISGC 10 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com n An automatically processable, n On demand transparently machine understandable web constructed multi- organisational federations n Distributed knowledge and of distributed services information management n Distributed computing n Information integration middleware n Computational Integration Carole Goble May 2006 ISGC 11 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com May 2006 ISGC 12 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Origins of the Semantic Web The Semantic Web is an extension of the current Web in which information is given a well-defined meaning, better enabling computers and people to work in cooperation. It is the idea of having data on the Web defined and linked in a way that it can be used for more effective discovery, automation, integration and reuse across various applications. The Web can reach its full potential if it becomes a place where data can be processed by automated tools as well as people. W3C Activity Statement May 2006 ISGC 13 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Layers of Languages You are here May 2006 ISGC 14 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Web vsSemantic Web Web page Any Web Resource <a href= URI> HTML <a href=“http://…”> URI URI URI RDF is like the web! RDF Hendler May 2006 ISGC 15 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Making Knowledge Explicit All influenced by RDF OWL Lite(thesaurus) OWL DL (reason-able) OWL Full (anything goes) RDF Resource Description OWL Web Ontology Framework Language May 2006 ISGC 16 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Applications connected by concepts May 2006 ISGC 17 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Outline 1. The ambition 2. Enabling Technologies n Grid n Semantic Web 3. Semantic Grid 4. Are we there yet? 5. The Future May 2006 ISGC 18 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com So what is the Semantic Grid? n The Grid vision is about coordinated resource sharing – virtual organisations n Fundamentally, this means that information and knowledge in and about the system must be ‘machine processable’ n The full richness of the Grid ambition depends upon realisingthe Semantic Grid May 2006 ISGC 19 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com The Semantic Grid Report 2001 n At this time, there are a number of grid applications being developed and there is a whole raft of computer technologies that provide fragments of the necessary functionality. n However there is currently a major gap between these endeavours and the vision of e-Science in which there is a high degree of easy-to-use and seamless automation and in which there are flexible collaborations and computations on a global scale. www.semanticgrid.org NB Report updated – March 2005 issue of Proceedings of the IEEE May 2006 ISGC 20 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Building bridges May 2006 ISGC 21 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Semantic Grid Semantic Semantic Web Grid Classical Classical Web Grid May 2006 ISGC 22 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Semantics in and on the Grid The Semantic Grid is an extension of the current Grid in which information and services are given well-defined meaning, better enabling computers and people to work in cooperation May 2006 ISGC 23 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Semantics in and on the Grid The Semantic Grid May 2006 ISGC 24 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Grid is metadata based middleware 1. Portals and Workbenches Astronomy Sky Survey Data Grid 2.Knowledge & Resource 3. Metadata Data Catalog Bulk Data Management View View Analysis Analysis Concept space Standard APIs and Protocols 4.Grid Information Metadata Data Data Security 5. Discovery delivery Discovery Delivery Caching Replication Standard Metadata format, Data model, Wire format Backup Scheduling 6. Catalog Mediator Data mediator Catalog/Image Specific Access 7. Compute Resources Derived Collections Catalogs Data Archives May 2006 ISGC 25 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com For example… Annotations Personal notes results provenance May 2006 ISGC 26 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Semantics in e-Science n RDF-based service and data registries n RDF-based metadata for experimental components n RDF-based provenance graphs n OWL based controlled vocabularies for database content n OWL based integration May 2006 ISGC 27 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Engineering Design Reliability Security QoS GEODISE PORTAL Traceability OPTIMISATION APPLICATION SERVICE COMPUTATION PROVIDER Licenses and code May 2006 ISGC 28 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com Combe Chempilot project e e May 2006 ISGC 29 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com FloodNet subscribers users GPRS May 2006 ISGC 30 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com GGF9 Semantic Grid Workshop n The Role of Concepts in myGrid Carole Goble n Planning and Metadata on the Computational Grid Jim Blythe n Semantic support for Grid-Enabled Design Search in Engineering Simon Cox n Knowledge Discovery and Ontology-based services on the Grid Mario Cannataro n Attaching semantic annotations to service descriptions Luc Moreau n Semantic Matching of Grid Resource Description Frameworks John Brooke n Interoperability challenges in Grid for Industrial Applications Mike Surridge n Semantic Grid and Pervasive Computing David De Roure May 2006 ISGC 31 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com E-Science Special Issue n IEEE Intelligent Issue Special Issue on E-Science, Jan-Feb 2004 n De Roure, Gil, Hendler n Challenges: n Realisingthe network effect n Moving beyond centralisedstores n Automated assembly n Collaboration tools May 2006 ISGC 32 PDF 檔案以 "pdfFactory Pro" 試用版建立 www.pdffactory.com GGF11 Semantic Grid Workshop n Engineering semantics: Costs and n Using the Semantic Grid to Build Benefits Cox Bridges between Museums and Indigenous Communities Schroeter n Designing Ontologies and Distributed Resource Discovery n Collaborative Tools in the Semantic Grid De Roure Services for an Earthquake Simulation Grid Pierce n The Integration of Peer-to-peer and the Grid to Support Scientific n Exploring Williams-Beuren Collaboration Syndrome Using myGrid Goble n OWL-Based Resource Discovery n Distributed Data Management and for Inter-Cluster Resource Integration Framework: The Borrowing Yoshida MobiusProject Hastings n Semantic
Recommended publications
  • The Design and Realisation of the Myexperiment Virtual Research Environment for Social Sharing of Workflows
    The Design and Realisation of the myExperiment Virtual Research Environment for Social Sharing of Workflows David De Roure a Carole Goble b Robert Stevens b aUniversity of Southampton, UK bThe University of Manchester, UK Abstract In this paper we suggest that the full scientific potential of workflows will be achieved through mechanisms for sharing and collaboration, empowering scientists to spread their experimental protocols and to benefit from those of others. To facilitate this process we have designed and built the myExperiment Virtual Research Environ- ment for collaboration and sharing of workflows and experiments. In contrast to systems which simply make workflows available, myExperiment provides mecha- nisms to support the sharing of workflows within and across multiple communities. It achieves this by adopting a social web approach which is tailored to the par- ticular needs of the scientist. We present the motivation, design and realisation of myExperiment. Key words: Scientific Workflow, Workflow Management, Virtual Research Environment, Collaborative Computing, Taverna Workflow Workbench 1 Introduction Scientific workflows are attracting considerable attention in the community. Increasingly they support scientists in advancing research through in silico ex- perimentation, while the workflow systems themselves are the subject of ongo- ing research and development (1). The National Science Foundation Workshop on the Challenges of Scientific Workflows identified the potential for scientific advance as workflow systems address more sophisticated requirements and as workflows are created through collaborative design processes involving many scientists across disciplines (2). Rather than looking at the application or ma- chinery of workflow systems, it is this dimension of collaboration and sharing that is the focus of this paper.
    [Show full text]
  • Semantic Web: a Review of the Field Pascal Hitzler [email protected] Kansas State University Manhattan, Kansas, USA
    Semantic Web: A Review Of The Field Pascal Hitzler [email protected] Kansas State University Manhattan, Kansas, USA ABSTRACT which would probably produce a rather different narrative of the We review two decades of Semantic Web research and applica- history and the current state of the art of the field. I therefore do tions, discuss relationships to some other disciplines, and current not strive to achieve the impossible task of presenting something challenges in the field. close to a consensus – such a thing seems still elusive. However I do point out here, and sometimes within the narrative, that there CCS CONCEPTS are a good number of alternative perspectives. The review is also necessarily very selective, because Semantic • Information systems → Graph-based database models; In- Web is a rich field of diverse research and applications, borrowing formation integration; Semantic web description languages; from many disciplines within or adjacent to computer science, Ontologies; • Computing methodologies → Description log- and a brief review like this one cannot possibly be exhaustive or ics; Ontology engineering. give due credit to all important individual contributions. I do hope KEYWORDS that I have captured what many would consider key areas of the Semantic Web field. For the reader interested in obtaining amore Semantic Web, ontology, knowledge graph, linked data detailed overview, I recommend perusing the major publication ACM Reference Format: outlets in the field: The Semantic Web journal,1 the Journal of Pascal Hitzler. 2020. Semantic Web: A Review Of The Field. In Proceedings Web Semantics,2 and the proceedings of the annual International of . ACM, New York, NY, USA, 7 pages.
    [Show full text]
  • Scientific Social Objects the Social Objects and Multidimensional Network of the Myexperiment Website
    Scientific Social Objects The Social Objects and Multidimensional Network of the myExperiment Website David De Roure Sean Bechhofer and Carole Goble David Newman Oxford e-Research Centre School of Computer Science Electronics and Computer Science University of Oxford The University of Manchester University of Southampton Oxford, UK Manchester, UK Southampton, UK [email protected] [email protected] [email protected] Abstract— Scientific research is increasingly conducted digitally content, and with nearly 2000 workflows it provides the largest and online, and consequently we are seeing the emergence of new collection available. It is, however, characteristically a digital objects shared as part of the conduct and discourse of ‘boutique’ site with a specialist audience. science. These Scientific Social Objects are more than lumps of domain-specific data: they may comprise multiple components We propose that workflows, and myExperiment as a which can also be shared separately and independently, and some resource, provide a useful case study in scientific social contain descriptions of scientific processes from which new objects. Workflows are indeed social objects that are shared objects will be generated. Using the myExperiment social website and used by researchers, but significantly they are composite as a case study we explore Scientific Social Objects and discuss objects containing heterogeneous components which can be their evolution. shared separately and independently. Since they capture process they are also prescriptions for the creation of other Keywords—Scientific Social Object; workflow; Research Object objects. This distinguishes them from an object type such as a photo or collection of photos. I.
    [Show full text]
  • Conceptual Open Hypermedia = Semantic Web?
    Conceptual Open Hypermedia = Semantic Web? Carole Goble, Sean Bechhofer Information Management Group University of Manchester, UK Leslie Carr, Wendy Hall, David De Roure Intelligence, Agents, Multimedia Group University of Southampton, UK Take Home Message • The Semantic Web is a Web – a collection of linked nodes. • People will use it as well as agents, and for them, navigation of links is a key mechanism for exploring the space. Question • Can we use semantic metadata to support the construction of our hypertexts… • …and if so, does it help? Technologies for the Semantic Web Three pieces needed for implementing the Semantic Web: • Ontologies • Hypertext Architecture • Web Framework Ontologies Hypertext Web Framework Architecture Ontologies • The Semantic Web relies on the provision of Semantics. • Representations and tools are thus required for the – delivery; – construction; Ontologies – maintenance; – management of ontologies Hypertext Web Framework Architecture Hypertext Architecture • The Semantic Web is a Web. • Thus we need an underlying architecture supporting the notion of nodes and links. Ontologies Hypertext Web Framework Architecture Web Framework • A delivery mechanism that: – conforms to existing standards; – can scale Ontologies Hypertext Web Framework Architecture Historical Combinations • Open Hypermedia Systems and Link Services. – DLS – XLink • Ontology Services for document metadata. Ontologies – DAML+OIL, RDFS – SHOE, On2Broker • Conceptual Hypermedia – Nanard Hypertext – Tudhope Web Framework Architecture • The Semantic Web? COHSE Philosophy • Metadata can provide a mechanism not only for the support of resource discovery, but also for the provision of source anchors. • Annotation allows both linking into and out of a resource. Resource Discovery Link Construction COHSE Prototype System • A software agent that generates and presents links on behalf of both an author and a reader.
    [Show full text]
  • James A. Hendler, Phd
    James A. Hendler, PhD Tetherless World Professor & Director, Rensselaer Institute for Data Exploration and Applications Rensselaer Polytechnic Institute [email protected] US citizen, Security Clearance (on request) 518-276-4401 phone Social Security # (on request) 518-276-4464 Fax http://www.cs.rpi.edu/~hendler Twitter: @jahendler Education May, 1986 Department of Computer Science, Brown University, Providence, Rhode Island. PhD -- Computer Science, Artificial Intelligence. Thesis title: Integrating Marker-Passing and Problem-Solving: A Spreading Activation Approach to Improved Choice in Planning. May, 1983 Department of Computer Science, Brown University, Providence, Rhode Island. ScM -- Computer Science, Artificial Intelligence. Thesis project: Implementation of a pseudo-parallel marker-passing system within a frame representation language. May, 1982 Psychology Department, Southern Methodist University, Dallas, Tx. MS -- Cognitive Psychology, Human Factors Engineering. Thesis title: The Effects of a System-Imposed Grammatical Restriction on Interactive Natural Language Dialog May, 1978 Department of Computer Science, Yale University, New Haven, Ct. BS -- Computer Science, Artificial Intelligence. Experience in Higher Education 2007–pres. Rensselaer Polytechnic University (RPI) 1/07 – pres, Tetherless World Constellation Chair (Endowed), joint positions in computer and cognitive science departments 9/10 – 7/12, Program Director, Information Technology and Web Science 7/12 – 9/13, Department Head, Computer Science 9/13 –pres, Director, Institute for Data Exploration and Applications Affiliate faculty, Experimental Media and Performing Arts Center (2009- ) Affiliate, Industrial and System Engineering (2015 - ) Affiliate faculty, Center for Materials, Devices, and Integrated Systems (2015- ) 1 1986-12/06 University of Maryland, College Park 8/99-12/06. Full Professor, 8/92-7/99. Associate Professor 1/86-8/92, Assistant Professor Director, Joint Institute for Knowledge Discovery, 1/04-12/06.
    [Show full text]
  • Lessons from Myexperiment: Two Insights Into Emerging E-Research Practice
    Lessons from myExperiment: Two insights into emerging e-Research practice Lessons from myExperiment: Two insights into emerging e-Research practice David De Roure Carole Goble University of Southampton University of Manchester Southampton, UK Manchester, UK [email protected] [email protected] Abstract. The design of the myExperiment social web site for scientists adopted the Web 2.0 design principles, investigating the question: does Web 2.0 work for scientists? Two years later the site has thousands of users and the largest public collection of scientific workflows of its kind. Here we reflect on two of the Web 2.0 design principles and the insights we have gained into Web 2.0 for researchers. They reveal new forms of sharable object and new ways of assembling data and services. 1. Introduction The myExperiment Virtual Research Environment [1] has successfully adopted a Web 2.0 approach in delivering a social web site where scientists can safely publish their scientific workflows and other artefacts, share them with groups and find those of others. While it shares many characteristics with other Web 2.0 sites, the myExperiment’s distinctive features to meet the needs of its research user base are support for credit, attributions and licensing, fine control over privacy, a federation model and the ability to execute workflows. The myExperiment design was consistent with the Web 2.0 design patterns described by O’Reilly [2], a process we reported at the outset of the project [3]. A prototype was launched in July 2007 to gather requirements, and the current www.myexperiment.org site went live after a further 5 months of development and has since evolved substantially in “perpetual beta”.
    [Show full text]
  • Conceptual Open Hypermedia = the Semantic Web?
    Conceptual Open Hypermedia = The Semantic Web? Carole Goble, Sean Bechhofer Les Carr, David De Roure, Wendy Hall Information Management Group Intelligence, Agents, Multimedia Department of Computer Science Department of Electronics and Computer Science, University of Manchester University of Southampton, Oxford Road Southampton SO17 1BJ, UK Manchester M13 9PL, UK +44 23 8059 3255 +44 161 275 2000 {carole, seanb}@cs.man.ac.uk {lac, dder, wh}@ecs.soton.ac.uk ABSTRACT 1. INTRODUCTION The Semantic Web is still a web, a collection of linked nodes. The Semantic Web—a web of data defined and linked in such Navigation of links is currently, and will remain for humans if a way that its meaning is explicitly interpretable by software not machines, a key mechanism for exploring the space. The processes rather than just being implicitly interpretable by Semantic Web is viewed by many as a knowledge base, a humans—is a vision held by many and articulated by Tim database or an indexed and searchable document collection; in Berners Lee [5]. The issues surrounding the implementation the work discussed here we view it as a hypertext. of the Semantic Web are the focus of much current research. Much of this work, however, focuses on particular issues such The aim of the COHSE project is to research into methods to as the implementation and representation of metadata, for improve significantly the quality, consistency and breadth of example the OIL and DAML web languages [12, 15] or the linking of Web documents at retrieval time (as readers browse activity of resource discovery, i.e.
    [Show full text]
  • GGF11 Semantic Grid Applications Workshop
    GGF11 Semantic Grid Applications Workshop Honolulu June 10, 2004 Editors: Danius Michaelides Luc Moreau Coordinators: David De Roure Simon Cox Geoffrey Fox Carole Goble Workshop organizers The workshop is being held by the GGF Semantic Grid Research Group (SEM-RG) in conjunction with the Applications and Testbeds Working Group (APPS-RG). The Workshop Co-Chairs are: David De Roure (Chair) Semantic Grid Research Group Geoffrey Fox Grid Computing Environments and Semantic Grid Research Groups Carole Goble Semantic Grid Research Group Simon J. Cox Applications Working Group Proceedings Luc Moreau (proceedings chair) University of Southampton, UK Danius Michaelides (submission chair) University of Southampton, UK Program Committee Mark Baker University of Portsmouth, UK Jim Blythe Information Sciences Institute, USC William Johnston Lawrence Berkeley National Laboratory Kerstin Kleese CLRC Daresbury Laboratory, UK Libby Miller ILRT, University of Bristol, UK Jim Myers Pacific Northwest National Laboratory Marlon Pierce Indiana University i Table of Contents Foreword to the GGF11 Semantic Grid Applications Workshop. iii David De Roure Designing Ontologies and Distributed Resource Discovery Services for an Earthquake Simulation Grid. 1 Mehmet S. Aktas, Marlon Pierce, Geoffrey C. Fox Collaborative Tools in the Semantic Grid. 7 Michelle Bachler, Jessica Chen-Burger, David De Roure, Simon Buck- ingham Shum, Jeff Dalton, Danius Michaelides, Marc Eisenstadt, Kevin Page, Stephen Potter, Jiri Komzak, Austin Tate, Nigel Shad- bolt Distributed Data Management and Integration Framework: The Mo- bius Project. 20 Shannon Hastings, Stephen Langella, Scott Oster, Joel Saltz Interoperability and Transformability through Semantic Annotation of a Job Description Language. .39 Jeffrey Hau, William Lee, Steven Newhouse Using the Semantic Grid to Buildp Bridges between Museums and Indigenous Communities.
    [Show full text]
  • Understanding and Combating Decay in Taverna Workflows
    The University of Manchester Research Why workflows break — understanding and combating decay in Taverna workflows DOI: 10.1109/eScience.2012.6404482 Document Version Accepted author manuscript Link to publication record in Manchester Research Explorer Citation for published version (APA): Zhao, J., Gomez-perez, J. M., Belhajjame, K., Klyne, G., Garcia-cuesta, E., Garrido, A., Hettne, K., Roos, M., De Roure, D., & Goble, C. (2012). Why workflows break — understanding and combating decay in Taverna workflows. 1-9. Paper presented at 2012 IEEE 8th International Conference on E-Science (e-Science), . https://doi.org/10.1109/eScience.2012.6404482 Citing this paper Please note that where the full-text provided on Manchester Research Explorer is the Author Accepted Manuscript or Proof version this may differ from the final Published version. If citing, it is advised that you check and use the publisher's definitive version. General rights Copyright and moral rights for the publications made accessible in the Research Explorer are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. Takedown policy If you believe that this document breaches copyright please refer to the University of Manchester’s Takedown Procedures [http://man.ac.uk/04Y6Bo] or contact [email protected] providing relevant details, so we can investigate your claim. Download date:28. Sep. 2021 Why Workflows Break - Understanding
    [Show full text]
  • How Semantic Web Knowledge Technologies Might Improve Natural Language Querying Systems
    UNIVERSITY OF SOUTHAMPTON Faculty of Engineering, Science and Mathematics School of Electronics and Computer Science A progress report submitted for continuation towards a PhD Supervisor: Prof. David De Roure Examiner: TBC How Semantic Web Knowledge Technologies Might Improve Natural Language Querying Systems by David R. Newman July 19, 2006 UNIVERSITY OF SOUTHAMPTON ABSTRACT FACULTY OF ENGINEERING, SCIENCE AND MATHEMATICS SCHOOL OF ELECTRONICS AND COMPUTER SCIENCE A progress report submitted for continuation towards a PhD by David R. Newman Natural Language (NL) querying has an extensive history, which this report reviews to determine what constitutes a NL querying system. It then examines Semantic Web (SW) knowledge technologies and considers how these may be used to improve an NL querying system. This report looks at the CombeChem project for an example of how these knowledge technologies can be used. Then it describes the partial implementation of a NL querying system using SW knowledge technologies. The report concludes by considering some applications that may benefit from this type of of NL querying system and how, through the use of ontologies, some of the inherent flaws in existing NL querying systems can be resolved. The report finishes my laying out a plan for future work. Contents 1 Introduction 1 2 Natural Language (NL) Querying 3 2.1 The Origins of NL Querying . 3 2.2 NL Querying Pros and Cons . 4 2.3 The Data Flow of a NL Querying System . 5 2.3.1 Natural Language Parsers (NLPs) . 5 2.3.2 NLP Output Evaluation . 8 2.3.3 Terminology Standardisation . 9 2.3.4 Machine-Readable (M-R) Query Generation .
    [Show full text]
  • Workflow-Centric Research Objects: First Class Citizens in Scholarly
    Workflow-Centric Research Objects: First Class Citizens in Scholarly Discourse Khalid Belhajjame1, Oscar Corcho2, Daniel Garijo2, Jun Zhao4, Paolo Missier9, David Newman5, Ra´ulPalma6, Sean Bechhofer1, Esteban Garc´ıa Cuesta3, Jos´eManuel G´omez-P´erez3, Graham Klyne4, Kevin Page4, Marco Roos7, Jos´eEnrique Ruiz8, Stian Soiland-Reyes1, Lourdes Verdes-Montenegro8, David De Roure4, Carole A. Goble1 1 University of Manchester, UK. 2 Ontology Engineering Group, Universidad Polit´ecnicade Madrid, Spain. 3 iSOCO, Spain. 4 University of Oxford, UK. 6 Poznan Supercomputing and Networking Center, Poland. 7 Leiden University Medical Centre, Netherlands. 8 Instituto de Astrof´ısicade Andaluc´ıa,CSIC, Spain. 9 Newcastle University, UK. [email protected] Abstract. A workflow-centric research object bundles a workflow, the provenance of the results obtained by its enactment, other digital objects that are relevant for the experiment (papers, datasets, etc.), and anno- tations that semantically describe all these objects. In this paper, we propose a model to specify workflow-centric research objects, and show how the model can be grounded using semantic technologies and exist- ing vocabularies, in particular the Object Reuse and Exchange (ORE) model and the Annotation Ontology (AO). We describe the life-cycle of a research object, which resembles the life-cycle of a scientific experiment. 1 Introduction Scientific workflows are used to describe series of structured activities and com- putations that arise in scientific problem-solving, providing scientists from vir- tually any discipline with a means to specify and enact their experiments [3]. From a computational perspective, such experiments (workflows) can be defined as directed acyclic graphs where the nodes correspond to analysis operations, which can be supplied locally or by third party web services, and where the edges specify the flow of data between those operations.
    [Show full text]
  • Dagrep V008 I006 P173 18262.Pdf (9
    Report from Dagstuhl Perspectives Workshop 18262 10 Years of Web Science: Closing The Loop Edited by Susan Halford1, James A. Hendler2, Eirini Ntoutsi3, and Steffen Staab4 1 University of Southampton, GB, [email protected] 2 Rensselaer Polytechnic Institute, Troy, US, [email protected] 3 Leibniz Universität Hannover, DE, [email protected] 4 Universität Koblenz-Landau, DE, [email protected] Abstract This report documents the program and the outcomes of Dagstuhl Seminar 18262 “10 years Web Science: Closing the Loop” that took place in Schloss Dagstuhl from 25-29 June 2018. In total, an interdisciplinary team of 22 researchers from computer science, sociology, philosophy and law gathered and discussed on the past, present and future of Web Science and what sort of actions the community should take to stay faithful to its initial mission for societal good. The role of Web Science is more critical than ever given the ever growing impact of the Web in our society. Seminar June 24–29, 2018 – http://www.dagstuhl.de/18262 2012 ACM Subject Classification Information systems → Collaborative and social computing systems and tools, Information systems → World Wide Web Keywords and phrases Dagstuhl Report, Web Science, Dagstuhl Perspectives Workshop Digital Object Identifier 10.4230/DagRep.8.6.173 Edited in cooperation with Fabien Gandon, Bettina Berendt, Katharina Kinder-Kurlanda, Pinelopi Troullinou 1 Executive Summary Eirini Ntoutsi License Creative Commons BY 3.0 Unported license © Eirini Ntoutsi This Dagstuhl Seminar aimed at bringing together researchers from different disciplines related to Web Science, namely computer science, sociology, philosophy and law to discuss on future of Web Science and how it can stay faithful to its initial mission for societal good.
    [Show full text]