An Entity Retrieval Model

Total Page:16

File Type:pdf, Size:1020Kb

An Entity Retrieval Model Searching Web Data: an Entity Retrieval Model Renaud Delbru Supervisor: Dr. Giovanni Tummarello Internal Examiner: Prof. Stefan Decker External Examiner: Dr. J´er^omeEuzenat External Examiner: Dr. Fabrizio Silvestri Dissertation submitted in pursuance of the degree of Doctor of Philosophy Digital Enterprise Research Institute Galway National University of Ireland, Galway / Ollscoil na hEireann,´ Gaillimh December 14, 2011 Abstract More and more (semi) structured information is becoming available on the Web in the form of documents embedding metadata (e.g., RDF, RDFa, Microformats and others). There are already hundreds of millions of such documents accessible and their number is growing rapidly. This calls for large scale systems providing effective means of searching and retrieving this semi-structured information with the ultimate goal of making it exploitable by humans and machines alike. This dissertation examines the shift from the traditional web doc- ument model to a web data object (entity) model and studies the challenges and issues faced in implementing a scalable and high per- formance system for searching semi-structured data objects on a large heterogeneous and decentralised infrastructure. Towards this goal, we define an entity retrieval model, develop novel methodologies for sup- porting this model, and design a web-scale retrieval system around this model. In particular, this dissertation focuses on the following four main aspects of the system: reasoning, ranking, indexing and querying. We introduce a distributed reasoning framework which is tolerant against low data quality. We present a link analysis approach for computing the popularity score of data objects among decentralised data sources. We propose an indexing methodology for semi-structured data which offers a good compromise between query expressiveness, query processing and index maintenance compared to other approaches. Finally, we develop an index compression technique which increase both the update and query throughput of the system. The resulting system can index billions of data objects and provides keyword-based as well as more advanced search interfaces for retrieving the most relevant data objects. This work has been part of the Sindice search engine project at the Digital Enterprise Research Institute (DERI), NUI Galway. The Sindice system currently maintains more than 100 million pages downloaded from the Web and is being used actively by many researchers within and outside of DERI. The reasoning, ranking, indexing and querying compo- nents of the Sindice search engine is a direct result of this dissertation research. iv Declaration I declare that this thesis is composed by myself, that the work contained herein is my own except where explicitly stated otherwise in the text, and that this work has not been submitted for any other degree or professional qualification except as specified. Renaud Delbru v Contents I. Prelude 1 1. Introduction 3 1.1. On Web Search and Retrieval . .5 1.2. Challenges in Information Retrieval for Web Data . .8 1.3. Scope of Research . .8 1.4. What the Thesis is not about . .9 1.5. Outline of the Thesis . 10 1.6. Contribution . 12 II. Foundations 13 2. An Entity Retrieval Model for Web Data 15 2.1. Management of Decentralised and Heterogeneous Information Sources . 15 2.2. Abstract Model . 20 2.3. Formal Model . 25 2.4. Search Model . 28 2.5. Conclusion . 36 3. Background 39 3.1. Reasoning over Web Data . 39 3.2. Link Analysis for Ranking Web Resources . 44 3.3. Indexing and Searching Semi-Structured Data . 46 3.4. Inverted Indexes and Inverted Lists . 50 vii Contents III.Methods 59 4. Context-Dependent Reasoning for Web Data 61 4.1. Introduction . 61 4.2. Context-Dependent Reasoning of RDF Models . 62 4.3. Context-Dependent Ontology Base . 67 4.4. Implementation . 71 4.5. Performance Evaluation . 72 4.6. Discussion and Future Works . 77 4.7. Conclusions . 78 5. Link Analysis over Web Data 79 5.1. Introduction . 79 5.2. A Two-Layer Model for Ranking Web Data . 80 5.3. The DING Model . 83 5.4. Scalability of the DING approach . 88 5.5. Experiments and Results . 89 5.6. Conclusion and Future Work . 96 6. Semi-Structured Indexing for Web Data 99 6.1. Introduction . 99 6.2. Node-Labelled Tree Model for RDF . 100 6.3. Query Model . 101 6.4. Implementing the Model . 104 6.5. Comparison among Entity Retrieval Systems . 111 6.6. Experiments . 115 6.7. Discussion . 121 6.8. Conclusion and Future Work . 121 7. Adaptive Frame Of Reference for Compressing Inverted Lists 123 7.1. Introduction . 123 7.2. Adaptive Frame of Reference . 124 7.3. Experiments . 130 7.4. Discussion and Future Work . 143 7.5. Conclusion . 144 viii Contents IV.System 145 8. The Sindice Search Engine: Putting the Pieces Together 147 8.1. Sindice Overview . 147 8.2. Data Acquisition . 148 8.3. Data Preprocessing and Denormalisation . 151 8.4. Distributed Entity Storage and Indexing . 154 8.5. Conclusions . 155 V. Conclusions 157 9. Conclusions 159 9.1. Summary of the Thesis . 159 9.2. Directions for Future Research . 161 VI.Appendices 163 A. Questionnaires of the User Study 165 B. Query Sets of the Index Evaluation 169 B.1. Real-World's Query Set . 169 B.2. Barton's Query Set . 170 B.3. 10 Billion's Query Set . 171 C. Results of the Compression Benchmark 173 Bibliography 181 List of figures 197 List of tables 201 ix Part I. Prelude Chapter 1. Introduction On the Web, a growing number of data management scenarios have to deal with hetero- geneous and inter-linked data sources. More and more structured and semi-structured data sources are becoming available. Hundreds of millions of documents embedding semi-structured data, e.g., HTML tables, Microformats, RDFa, are already published and publicly accessible. We observe a rapidly growing amount of online Semantic Web data repositories, e.g., the Linked Data community1. Also, we are expecting that millions of databases supporting simple web applications or more advanced Web 2.0 services, e.g., Wordpress or Facebook, will open their data to the Web. There is a growing demand for the exploitation of these data sources [AAB+09], and in particular there is the need for searching and retrieving information from these data sources. With the current availability of data publishing standards and tools, publishing semi-structured data on the Web, which from here on we will simply call Web Data, is becoming a mass activity. Indeed, nowadays it is not limited to a few trained specialists any more. Instead, it is open to industries, governments and individuals. Some prominent fields in which examples of semi-structured data publishing efforts exist are: e-government with the launch of data.gov by the US Federal Government, http: //data.gov.uk/ by the UK government or data.vancouver.ca by the City of Vancouver to publicly share open government data; editorial world in which we can cite the effort of the New York Times or Reuters to publish rich metadata about their news articles; 1Linked Data: http://linkeddata.org/ 3 Introduction e-commerce with for example BestBuy, which published product descriptions, store details and other information in a machine-readable format; web content management systems such as Drupal or Wordpress that provides means to expose their structured information as Web Data. social networking such as the Facebook's Open Graph Protocol which enables any web page to become a rich object in a social graph. e-science such as Bio2RDF2 which offers the integration of over 30 major biological databases. Also, the emergence of Web 2.0 communities in which users join ad-hoc communities to create, collaborate and curate domain-specific information has created many \closed data silos" that are not reusable outside the communities. Some of these communities, e.g., Last.fm or LiveJournal, are already publishing open data, and it is expected that other communities will follow the lead. The Web is becoming an universal interactive medium for exchanging data, being unstructured, semi-structured or structured. The result is a sea of data objects ranging from traditional web documents to records in online databases and covering various domains such as social information or business data. However, the usefulness of Web Data publishing is clearly dependent of the ease by which data can be discovered and consumed by others. Taking the e-commerce example, how can a user find products matching a certain description pattern over thousands of e-commerce data sources ? By entering simple keywords into a web search system, the results are likely to be irrelevant since the system will return pages mentioning the keywords and not the matching products themselves. Current search systems are inadequate for this task since they have been developed for a totally different model, i.e., a Web of Documents. The shift from documents to data objects poses new challenges for web search systems. One of the main challenge is to develop efficient and effective methods for searching and retrieving data objects from a world of distributed and heterogeneous data sources. Throughout this thesis, we introduce a novel methodology for searching semi-structured information coming from distributed data sources. This methodology applied directly and has been motivated by the Web Data scenario. We introduce a logical view of the distributed data sources and propose a boolean model to search over these data sources. The logical view is built around two important concepts, the concept of dataset and 2Bio2RDF: http://bio2rdf.org/ 4 Introduction entity. Based on this model, we propose appropriate methods to search data objects and improve the search results. All of the methods are designed especially for large scale scenarios. We finally show how to integrate and interconnect these novel techniques into a single system, a search engine for Web Data, which retains the same class of scalability than traditional web information retrieval systems.
Recommended publications
  • Privacy Protection for Smartphones: an Ontology-Based Firewall Johanne Vincent, Christine Porquet, Maroua Borsali, Harold Leboulanger
    Privacy Protection for Smartphones: An Ontology-Based Firewall Johanne Vincent, Christine Porquet, Maroua Borsali, Harold Leboulanger To cite this version: Johanne Vincent, Christine Porquet, Maroua Borsali, Harold Leboulanger. Privacy Protection for Smartphones: An Ontology-Based Firewall. 5th Workshop on Information Security Theory and Prac- tices (WISTP), Jun 2011, Heraklion, Crete, Greece. pp.371-380, 10.1007/978-3-642-21040-2_27. hal-00801738 HAL Id: hal-00801738 https://hal.archives-ouvertes.fr/hal-00801738 Submitted on 18 Mar 2013 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Distributed under a Creative Commons Attribution| 4.0 International License Privacy Protection for Smartphones: An Ontology-Based Firewall Johann Vincent, Christine Porquet, Maroua Borsali, and Harold Leboulanger GREYC Laboratory, ENSICAEN - CNRS University of Caen-Basse-Normandie, 14000 Caen, France {johann.vincent,christine.porquet}@greyc.ensicaen.fr, {maroua.borsali,harold.leboulanger}@ecole.ensicaen.fr Abstract. With the outbreak of applications for smartphones, attempts to collect personal data without their user’s consent are multiplying and the protection of users privacy has become a major issue. In this paper, an approach based on semantic web languages (OWL and SWRL) and tools (DL reasoners and ontology APIs) is described.
    [Show full text]
  • Luigne Breg and the Origins of the Uí Néill. Proceedings of the Royal Irish Academy: Archaeology, Culture, History, Literature, Vol.117C, Pp.65-99
    Gleeson P. (2017) Luigne Breg and the Origins of the Uí Néill. Proceedings of the Royal Irish Academy: Archaeology, Culture, History, Literature, vol.117C, pp.65-99. Copyright: This is the author’s accepted manuscript of an article that has been published in its final definitive form by the Royal Irish Academy, 2017. Link to article: http://www.jstor.org/stable/10.3318/priac.2017.117.04 Date deposited: 07/04/2017 Newcastle University ePrints - eprint.ncl.ac.uk Luigne Breg and the origins of the Uí Néill By Patrick Gleeson, School of History, Classics and Archaeology, Newcastle University Email: [email protected] Phone: (+44) 01912086490 Abstract: This paper explores the enigmatic kingdom of Luigne Breg, and through that prism the origins and nature of the Uí Néill. Its principle aim is to engage with recent revisionist accounts of the various dynasties within the Uí Néill; these necessitate a radical reappraisal of our understanding of their origins and genesis as a dynastic confederacy, as well as the geo-political landsape of the central midlands. Consequently, this paper argues that there is a pressing need to address such issues via more focused analyses of local kingdoms and political landscapes. Holistic understandings of polities like Luigne Breg are fundamental to framing new analyses of the genesis of the Uí Néill based upon interdisciplinary assessments of landscape, archaeology and documentary sources. In the latter part of the paper, an attempt is made to to initiate a wider discussion regarding the nature of kingdoms and collective identities in early medieval Ireland in relation to other other regions of northwestern Europe.
    [Show full text]
  • Camogie Association & GAA Information and Guidance Leaflet On
    Camogie Association & GAA Information and Guidance leaflet on the National Vetting Bureau (Children & Vulnerable Persons) Act 2012 March 2015 1 National Vetting Bureau (Children & Vulnerable Persons) Act The National Vetting Bureau (Children and Vulnerable Persons) Act 2012 is the vetting legislation passed by the Houses of the Oireachtas in December 2012. This legislation is part of a suite of complementary legislative proposals to strengthen child protection policies and practices in Ireland. Once the ‘Vetting Bureau Act’ commences the law on vetting becomes formal and obligatory and all organisations and their volunteers or staff who with children and vulnerable adults will be legally obliged to have their personnel vetted. Such personnel must be vetted prior to the commencement of their work with their Association or Sports body. It is important to note that prior to the Act commencing that the Associations’ policy stated that all persons who in a role of responsibility work on our behalf with children and vulnerable adults has to be vetted. This applies to those who work with underage players. (The term ‘underage’ applies to any player who is under 18 yrs of age, regardless of what team with which they play). The introduction of compulsory vetting, on an All-Ireland scale through legislation, merely formalises our previous policies and practices. 1 When will the Act commence or come into operation? The Act is effectively agreed in law but has to be ‘commenced’ by the Minister for Justice and Equality who decides with his Departmental colleagues when best to commence all or parts of the legislation at any given time.
    [Show full text]
  • Sports Capital Programme Payments in 2020 Sum of AMOUNT Carlow
    Sports Capital Programme Payments in 2020 Sum of AMOUNT Carlow Carlow Dragon Boat Club €17,877 Carlow Lawn Tennis Club €14,353 Carlow Town Hurling Club €14,332 Clonmore GFC €23,209 Fighting Cocks GFC €33,442 Fighting Cocks GFC €14,620 Kildavin Clonegal GAA Club €61,750 Leighlinbridge GFC €7,790 Republic of Ireland Ladies Snooker Ass €23,709 Slaney Rovers AFC €3,750 St Mullins GAA Club €7,000 Tullow and South Leinster Sports Club €9,430 Tullow Mountaineering Club €2,757 Tullow RFC €18,275 Tullow RFC €3,250 Cavan 3rd Cavan Virginia Scouting Group €7,754 Bailieborough Shamrocks €11,720 Ballyhaise Celtic Soccer Club €646 Ballymachugh GFC €10,481 Belturbet GAA Club €3,375 Belturbet Golf Club €23,824 Cavan Amatuer Boxing Club €1,188 Cavan Canoe Club €34,542 Cavan Co Co (Community Bowling Green) €11,624 Coiste Bhreifne Uí Raghaillaigh (Cavan G.A.A.) €7,500 Cornafean GFC €8,500 Crosserlough GFC €10,352 Cuman Gael an Chabhain (Cavan Gaels GAA) €17,500 Droim Dhuin Eire Og €20,485 Farnham National School €21,119 Kill Community Development €8,960 Killinkere GFC €2,777 Knockbride GAA €24,835 Knockbride Ladies GFC €1,942 Lavey GAA €48,785 Leisure & Sports Complex (Ballinagh) Trust €13,872 Leisure & Sports Complex (Ballinagh) Turst €57,000 Maghera Mac Finns GFC €2,792 Mullahoran GFC €10,259 Shercock GAA €6,650 Shercock Gaelic Football Club €2,183 Shercock GFC €7,125 Shercock Sports and Recreational Facilities €84,550 St Patrick's College €3,500 Virginia Golf Club €38,127 Sports Capital Programme Payments in 2020 Virginia Kayak Club €9,633 Cavan Castlerahan
    [Show full text]
  • Useful Applications – Last Updated 8 Th March 2014
    A List of Useful Applications – Last updated 8 th March 2014 In the descriptions of the software the text in black is my comments. Text in dark blue preceded by 'What they say :-' is a quote from the website providing the software. Rating :- This is my own biased and arbitrary opinion of the quality and usefulness of the software. The rating is out of 5. Unrated = - Poor = Average = Good = Very Good = Essential = Open Office http://www.openoffice.org/ Very Good = Word processor, Spreadsheet, Drawing Package, Presentation Package etc, etc. Free and open source complete office suite, equivalent to Microsoft Office. Since the takeover of this project by Oracle development seems to have ground to a halt with the departure of many of the developers. Libre Office http://www.libreoffice.org/ Essential = Word processor, Spreadsheet, Drawing Package, Presentation Package etc, etc. Free and open source complete office suite, equivalent to Microsoft Office. This package is essentially the same as Open Office however it satisfies the open source purists because it is under the control of an open source group rather than the Oracle Corporation. Since the takeover of the Open Office project by Oracle many of the developers left and a lot of them ended up on the Libre Office project. Development on the Libre Office project is now ahead of Open Office and so Libre Office would be my preferred office suite. AbiWord http://www.abisource.com/ Good = If you don't really need a full office suite but just want a simple word processor then AbiWord might be just what you are looking for.
    [Show full text]
  • Semantic Web and Services
    Where are we? Artificial Intelligence # Title 1 Introduction 2 Propositional Logic 3 Predicate Logic 4 Reasoning 5 Search Methods Semantic Web and 6 CommonKADS 7 Problem-Solving Methods 8 Planning Services 9 Software Agents 10 Rule Learning 11 Inductive Logic Programming 12 Formal Concept Analysis 13 Neural Networks 14 Semantic Web and Services © Copyright 2010 Dieter Fensel, Mick Kerrigan and Ioan Toma 1 2 Agenda • Semantic Web - Data • Motivation • Development of the Web • Internet • Web 1.0 • Web 2.0 • Limitations of the current Web • Technical Solution: URI, RDF, RDFS, OWL, SPARQL • Illustration by Larger Examples: KIM Browser Plugin, Disco Hyperdata Browser • Extensions: Linked Open Data • Semantic Web – Processes • Motivation • Technical Solution: Semantic Web Services, WSMO, WSML, SEE, WSMX SEMANTIC WEB - DATA • Illustration by Larger Examples: SWS Challenge, Virtual Travel Agency, WSMX at work • Extensions: Mobile Services, Intelligent Cars, Intelligent Electricity Meters • Summary • References 3 3 4 4 1 MOTIVATION DEVELOPMENT OF THE WEB 5 5 6 Development of the Web 1. Internet 2. Web 1.0 3. Web 2.0 INTERNET 7 8 2 Internet A brief summary of Internet evolution Age of eCommerce Mosaic Begins WWW • “The Internet is a global system of interconnected Internet Created 1995 Created 1993 Named 1989 computer networks that use the standard Internet and Goes Protocol Suite (TCP/IP) to serve billions of users TCP/IP TCP/IP Created 1984 ARPANET 1972 worldwide. It is a network of networks that consists of 1969 Hypertext millions of private
    [Show full text]
  • Irish Independent Death Notices Galway Rip
    Irish Independent Death Notices Galway Rip Trim Barde fusees unreflectingly or wenches causatively when Chris is happiest. Gun-shy Srinivas replaced: he ail his tog poetically and commandingly. Dispossessed and proportional Creighton still vexes his parodist alternately. In loving memory your Dad who passed peacefully at the Mater. Sorely missed by wife Jean and must circle. Burial will sometimes place in Drumcliffe Cemetery. Mayo, Andrew, Co. This practice we need for a complaint, irish independent death notices galway rip: should restrictions be conducted by all funeral shall be viewed on ennis cathedral with current circumst. Remember moving your prayers Billy Slattery, Aughnacloy X Templeogue! House and funeral strictly private outfit to current restrictions. Sheila, Co. Des Lyons, cousins, Ennis. Irish genealogy website directory. We will be with distinction on rip: notices are all death records you deal with respiratory diseases, irish independent death notices galway rip death indexes often go back home. Mass for Bridie Padian will. Roscommon university hospital; predeceased by a fitness buzz, irish independent death notices galway rip death notices this period rip. Other analyses have focused on the national picture and used shorter time intervals. Duplicates were removed systematically from this analysis. Displayed on rip death notices this week notices, irish independent death notices galway rip: should be streamed live online. Loughrea, Co. Mindful of stephenie, Co. Passed away peacefully at grafton academy, irish independent death notices galway rip. Cherished uncle of Paul, Co. Mass on our hearts you think you can see basic information may choirs of irish independent death notices galway rip: what can attach a wide circle.
    [Show full text]
  • Software Analysis
    visEUalisation Analysis of the Open Source Software. Explaining the pros and cons of each one. visEUalisation HOW TO DEVELOP INNOVATIVE DIGITAL EDUCATIONAL VIDEOS 2018-1-PL01-KA204-050821 1 Content: Introduction..................................................................................................................................3 1. Video scribing software ......................................................................................................... 4 2. Digital image processing...................................................................................................... 23 3. Scalable Vector Graphics Editor .......................................................................................... 28 4. Visual Mapping. ................................................................................................................... 32 5. Configurable tools without the need of knowledge or graphic design skills. ..................... 35 6. Graphic organisers: Groupings of concepts, Descriptive tables, Timelines, Spiders, Venn diagrams. ...................................................................................................................................... 38 7. Creating Effects ................................................................................................................... 43 8. Post-Processing ................................................................................................................... 45 9. Music&Sounds Creator and Editor .....................................................................................
    [Show full text]
  • Wiki Semantics Via Wiki Templating
    Chapter XXXIV Wiki Semantics via Wiki Templating Angelo Di Iorio Department of Computer Science, University of Bologna, Italy Fabio Vitali Department of Computer Science, University of Bologna, Italy Stefano Zacchiroli Universitè Paris Diderot, PPS, UMR 7126, Paris, France ABSTRACT A foreseeable incarnation of Web 3.0 could inherit machine understandability from the Semantic Web, and collaborative editing from Web 2.0 applications. We review the research and development trends which are getting today Web nearer to such an incarnation. We present semantic wikis, microformats, and the so-called “lowercase semantic web”: they are the main approaches at closing the technological gap between content authors and Semantic Web technologies. We discuss a too often neglected aspect of the associated technologies, namely how much they adhere to the wiki philosophy of open editing: is there an intrinsic incompatibility between semantic rich content and unconstrained editing? We argue that the answer to this question can be “no”, provided that a few yet relevant shortcomings of current Web technologies will be fixed soon. INTRODUCTION Web 3.0 can turn out to be many things, it is hard to state what will be the most relevant while still debating on what Web 2.0 [O’Reilly (2007)] has been. We postulate that a large slice of Web 3.0 will be about the synergies between Web 2.0 and the Semantic Web [Berners-Lee et al. (2001)], synergies that only very recently have begun to be discovered and exploited. We base our foresight on the observation that Web 2.0 and the Semantic Web are converging to a common point in their initially split evolution lines.
    [Show full text]
  • Hidden Meaning
    FEATURES Microdata and Microformats Kit Sen Chin, 123RF.com Chin, Sen Kit Working with microformats and microdata Hidden Meaning Programs aren’t as smart as humans when it comes to interpreting the meaning of web information. If you want to maximize your search rank, you might want to dress up your HTML documents with microformats and microdata. By Andreas Möller TML lets you mark up sections formats and microdata into your own source code for the website shown in of text as headings, body text, programs. Figure 1 – an HTML5 document with a hyperlinks, and other web page business card. The Heading text block is H elements. However, these defi- Microformats marked up with the element h1. The text nitions have nothing to do with the HTML was originally designed for hu- for the business card is surrounded by meaning of the data: Does the text refer mans to read, but with the explosive the div container element, and <br/> in- to a person, an organization, a product, growth of the web, programs such as troduces a line break. or an event? Microformats [1] and their search engines also process HTML data. It is easy for a human reader to see successor, microdata [2] make the mean- What do programs that read HTML data that the data shown in Figure 1 repre- ing a bit more clear by pointing to busi- typically find? Listing 1 shows the HTML sents a business card – even if the text is ness cards, product descriptions, offers, and event data in a machine-readable LISTING 1: HTML5 Document with Business Card way.
    [Show full text]
  • Petri Net-Based Graphical and Computational Modelling of Biological Systems
    bioRxiv preprint doi: https://doi.org/10.1101/047043; this version posted June 22, 2016. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Petri Net-Based Graphical and Computational Modelling of Biological Systems Alessandra Livigni1, Laura O’Hara1,2, Marta E. Polak3,4, Tim Angus1, Lee B. Smith2 and Tom C. Freeman1+ 1The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Edinburgh, Midlothian EH25 9RG, UK. 2MRC Centre for Reproductive Health, 47 Little France Crescent, Edinburgh, EH16 4TJ, UK, 3Clinical and Experimental Sciences, Sir Henry Wellcome Laboratories, Faculty of Medicine, University of Southampton, SO16 6YD, Southampton, 4Institute for Life Sciences, University of Southampton, SO17 1BJ, UK. Abstract In silico modelling of biological pathways is a major endeavour of systems biology. Here we present a methodology for construction of pathway models from the literature and other sources using a biologist- friendly graphical modelling system. The pathway notation scheme, called mEPN, is based on the principles of the process diagrams and Petri nets, and facilitates both the graphical representation of complex systems as well as dynamic simulation of their activity. The protocol is divided into four sections: 1) assembly of the pathway in the yEd software package using the mEPN scheme, 2) conversion of the pathway into a computable format, 3) pathway visualisation and in silico simulation using the BioLayout Express3D software, 4) optimisation of model parameterisation. This method allows reconstruction of any metabolic, signalling and transcriptional pathway as a means of knowledge management, as well as supporting the systems level modelling of their dynamic activity.
    [Show full text]
  • Nomination of the Monastic City of Clonmacnoise and Its Cultural Landscape for Inclusion in the WORLD HERITAGE LIST
    DRAFT Nomination of The Monastic City of Clonmacnoise and its Cultural Landscape For inclusion in the WORLD HERITAGE LIST Clonmacnoise World Heritage Site Draft Nomination Form Contents EXECUTIVE SUMMARY .................................................................................................III 1. IDENTIFICATION OF THE PROPERTY ......................................................................1 1.a Country:..................................................................................................1 1.b State, Province or Region:......................................................................1 1.c Name of Property: ..................................................................................1 1.d Geographical co-ordinates to the nearest second ..................................1 1.e Maps and plans, showing the boundaries of the nominated property and buffer zone ........................................................................................................2 1.f Area of nominated property (ha.) and proposed buffer zone (ha.)..........3 2. DESCRIPTION..............................................................................................................4 2.a Description of Property ..........................................................................4 2.b History and development......................................................................31 3. JUSTIFICATION FOR INSCRIPTION ........................................................................38 3.a Criteria under which inscription
    [Show full text]