Multilingual Catalogue Strategies
Total Page:16
File Type:pdf, Size:1020Kb
CEN CWA 15992 WORKSHOP July 2009 AGREEMENT ICS 35.240.60 English version Harmonization of data interchange in tourism This CEN Workshop Agreement has been drafted and approved by a Workshop of representatives of interested parties, the constitution of which is indicated in the foreword of this Workshop Agreement. The formal process followed by the Workshop in the development of this Workshop Agreement has been endorsed by the National Members of CEN but neither the National Members of CEN nor the CEN Management Centre can be held accountable for the technical content of this CEN Workshop Agreement or possible conflicts with standards or legislation. This CEN Workshop Agreement can in no way be held as being an official standard developed by CEN and its Members. This CEN Workshop Agreement is publicly available as a reference document from the CEN Members National Standard Bodies. CEN members are the national standards bodies of Austria, Belgium, Bulgaria, Cyprus, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland, Ireland, Italy, Latvia, Lithuania, Luxembourg, Malta, Netherlands, Norway, Poland, Portugal, Romania, Slovakia, Slovenia, Spain, Sweden, Switzerland and United Kingdom. EUROPEAN COMMITTEE FOR STANDARDIZATION COMITÉ EUROPÉEN DE NORMALISATION EUROPÄISCHES KOMITEE FÜR NORMUNG Management Centre: Avenue Marnix 17, B-1000 Brussels © 2009 CEN All rights of exploitation in any form and by any means reserved worldwide for CEN national Members. Ref. No.:CWA 15992:2009 E CWA 15992:2009 (E) Contents Contents 2 Foreword 9 Executive summary 10 Problem statement 10 Approach 10 The five challenges 10 Semantics 10 Data transformation 11 Process handling 11 Metasearch 11 Object identification 11 Best practice case 11 Recommendations 12 Summary of recommendations 13 Overall recommendations 13 List of recommendations on different topics 14 Standards 14 Short-term recommendations 14 Long-term recommendations 14 Taxonomies 15 Short-term recommendations 15 Long-term recommendations 15 Ontologies 15 Short-term recommendations 15 Structured data mapping 15 Short-term recommendations 15 Long-term recommendations 16 Manual semantic annotation 16 Short-term recommendations 16 Long-term recommendations 16 Automatic information extraction 16 Short-term recommendations 16 Long-term recommendations 16 Inter-ontology mapping 17 Short-term recommendations 17 Long-term recommendations 17 Process handling 17 Short-term recommendations 17 Long-term recommendations 17 Metasearch methodology 17 2 CWA 15992:2009 (E) Short-term recommendations 17 Long-term recommendations 18 Querying 18 Short-term recommendations 18 Long-term recommendations 18 Object identification 18 Short-term recommendations 18 Long-term recommendations 18 1 Scope 19 2 Normative references 20 3 Abbreviations, terms and definitions 21 3.1 Abbreviations 21 3.2 Terms and definitions 22 4 Methodology and thematic overview 23 4.1 Thematic circle 23 4.2 Topics 25 4.2.1 Semantics 25 4.2.2 Data transformation 26 4.2.3 Process handling 27 4.2.4 Metasearch 27 4.2.5 Object identification 27 4.3 Cross-cutting concerns / Prerequisites 28 4.3.1 Legal aspects 28 4.3.2 Multiculturalism 29 4.3.3 Business models 30 4.3.4 Technology 31 5 Case study 32 5.1 The processes 33 5.1.1 The actors 33 5.1.2 Consumer process 33 5.1.3 Travel-related professional process 35 5.2 The information and communication technologies 36 5.2.1 Multiple levels of data sources 36 5.2.2 Type of information 38 5.2.3 Type of data sources 40 6 Semantics 42 6.1 Standards 42 6.1.1 Needs and requirements 42 Introduction 42 Needs 43 Requirements 44 6.1.2 State of the art 44 3 CWA 15992:2009 (E) Types of standards 46 List of travel industry standards, companies and organizations (examples) 46 6.1.3 Gaps and future needs 57 6.1.4 Recommendations 57 Short-term recommendations (1–3 years) 57 Long-term recommendations (3–10 years) 58 6.2 Taxonomies 58 6.2.1 Needs and requirements 58 Introduction 58 Needs 58 Requirements 59 6.2.2 State of the art 59 Examples of tourism taxonomies 60 6.2.3 Gaps and future needs 61 6.2.4 Recommendations 62 Short-term recommendations (1–3 years) 62 Long-term recommendations (3–10 years) 62 6.3 Ontologies 62 6.3.1 Needs and requirements 62 Introduction 62 Needs 63 6.3.2 State of the art 64 Definitions of the notion of ontology within the computer science domain 64 Main components of an ontology 65 Ontology development tools 65 Ontology development languages 66 Examples of standard ontologies 67 6.3.3 Gaps and future needs 70 6.3.4 Recommendations 71 Short-term recommendations (1–3 years) 71 Long-term recommendations (3–10 years) 71 7 Data transformation 72 7.1 Structured data mapping 72 7.1.1 Needs and requirements 72 Introduction 72 Needs 73 Requirements 74 7.1.2 State of the art 75 7.1.3 Gaps and future needs 76 7.1.4 Recommendations 77 Short-term recommendations (1–3 years) 77 Long-term recommendations (3–10 years) 77 4 CWA 15992:2009 (E) 7.2 Manual semantic annotation 77 7.2.1 Needs and requirements 78 7.2.2 State of the art 79 7.2.3 Gaps and future needs 80 7.2.4 Recommendations 80 Short-term recommendations (1–3 years) 80 Long-term recommendations (3–10 years) 80 7.3 Automatic information extraction 81 7.3.1 Needs and requirements 81 Needs 81 Requirements 81 7.3.2 State of the art 81 Named entity recognition 82 Event extraction 82 Tourism-specific information extraction 83 7.3.3 Gaps and future needs 84 Named entity recognition 84 Event extraction 84 Tourism-specific information extraction 84 7.3.4 Recommendations 84 Short-term recommendations (1–3 years) 84 Long-term recommendations (3–10 years) 84 7.4 Inter-ontology mapping 85 7.4.1 Needs and requirements 85 Introduction 85 Needs 85 Requirements 85 7.4.2 State of the art 86 7.4.3 Gaps and future needs 87 7.4.4 Recommendations 88 Short-term recommendations (1–3 years) 88 Long-term recommendations (3–10 years) 88 8 Process handling 89 8.1 Needs and requirements 89 8.1.1 Introduction 89 8.1.2 Needs 90 8.1.3 Requirements 92 8.2 State of the art 93 8.2.1 Global standardization efforts 93 8.2.2 Application Integration and APIs 94 8.3 Gaps and future needs 94 8.4 Recommendations 95 5 CWA 15992:2009 (E) 8.4.1 Short-term recommendations (1–3 years) 95 8.4.2 Long-term recommendations (3–10 years) 96 9 Metasearch 97 9.1 Methodology 97 9.1.1 Needs and requirements 97 Introduction 97 Quality of results 97 Response time 97 Access to data 98 Efforts for maintenance 98 9.1.2 State of the art 98 Web crawler 98 HTTP requests 98 Website wrapper 99 Application Programming Interfaces (API) 99 Web services 99 Semantic annotation 99 Caching mechanism 100 Summary 100 9.1.3 Gaps and future needs 100 9.1.4 Recommendations 101 Short-term recommendations (1–3 years) 101 Long-term recommendations (3–10 years) 101 9.2 Querying 102 9.2.1 Needs and requirements 102 Introduction 102 Needs and requirements 102 9.2.2 State of the art 103 Methods for query distribution 103 Query by example 104 Standardized query languages 104 Interface standardization 105 Metadata syndication 106 9.2.3 Gaps and future needs 107 Query by example 107 Standardized query languages / SPARQL 107 Interface standardization 107 Metadata syndication 108 9.2.4 Recommendations 108 Short-term recommendations (1–3 years) 108 Long-term recommendations (3–10 years) 108 9.3 Role of registries in eTourism 109 6 CWA 15992:2009 (E) 9.3.1 Needs and requirements 109 Introduction 109 Needs 109 Requirements 110 9.3.2 State of the art 110 UDDI and the ebXML Registry Specification 110 CEN/ISSS eGovernment Focus Group and CEN/ISSS WS eGov-Share 112 9.3.3 Gaps and future needs 114 Shortcomings of current registry standards 114 Future needs 115 9.3.4 Recommendations 116 Short-term recommendations (1–3 years) 116 Long-term recommendations (3–10 years) 116 10 Object identification 117 10.1 Needs and requirements 117 10.1.1 Introduction 117 10.1.2 Needs 117 10.1.3 Requirements 118 Location codes 118 Travel service codes 118 Travel service qualifier codes 119 Travel company codes 119 10.2 State of the art 119 10.2.1 IATA 119 10.2.2 ICAO 120 10.2.3 ISO 121 10.2.4 UN/LOCODE 121 10.2.5 HEDNA 122 10.2.6 ACRISS 122 10.2.7 GIATA 122 10.2.8 GS1 123 10.2.9 URI 123 10.2.10 UUID 123 10.3 Gaps and future needs 124 10.3.1 Location 124 Country codes 124 Region codes 124 City, airport and other point of travel codes 125 10.3.2 Currency and language codes 126 10.3.3 Travel service codes 126 10.3.4 Travel service qualifier codes 126 10.3.5 Travel company codes 126 7 CWA 15992:2009 (E) 10.4 Recommendations 127 10.4.1 Short-term recommendations (1–3 years) 127 10.4.2 Long-term recommendations (3–10 years) 127 11 Best practice case 128 11.1 The starting point 128 11.2 The existing case of euromuse.net 128 11.3 Future scenario for euromuse.net 129 11.4 Critical discussion 130 12 Bibliography and references 132 8 CWA 15992:2009 (E) Foreword The objective of the Workshop CEN/ISSS WS/eTOUR on “Harmonization of data interchange in tourism” and the production of this draft CEN Workshop Agreement (CWA) was approved by the Workshop at its plenary meeting held in Brussels on 6 February 2008.