Privacy and Identity Management in Europe for Life

First Report on Standardisation and Interoperability

Overview and Analysis of Open Source Initiatives

Combined deliverable: Merger of D 3.3.1 and D 3.4.1.

Editors: Carine Bournez (W3C/ERCIM), Patrik Bichsel (IBM)
Reviewers: Maren Raguse (ULD), Immanuel Scholz (TUD)
Identifier: D3.3.1 / D3.4.1
Type: Deliverable
Class: Public
Date: 30 May 2008

Copyright © 2008 by the PrimeLife Consortium. The research leading to these results has received funding from the European Community's Seventh Framework Programme (FP7/2007-2013) under grant agreement no 216483.

Members of the PrimeLife Consortium

1. IBM Research GmbH IBM Switzerland

2. Unabhängiges Landeszentrum für Datenschutz ULD Germany

3. Technische Universität Dresden TUD Germany

4. Karlstads Universitet KAU Sweden

5. Università degli Studi di Milano UNIMI Italy

6. Johann Wolfgang Goethe - Universität Frankfurt am Main GUF Germany

7. Stichting Katholieke Universiteit Brabant TILT Netherlands

8. GEIE ERCIM W3C France

9. Katholieke Universiteit Leuven K.U.Leuven Belgium

10. Università degli Studi di Bergamo UNIBG Italy

11. Giesecke & Devrient GmbH GD Germany

12. Center for Usability Research & Engineering CURE Austria

13. Europäisches Microsoft Innovations Center GmbH EMIC Germany

14. SAP AG SAP Germany

15. Brown University UBR USA

Disclaimer: The information in this document is provided "as is", and no guarantee or warranty is given that the information is fit for any particular purpose. The below referenced consortium members shall have no liability for damages of any kind including without limitation direct, special, indirect, or consequential damages that may result from the use of these materials subject to any liability which is mandatory due to applicable law. Copyright 2008 by IBM Research GmbH, Technische Universität Dresden, Università degli Studi di Milano, Johann Wolfgang Goethe - Universität Frankfurt am Main, GEIE ERCIM, Katholieke Universiteit Leuven, Giesecke & Devrient GmbH, Europäisches Microsoft Innovations Center GmbH, SAP AG.

Abstract

This document is a merged deliverable, covering items D3.3.1 (Overview and Analysis of Open Source Initiatives) and D3.4.1 (First Report on Standardisation and Interoperability). PrimeLife values open source and open standards and plans to share its results as open source software, public educational material and contributions to open standardisation bodies.

The first chapter introduces the scope of this report and its importance within the PrimeLife project. The second chapter describes the target technology platforms: the Web in general and, more specifically, Service-Oriented Architectures (SOAs) and their associated technologies.

The following chapters report on the state of standards and open technologies, organised in five classes: standard identity management and privacy frameworks, policy and rule languages, authentication, user control, and identity systems. Where applicable, references to open source implementations are given. A brief description of the organisations that coordinate standardisation of the previously described specifications follows. The last part of the document reviews in detail a number of selected open source projects related to the scope of the PrimeLife project.

The conclusion summarises results from the PRIME project and expectations for the PrimeLife work, so as to contribute to the definition of the project's next steps.

List of Contributors

Contributions from several PrimeLife partners are contained in this document. The following list presents the contributors for the chapters of this deliverable.

Abstract: Carine Bournez (W3C)
Introduction: Patrik Bichsel (IBM), Jan Camenisch (IBM)
Target Technology Platforms: Carine Bournez (W3C), Ulrich Pinsdorf (EMIC), Thomas Roessler (W3C), Rigo Wenning (W3C)
Architectures and Frameworks: Kai Rannenberg (GUF)
Policy and Rule Languages: Ulrich Pinsdorf (EMIC), Thomas Roessler (W3C), Pierangela Samarati (UNIMI), Mario Verdicchio (IBM), Rigo Wenning (W3C)
Authentication Infrastructure: Jan Camenisch (IBM), Markulf Kohlweiss (KU Leuven), Thomas Roessler (W3C), Stuart Short (SAP), Karel Wouters (KU Leuven)
User Control Infrastructure: Eduard de Jong (GD), Robert Mueller (GD)
Identity Systems: Michele Bezzi (SAP), Patrik Bichsel (IBM), Jan Camenisch (IBM), Ulrich Pinsdorf (EMIC), Thomas Roessler (W3C)
Specification Developing Organisations: Michele Bezzi (SAP), Carine Bournez (W3C), Eduard de Jong (GD), Robert Mueller (GD), Ulrich Pinsdorf (EMIC), Kai Rannenberg (GUF), Thomas Roessler (W3C), Rigo Wenning (W3C)
Selected Open Source Projects: Patrik Bichsel (IBM), Carine Bournez (W3C), Thomas Roessler (W3C), Immanuel Scholz (TUD), Rigo Wenning (W3C)
Conclusion: Claudio d'Ardagna (UNIMI), Jan Camenisch (IBM), Benjamin Kellermann (TUD), Immanuel Scholz (TUD), Rigo Wenning (W3C)

This deliverable was rendered from HTML pages using Prince XML from YesLogic Pty Ltd. YesLogic has donated a license of Prince XML to W3C.

Table Of Contents

1 Introduction 9
1.1 Scope of this Document ...... 9
1.2 Historic Background ...... 9
1.3 PrimeLife Aims and Activities ...... 10
1.4 PrimeLife Project Overview ...... 10
1.5 PrimeLife's Approach to Privacy ...... 11
2 Target Technology Platforms 13
2.1 The Web ...... 13
2.1.1 Architecture ...... 13
2.1.2 Standards ...... 14
2.1.3 Evolving Web Application Development Paradigms - Web 2.0 ...... 16
2.1.4 Semantic Web ...... 17
2.1.5 Privacy technologies for the Web ...... 17
2.2 Service Oriented Architectures ...... 18
2.2.1 OASIS WS-Security ...... 19
2.2.2 OASIS WS-SecureConversation ...... 20
2.2.3 OASIS WS-Trust ...... 20
2.2.4 W3C WS-Policy ...... 21
2.2.5 OASIS WS-SecurityPolicy ...... 22
2.2.6 PrimeLife Perspective ...... 22
3 Architectures and Frameworks 23
3.1 Identity Management ...... 23
3.1.1 IdM Framework (24760) ...... 23
3.1.2 A Framework for Access Management (29146) ...... 24
3.1.3 Entity authentication assurance (29115) ...... 25
3.2 Privacy ...... 25
3.2.1 A Privacy Framework (29100) ...... 25
3.2.2 A Privacy Reference Architecture (29101) ...... 26
4 Policy and rule languages 27
4.1 Extensible Access Control Markup Language (XACML) ...... 27
4.1.1 An XACML scenario ...... 27
4.1.2 An example of XACML policy ...... 28
4.1.3 Relations to other proposals and to the PrimeLife project ...... 30
4.1.4 Current status of the XACML proposal ...... 30
4.2 The Rule Interchange Format (RIF) ...... 30
4.2.1 RIF Dialects ...... 31
4.2.2 Use Cases ...... 31
4.3 ...... 32
4.3.1 Status ...... 33
4.3.2 Conclusion ...... 33
4.4 APPEL ...... 34
4.4.1 Shortcomings ...... 34
4.4.2 Conclusion ...... 34
4.5 Enterprise Privacy Authorisation Language (EPAL) ...... 35
4.5.1 Structure of an EPAL policy ...... 35
4.5.2 An EPAL policy example ...... 35
4.5.3 An EPAL query example ...... 36
4.5.4 A typical EPAL scenario ...... 37
4.5.5 Relations to other standards and to the PrimeLife project ...... 37
4.5.6 Status of the EPAL proposal ...... 37
4.6 CARML ...... 37
4.6.1 Elements of the CARML language ...... 38
4.6.2 Status of the CARML proposal ...... 39
4.6.3 AAPML ...... 39
4.7 Identity Governance Framework ...... 39
4.8 PRIME Policy Languages ...... 40
4.8.1 Rough use cases ...... 40
4.8.2 Distinctive features of PRIME languages ...... 41
4.8.3 Relation to standards ...... 42
4.9 WS-Policy ...... 43
4.9.1 Structure of a policy ...... 43
4.9.2 Normal form of a policy ...... 44
4.9.3 Compact form of a policy ...... 44
4.9.4 Nested policies ...... 45
4.9.5 References to other policies ...... 45
4.9.6 Intersection of policies ...... 46
4.9.7 Relations to other proposals and to the PrimeLife project ...... 48
4.9.8 Status of the WS-Policy proposal ...... 48
4.10 OASIS WS-XACML ...... 48
4.11 Security Policy Assertion Language (SecPAL) ...... 49
5 Authentication Infrastructure 50
5.1 The ITU-T X.509 Standard ...... 50
5.1.1 X.509 Certificate and Certification Process ...... 51
5.1.2 Evolution of the X.509 standard ...... 52
5.2 PKIX ...... 53
5.2.1 X.509 Attribute Certificate and Privilege Management Infrastructure ...... 53
5.2.2 Relationship to PrimeLife ...... 54
5.3 XML Signature ...... 55
5.3.1 Specification Overview ...... 55
5.3.2 Status ...... 56
5.3.3 PrimeLife Impact ...... 56
6 User Control Infrastructure 58
6.1 Smart Cards: User-controlled Token for Privacy Protection ...... 58
6.1.1 Introduction ...... 58
6.1.2 Standardisation ...... 60
6.1.3 Architectures ...... 60
6.1.4 Strategy and Actions ...... 61
6.2 Biometrics Standardisation and Privacy ...... 61
6.2.1 Biometrics Standardisation ...... 62
6.2.2 Architectures ...... 62
6.2.3 Strategy and Actions ...... 62
7 Identity Systems 64
7.1 OpenID ...... 64
7.1.1 Background ...... 64
7.1.2 Protocol Flow ...... 64
7.1.3 Message Formats ...... 65
7.1.4 Trust and Privacy Properties ...... 65
7.1.5 Specification Development ...... 66
7.1.6 Open Source Implementations ...... 66
7.1.7 PrimeLife Perspective on OpenID ...... 66
7.2 Higgins ...... 67
7.3 CardSpace ...... 68
7.4 WS-Federation ...... 70
7.5 SAML ...... 70
7.5.1 Background ...... 71
7.5.2 Architecture ...... 71
7.5.3 Protocol Flow ...... 71
7.5.4 Open Source Implementations ...... 72
7.6 Liberty Identity Federation ...... 72
7.6.1 History and relationship with SAML ...... 72
7.6.2 Liberty profiles ...... 73
7.6.3 Profiles of the Single Sign-On and Federation Protocol ...... 73
7.6.4 Single Sign-On Protocol Flow Example: Liberty Artifact Profile ...... 73
7.6.5 Liberty and CardSpace ...... 74
7.6.6 PrimeLife and Liberty ...... 74
7.6.7 Open Source Implementations ...... 74
7.7 Yadis ...... 75
7.7.1 Protocol flow ...... 75
7.7.2 The Yadis document ...... 75
7.7.3 Trust and privacy properties ...... 76
7.7.4 Opportunities for PrimeLife ...... 76
7.7.5 Specification development ...... 76
7.7.6 Open Source Implementations ...... 76
8 Specification Developing Organisations 77
8.1 W3C ...... 77
8.2 IETF ...... 78
8.3 OASIS ...... 78
8.4 Liberty Alliance ...... 79
8.5 TCG ...... 80
8.6 ISO/IEC JTC 1 ...... 82
8.6.1 ISO/IEC JTC 1/SC 27/WG 5 ...... 82
8.6.2 ISO/IEC JTC 1/SC 17/WG 4 ...... 83
8.6.3 ISO/IEC JTC 1/SC 17/WG 11 ...... 83
8.6.4 ISO/IEC JTC 1/SC 37 ...... 84
9 Selected Open Source Projects 85
9.1 MozPETs: Mozilla Privacy Enhancement Technologies ...... 85
9.2 Firefox Plugins ...... 86
9.2.1 Formfiller and Identity Management Enhancement ...... 86
9.2.2 Privacy Enhancement ...... 87
9.2.3 Trust Enhancement ...... 88
9.2.4 Other Firefox Plugins ...... 89
9.2.5 Opportunities for PrimeLife ...... 89
9.3 TOR ...... 89
9.4 Privoxy ...... 90
9.5 Invisible Internet Project (I2P) ...... 91
9.6 Bandit ...... 91
9.6.1 Development Framework ...... 92
9.7 Concordia Project ...... 92
9.8 Open-Source Identity System (OSIS) ...... 93
9.8.1 Specification development ...... 93
9.8.2 Open Source Interoperability Workshops ...... 93
9.9 Pamela Project ...... 93
9.10 OpenSSO ...... 94
9.10.1 Architectural Overview ...... 94
9.10.2 Opportunities for PrimeLife ...... 94
9.10.3 Open Source Implementations ...... 95
9.11 OAuth ...... 95
9.11.1 Protocol flow ...... 95
9.11.2 Trust and privacy properties ...... 96
9.11.3 Specification development ...... 97
9.11.4 Open Source Implementations ...... 97
9.12 Noserub ...... 97
10 Conclusion 98
10.1 Results Available From PRIME ...... 98
10.1.1 Privacy Ontologies ...... 98
10.1.2 The JRC Policy Workbench ...... 99
10.1.3 Policy Language ...... 100
10.1.4 SendPersonalData dialog ...... 100
10.1.5 Firefox plugins ...... 101
10.1.6 DataTrack ...... 103
10.1.7 Privacy Policy Decision Engine ...... 104
10.1.8 BluES'n ...... 107
10.1.9 PRIME Policy Manager ...... 107
10.1.10 Obligation Manager ...... 108
10.1.11 Identity Mixer ...... 109
10.2 Results Expected from PrimeLife ...... 109
10.2.1 Activity 1 - Privacy Life ...... 109
10.2.2 Activity 2 - Mechanisms ...... 110
10.2.3 Activity 4 - User Interfaces ...... 110
10.2.4 Activity 5 - Policies ...... 111
10.2.5 Activity 6 - Infrastructures ...... 112
10.3 Perspectives ...... 112
10.4 Recommended next steps for standardisation & open source ...... 113

Chapter 1 Introduction

1.1 Scope of this Document

This deliverable aims at giving an overview of standardisation and open source initiatives that are relevant for PrimeLife. It describes the state of work in these initiatives as well as possible and actual cooperation.

1.2 Historic Background

Hidden data collection and the slow erosion of people's privacy led to the start of the P3P work by W3C in 1997. Using P3P, a service can make statements about its privacy practices in both human- and machine-readable form, supporting the individual using the service in making informed decisions. Informed decisions by individuals and procurers were also the aim of ISO/IEC's work on IS 15408 "Evaluation Criteria for IT Security", which started in 1991 and was enhanced with a "Privacy" class covering Anonymity, Pseudonymity, Unlinkability, and Unobservability before its first publication in 1999.

With the advent of "Web 2.0", the Web has become more interactive. Interactive services are built around online communities or offer personalised services. Exchanges of values, ideas and information require a certain level of trust or reputation. In some cases, access to the information is regulated. Currently, a number of initiatives in open source and standardisation exist: OpenID, CardSpace, Higgins, Liberty Alliance, OSIS, as well as language-driven ones like FOAF, XACML or SAML. Mostly, the goal is to provide a notion of "Identity" across several completely decentralised services and to add the hooks a service needs to manage the relationship to its users. Often, the frameworks and languages are limited to a specific technology or service. Privacy and security are ignored or only minimally addressed.

As a consequence, identity management today consists of isolated initiatives. Industrial solutions provide control schemes without privacy support. The current standardisation efforts try to federate several solutions. Such federation, in turn, makes the application of Privacy-Enhancing Technologies (PET) even harder, as the semantics must participate in the interoperability in order to survive the federation. The PRIME project demonstrated the use of

“sticky policies” to transport data protection information with the personal information itself. PRIME also demonstrated how the user may influence or even control the data processing after the transmission of personal data (user-centric approach). The MIT-based projects TAMI and PAW showed that data found on the Web can be used to create hooks for data protection decisions, access control and other constraints on the use of data. This work allows opening up the constraints on data use from a strictly user-input-driven scheme to larger community considerations, and enhancing the machine-processing capabilities at users’ hands. Transparency of data processing is the preferred approach.

The PRIME project already influenced standardisation efforts in W3C and ISO/IEC. At W3C’s Privacy Workshop in October 2006, researchers and practitioners explored new directions in privacy policy languages and enforcement mechanisms. There was significant interest in exploring the interfaces between different, possibly domain-specific, policy languages. In May 2006, ISO/IEC JTC 1/SC 27 "IT Security techniques" established Working Group (WG) 5 on “Identity Management and Privacy Technologies”. This initiative followed two SC 27 Study Periods on "Identity Management" and "Privacy". The new WG started work with projects such as “A framework for identity management” (24760), “A privacy framework” (29100) and “A privacy reference architecture” (29101).

1.3 PrimeLife Aims and Activities

PrimeLife will explore how current identity management schemes can be influenced to include privacy-enhancing tools and hooks. This necessarily also means influencing current standardisation efforts and creating new initiatives where needed. Within this context, PrimeLife will significantly advance the state of the art in standardisation. PrimeLife will work with selected bodies such as ITU, ISO/IEC JTC 1/SC 27/WG 5, OASIS, and W3C to influence the relevant standards activities and encourage them to incorporate advanced privacy-enhancing concepts and technologies. For example, seven privacy-related projects are being worked on in SC 27/WG 5, and PrimeLife is establishing a liaison to this group because of its global outreach and its topical portfolio, which overlaps significantly with that of PrimeLife.

PrimeLife also aims at bringing these technologies and concepts into the open source initiatives, in particular in the areas of anonymous credentials (where partner IBM has made first contributions to Higgins), user interfaces, various kinds of policies (where so far only few, isolated efforts have been made), and infrastructure components. The most prominent open source initiatives on identity management are OpenID, Higgins, and OSIS. However, they are only a first step in the right direction and do not yet provide end-to-end privacy, identity, and trust management. With such open source code available, it is expected that the currently developed global solutions will integrate PrimeLife solutions, leading to a global impact of the project's results and thus to better privacy protection in next-generation services. Success will be measured by the success of the workshop for standardisation of the project's technologies and for cooperation with other projects, by the contributions to standardisation bodies and existing open source communities, and by the extent to which these contributions are taken up.

1.4 PrimeLife Project Overview

Individuals in the Information Society want to protect their autonomy and retain control over personal information, irrespective of their activities. Information technologies hardly consider those requirements, thereby putting the privacy of the citizen at risk. Today, the increasingly collaborative character of the Internet enables anyone to compose services and to contribute

and distribute information. Individuals will contribute throughout their lives, leaving a life-long trail of personal data.

This raises substantial new privacy challenges: A first challenge is how to protect privacy in emerging Internet applications such as collaborative scenarios and virtual communities. A second challenge is how to maintain life-long privacy.

PrimeLife will resolve the core privacy and trust issues pertaining to these challenges. Its long-term vision is to counter the trend towards life-long personal data trails without compromising on functionality. We will build upon and expand the sound foundation of the FP6 project PRIME, which has shown how privacy technologies can enable citizens to exercise their legal rights to control personal information in on-line transactions.

Resolving these issues requires substantial progress in many underlying technologies. PrimeLife will substantially advance the state of the art in the areas of human computer interfaces, configurable policy languages, Web service federations, infrastructures and privacy-enhancing cryptography.

PrimeLife will ensure that the community at large adopts privacy technologies. As one of the activities to this effect, PrimeLife will work with the relevant open source communities and standardisation bodies. This document summarises these communities and bodies and relates them to the technologies and mechanisms PRIME has produced and PrimeLife aims to produce.

1.5 PrimeLife's Approach to Privacy

We envision that users will be able to act and interact electronically in an easy and intuitive fashion while retaining control of their personal data throughout their lives. Users might use a multitude of different means to communicate with several partners employing a variety of platforms. For instance, a user Alice might authenticate to an on-line service created by a mash-up. She automatically logs on using her laptop and later confirms a payment transaction for an electronic article using her mobile phone. In all those transactions, although many potentially untrusted services collaborate in the mash-up, Alice is able to reveal only the minimal information necessary to establish mutual trust and conduct the transaction. For instance, no service will learn any personal information about Alice. Nevertheless, the merchant is guaranteed payment for its services.

In other words, privacy needs to be addressed in a user centric way, i.e., in a way where the users are given control over their data. This requires that

1. the users be informed what data about them is requested (it might be that the data needs to be certified by a third party) and how that data is going to be used; and that
2. the users be provided with technologies that allow them to conduct transactions in such a way that only the necessary information needs to be revealed.

The first item requires that access control is done in an attribute-based way. Unlike an ACL, which lists the users allowed access, attribute-based access control defines for each item what attributes (requirements, credentials) a requester needs to satisfy to get access to the resource. Next, access control needs to be done such that the user is not authenticated with respect to a user ID (and then checked for the required attributes) but rather by allowing access to any user who can prove that the attributes are satisfied. For this to work, various languages for policies, ontologies, and credential and

attribute formats are needed to communicate all these data. Moreover, the users need to be given intuitive user interfaces that guide them through such authorisation procedures. Users should be able to investigate which data will be sent to which partners at what time, and thereby maintain a profile (or partial identity) with each communication partner.
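The contrast between ACL-based and attribute-based access control described above can be sketched as follows. This is an illustrative toy model, not taken from the deliverable: the resource names, the policy structure, and the helper functions are all hypothetical, and a real deployment would express such policies in a language like XACML.

```python
# Hypothetical user-ID based ACL: access requires knowing who you are.
ACL = {"resource1": {"alice", "bob"}}

# Hypothetical attribute-based policy: access requires proving what you
# are, with no user IDs involved.
ABAC_POLICY = {"resource1": {"role": "subscriber", "age_over": 18}}

def acl_allows(user_id, resource):
    """Classic ACL check: the requester must appear in a list by ID."""
    return user_id in ACL.get(resource, set())

def abac_allows(presented_attributes, resource):
    """Grant access to any requester whose certified attributes satisfy
    the resource's requirements; identity is never checked."""
    required = ABAC_POLICY.get(resource, {})
    for attr, value in required.items():
        if attr == "age_over":
            if presented_attributes.get("age", 0) <= value:
                return False
        elif presented_attributes.get(attr) != value:
            return False
    return True

# An anonymous requester gets access by satisfying the attributes:
print(abac_allows({"role": "subscriber", "age": 25}, "resource1"))  # True
# An unknown user fails the ID-based check regardless of attributes:
print(acl_allows("carol", "resource1"))  # False
```

The point of the sketch is that `abac_allows` never sees a user ID: any party able to prove the required attributes is granted access, which is what makes the approach compatible with anonymous credentials.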

Let us discuss the second item. Some of the required technologies we have already mentioned: access control mechanisms, various languages and components to work with them (e.g., editors, evaluation engines, ...), and user interfaces. In addition, we require anonymous (or private) credentials. These are essentially public key certificates that also certify attributes of their users (e.g., an electronic version of a driver's license) and that allow users to control what information contained in the certificate shall be revealed at what time. For instance, using an anonymous driver's license credential, Alice could convince a bartender that she is old enough to order a beer without disclosing any other information attributed to her in the license (in fact, she does not even have to reveal her birth date, but only the fact that she was born sufficiently long ago to be old enough for that beer). This aspect of PrimeLife's approach is taken over from the PRIME project and hence can draw on its results, many of which are already quite mature. We therefore deem that this area offers many opportunities for standardisation and open source contributions.
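The age-predicate example above can be illustrated with a deliberately simplified, non-cryptographic sketch. A real anonymous credential system proves such predicates in zero knowledge; here the "credential" is just a closure, so the verifier only ever learns the yes/no answer to the age question, never the birth date itself. All names and dates are invented for illustration.

```python
from datetime import date

def issue_age_credential(birth_date):
    """Toy stand-in for an anonymous credential: the issuer embeds the
    birth date, and the holder can later prove age predicates without
    ever handing the birth date to a verifier."""
    def prove_older_than(years, today):
        # Reveal only the predicate result, not birth_date itself.
        cutoff = date(today.year - years, today.month, today.day)
        return birth_date <= cutoff
    return prove_older_than

# Hypothetical credential for a holder born on 30 May 1985.
credential = issue_age_credential(date(1985, 5, 30))
print(credential(18, today=date(2008, 5, 30)))  # True: old enough
print(credential(30, today=date(2008, 5, 30)))  # False
```

The design point mirrors the text: the verifier's question is "born sufficiently long ago?", and the answer is the only information disclosed.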

However, the approach of revealing only the minimally necessary information does not address all the privacy problems that arise. In many cases, e.g., social networks or wiki pages, users indeed want to reveal personal information about themselves and to exchange this information. Moreover, as information is provided by many different parties, it becomes much harder to judge whether the information is trustworthy. To establish the latter, users will potentially have to reveal even more information about themselves in order for their communication partners to assess to what extent and with respect to what they can be trusted. PrimeLife aims to address these challenges as well. In this area, PrimeLife is to a large extent conducting basic research, and we therefore expect fewer opportunities for standardisation and maybe just a few isolated contributions to open source communities.

Chapter 2 Target Technology Platforms

Data is exchanged between various kinds of computer systems connected to the Internet. The target platform for PrimeLife's work is generally the Web. This section reviews the architecture of the Web and its essential technology components. Although few standards are available for privacy and security on the Web in general, the deployment of Web services has led to dedicated work for Service-Oriented Architectures.

2.1 The Web

2.1.1 Architecture

The World Wide Web (see WEBARCH [WEBARCH]) is a global information space built on top of a set of relatively simple technologies which, in combination, have enabled its worldwide deployment and have catalysed the Internet's growth and use over the last 15 years.

Key design elements of the World Wide Web include:

Identification

Uniform Resource Identifiers [URI] serve to identify resources on the Web. They provide an abstraction layer for resource identification across protocols and across document formats. New URI schemes can be introduced without having to change the surrounding format (e.g., HTML, SVG, MathML). Extension by URI scheme enables deployment of new protocols without having to change surrounding document formats. However, it is not true that dereferencing the same URI will always result in the same protocol interaction. Further, different URI schemes expose different methods -- the HTTP protocol, e.g., supports both retrieval and information posting methods (GET and POST).

Interaction

While URIs provide the means for extensibility in protocol space, a social agreement (codified in the set of supported protocols in user agents) leads to the choice of HTTP as the primary protocol for the Web. Key properties of HTTP include safe information retrieval (the simple retrieval of a Web page is by convention free of side effects; side-effect-bearing interactions can be distinguished) and negotiation of resource representations (depending on, e.g., language preferences and supported data formats).

Formats

For data formats (as for protocols) the practically unlimited extensibility of the protocol and addressing layers is complemented by social conventions (codified in deployment) about the formats in use: various variants of HTML, style and scripting languages, and a few graphics formats effectively form the backbone of today's Web.
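The identification-by-URI design element described above can be sketched with Python's standard URI parser: the generic URI structure (scheme, authority, path) stays uniform while the scheme selects the protocol. The example URIs are invented for illustration.

```python
from urllib.parse import urlsplit

# Sketch: one generic syntax identifies resources across very different
# protocols; only the scheme component changes the interpretation.
for uri in ("http://example.org/page?q=privacy",
            "mailto:alice@example.org",
            "ftp://example.org/pub/file.txt"):
    parts = urlsplit(uri)
    # netloc is empty for schemes (like mailto) without an authority part.
    print(parts.scheme, parts.netloc or "-", parts.path)
```

The same parsing code handles all three schemes, which is exactly the abstraction-layer property the text describes.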

2.1.2 Standards

This section summarises the specifications that are at the core of today's instantiation of the Web: HTTP as the primary retrieval protocol, HTML and CSS as the primary formats used for documents and their style, and the Document Object Model's application programming interfaces as used by the ECMAScript scripting language (more commonly referred to as JavaScript).

HTTP and TLS

The Hypertext Transfer Protocol (RFC 2616 [RFC 2616]) is a fundamentally stateless request/response protocol that is used to interact with Web resources. Methods that can be used in this interaction include simple retrieval (GET; a so-called safe method, as it must not cause side effects), submission of information (using POST or PUT), and other manipulations (using, e.g., DELETE). HTTP further supports content and language negotiation, redirection functionality, and advanced mechanisms for caching and proxying of requests.
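The request/response exchange described above can be sketched by composing a minimal HTTP/1.1 request as it would appear on the wire. The helper function and the host name are illustrative; only GET and HEAD are treated as safe (side-effect-free) methods in this sketch.

```python
# Methods treated as "safe" here: by convention they cause no side effects.
SAFE_METHODS = {"GET", "HEAD"}

def build_request(method, path, host):
    """Compose a minimal HTTP/1.1 request (illustrative helper)."""
    assert method in {"GET", "HEAD", "POST", "PUT", "DELETE"}
    return (f"{method} {path} HTTP/1.1\r\n"
            f"Host: {host}\r\n"          # Host header is mandatory in 1.1
            f"Connection: close\r\n"
            f"\r\n")                     # blank line ends the header block

request = build_request("GET", "/index.html", "example.org")
print(request.splitlines()[0])   # GET /index.html HTTP/1.1
print("GET" in SAFE_METHODS)     # True
print("POST" in SAFE_METHODS)    # False: POST may change server state
```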

Browser cookies -- while broadly deployed -- are a notoriously underspecified aspect of HTTP; yet, they serve a critical function on today's Web, by adding session management to an otherwise stateless protocol.

Authentication within HTTP is limited to simple username and password based approaches, even though the protocol's framework can be extended to different authentication protocols. In practice, even these features go largely unused. Deployments mostly rely on HTML forms to solicit user names and passwords, and on cookie-based session management to tie a prior authentication transaction to a session. Identity systems for the Web take a similar approach: The basic identity transaction is either conducted on top of HTTP, or through a separate protocol, and the result of that transaction is then tied to a session.
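The cookie-based session pattern described above can be sketched with Python's standard cookie handling: after a form-based login, the server issues a session identifier in a Set-Cookie header, and the client echoes it back on later requests, tying them to the earlier authentication. The session token value is invented for illustration.

```python
from http.cookies import SimpleCookie

# Server side: after a successful form login, issue a session cookie.
issued = SimpleCookie()
issued["SESSIONID"] = "a3f1c9"           # hypothetical session token
issued["SESSIONID"]["path"] = "/"
issued["SESSIONID"]["httponly"] = True   # keep it away from page scripts
set_cookie_header = issued.output(header="Set-Cookie:")

# Client side: parse the header value and store the cookie; subsequent
# requests carrying SESSIONID are tied to the authenticated session.
received = SimpleCookie()
received.load(set_cookie_header.split(":", 1)[1].strip())
print(received["SESSIONID"].value)  # a3f1c9
```

This is the mechanism that retrofits session state onto HTTP's otherwise stateless request/response model.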

Similarly, confidentiality and signature services are out of scope for HTTP. These functionalities are instead provided by the TLS protocol (formerly known as SSL). TLS, too, provides a framework for transporting user credentials (RFC 2818 [RFC 2818]).
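The division of labour described above, with HTTP delegating confidentiality and peer authentication to TLS, can be glimpsed through Python's `ssl` module, whose default client-side context enforces certificate checking and host-name verification:

```python
import ssl

# Sketch: the defaults a modern client-side TLS layer applies before any
# HTTP request is sent over it.
context = ssl.create_default_context()
print(context.verify_mode == ssl.CERT_REQUIRED)  # True: peer cert checked
print(context.check_hostname)                    # True: name must match
```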

Deployment of HTTP is virtually ubiquitous: HTTP servers can be found in about any network enabled device (often as configuration and control interface of choice); HTTP clients are found on mobile phones, gaming consoles, and of course personal computers.

The broad deployment of HTTP causes significant inertia: changes to the HTTP protocol (e.g., a new version) can only be expected to take effect in the medium term. The ongoing standards effort at the IETF [HTTP bis] is at this time (April 2008) focused on specification maintenance and errata work; it is expected that this work will produce a higher-quality version of the HTTP/1.1 specification.

As far as security and identity mechanisms for HTTP are concerned, the currently active IETF working group is specifically chartered to only document the protocol's properties. Yet, there is a certain amount of momentum to further investigate security mechanisms for HTTP, and it is expected that this momentum will further materialise during the lifetime of PrimeLife (see HTTP bis Security Properties [HTTPbis-security]).

HTML, CSS

The Hypertext Markup Language, HTML, is (like HTTP) at the root of the Web's stack of specifications. It gives authors the means to:

• Publish online documents with headings, text, tables, lists, photos, etc. • Retrieve online information via hypertext links, at the click of a button. • Design forms for conducting transactions with remote services, for use in searching for information, making reservations, ordering products, etc. • Include spread-sheets, video clips, sound clips, and other applications directly in their documents.

The object tag in HTML enables embedding of arbitrary objects, and subsumes the functionality of the deprecated applet tag. Extensions to HTML can also be based on using class names as an indicator for content's semantics. The microformat community [Microformats] is advocating this approach to embed semantic data with HTML documents.
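The class-name-based extension approach described above can be sketched by extracting class attributes from ordinary HTML. The markup fragment is illustrative (using the hCard microformat's `vcard` and `fn` class names); a microformat consumer would interpret such classes as semantic labels.

```python
from html.parser import HTMLParser

# Hypothetical snippet: class attributes carry semantics within plain HTML.
DOC = '<div class="vcard"><span class="fn">Alice Example</span></div>'

class ClassCollector(HTMLParser):
    """Collect all class tokens, the hooks a microformat consumer reads."""
    def __init__(self):
        super().__init__()
        self.classes = []
    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name == "class":
                self.classes.extend(value.split())

collector = ClassCollector()
collector.feed(DOC)
print(collector.classes)  # ['vcard', 'fn']
```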

Current versions of HTML are HTML 4.01 [HTML 4.01], the last SGML version of HTML, and XHTML 1.0 [XHTML 1.0], a reformulation of that language in XML.

Ongoing development is focusing on HTML5 [HTML5], an effort to provide an interoperable specification for HTML and associated APIs, and XHTML 2 [XHTML 2], the next generation of the XML-based XHTML.

Layout information for HTML (and other structured document formats, including XML applications) can be specified using Cascading Style Sheets [CSS 2].

Document Object Model, ECMAScript, XMLHttpRequest

The W3C Document Object Model (DOM [DOM Level 3 Core]) is an API for manipulating HTML and XML documents. It relies on a structural model of the document and defines accessors to the various components of this structure. It is platform- and language-neutral. It is organised in levels which specify required and optional features: Level 1 defines a core model and a basic API for HTML; Level 2 enhances the core model and adds views, events, style and traversal APIs; Level 3 has a more complete core model (including more DOM types) and provides load-and-save and validation APIs. All the W3C DOM Recommendations include bindings to Java and to ECMAScript. The DOM is the preferred API for manipulating HTML and XML documents on the Web when access to elements in non-sequential order is required.
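The tree-structured, random-access manipulation described above can be sketched with Python's `xml.dom.minidom`, which implements the W3C DOM Core interfaces; the document fragment is invented for illustration.

```python
from xml.dom.minidom import parseString

# Sketch: DOM access is by structure, not by sequential parse order.
doc = parseString("<html><body><p id='greeting'>Hello</p></body></html>")

paragraph = doc.getElementsByTagName("p")[0]
print(paragraph.getAttribute("id"))   # greeting

# Mutating a text node through the tree model updates the document.
paragraph.firstChild.data = "Hello, Web"
print(doc.getElementsByTagName("p")[0].firstChild.data)  # Hello, Web
```

In a browser the same interfaces are reached from ECMAScript; the point here is only the shape of the API: navigate the tree, read attributes, mutate nodes.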

ECMAScript, standardised by ECMA International (see ECMAScript [ECMAScript]) in 1999, is mostly used as a language to manipulate Document Object Models on the Web. It is the major language for client-side scripting, implemented in all common clients on all kinds of platforms. ECMA TC39 [ECMA TC39] is actively working on the ECMAScript 4 language.

The XMLHttpRequest [XMLHttpRequest] object is another neutral API; it allows scripts to perform HTTP client requests without reloading the Web page. It builds on parts of the DOM model and constitutes the core of the Ajax technique. The goal of the specification is to unify the techniques used for dynamic content and to achieve the interoperability that proprietary technologies (ActiveX, inline frames, proprietary applets, Flash...) prevent. It is currently a W3C Working Draft.

2.1.3 Evolving Web Application Development Paradigms - Web 2.0

The power of Web applications often comes from the easy combination of data and services across multiple sources: In the easiest case, a meeting description might include a map service inline, highlighting the meeting location. More complex mash-ups might combine any number of data sources, factor in personal information (e.g., travel plans) to answer the user's questions, and trigger activities on multiple possible services.

As application programming interfaces and client-side scripting have matured in recent browser generations, they have enabled an increasing shift of complexity toward the client side: Where much of the complexity of "Web 1.0" applications resided on the server side, "Web 2.0" and "AJAX" programming puts complexity on the client, and thinks of the server side as generic APIs that can be invoked by complex applications running on the client.

These choices have a profound impact on the security and privacy structure of the Web: On the one hand, there is increased susceptibility of services to abuse, as careless use of Web 2.0 design patterns might put security (and privacy) critical aspects of business logic on the client, and thereby in the hands of an attacker. More dangerously, mash-ups will often run scripts from different trust domains within a single domain of control (or within several, insufficiently isolated domains of control). As a result, the technical environment's enforcement of social and business agreements between different data processing parties becomes difficult in actually deployed mash-up environments.

Some recent research and development into JavaScript security models has the potential to improve the client-side Web programming environment's ability to enforce security and privacy policies. We specifically mention the open source Caja [Caja] project that extends JavaScript with a capability-based security model. This security model is deployable now, through a JavaScript-to-JavaScript compiler.
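
The core idea of a capability model — untrusted code can only affect objects it has explicitly been handed — can be sketched in a few lines of Python. This is a loose analogy only (real systems such as Caja rewrite the guest code itself, which is far stronger than namespace restriction); all names here are invented:

```python
def run_guest(source, capabilities):
    """Execute untrusted code with ONLY the granted capabilities in scope.

    Crude illustration of capability discipline, not a real sandbox.
    """
    scope = {"__builtins__": {}}   # no ambient authority
    scope.update(capabilities)     # only what was explicitly granted
    exec(source, scope)
    return scope

log = []
granted = {"emit": log.append}     # the one capability we hand out

run_guest("emit('hello from guest')", granted)
print(log)                          # -> ['hello from guest']

# Without a capability for 'open', the guest cannot touch the filesystem.
try:
    run_guest("open('/etc/passwd')", granted)
    denied = False
except NameError:
    denied = True
print("filesystem access denied" if denied else "leak!")
```

The guest's authority is exactly the set of references passed in, which is the property a mash-up container needs in order to isolate scripts from different trust domains.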

On the other hand, modern Web application development techniques improve the abilities of collaborating actors to share user data and track behaviour, beyond what is possible with simple cookies, HTML pages, and forms. Classical privacy-enhancing techniques that might, e.g., impose controls on cookies or warn before form submissions are losing their usefulness, as new technologies enable storage of client-side state (e.g. to enable offline Web applications), tracking of users, and exchange of information between different sites.

It is an open research question what leverage points are best suited to build the enforcement of privacy policies into Web applications, and to create business and technical incentives for Web application developers to disclose privacy intents.

Additional approaches to data sharing in Web 2.0 scenarios involve the passing of personal information and authorisations between different sites through (mostly) redirect patterns. Relevant developments include OAuth (see chapter 9) and OpenID (see chapter 7).

PrimeLife Perspective

Overall, mash-ups and the use of Web 2.0 programming patterns to process personal information are going to stay with us. In the context of Activity 1, PrimeLife will analyse these use cases in more detail.

Standardisation Efforts

Specifications relevant to Web 2.0 programming patterns are under development in a number of places: The W3C HTML Working Group [HTML WG] is developing the HTML5 [HTML 5] specification, which includes numerous APIs (including APIs for local storage of data, cross-domain communications, and relevant security models); additional related work is done by the W3C Web Application Formats [WAF WG] and Web API Working Groups [Web API]; a proposal to merge these groups is currently (May 2008) under consideration by the W3C membership. Some work on Caja [Caja] has found its way into the ECMAScript standardisation work at ECMA TC39 [ECMA TC39]. Other relevant work is either being done under the umbrella of open source projects or ad-hoc initiatives, and may make its way into more formal standardisation during the lifetime of the project.

2.1.4 Semantic Web

The Semantic Web provides a common framework that allows data and metadata to be shared and reused across application, enterprise, and community boundaries. Key ingredients of this framework include the simple, yet powerful data model of the Resource Description Framework [RDF-PRIMER]; a standardised query language [SPARQL]; and an ontology language [OWL] that enables machine-readable expression of the relationships between different concepts.

The usage of URI references as identifiers enables different parties to coin new terms without the risk of clashing with others, thereby enabling easier integration and mixing of data.
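
The effect of URI-based identifiers can be sketched with a toy triple store: two independently produced data sets merge without clashes because their terms are full URIs, and a pattern query spans both at once (the URIs and data are invented for illustration; a real deployment would use an RDF store and SPARQL):

```python
# Two parties describe the same person using their own vocabularies.
party_a = [
    ("http://example.org/people/alice", "http://xmlns.com/foaf/0.1/name", "Alice"),
    ("http://example.org/people/alice", "http://xmlns.com/foaf/0.1/mbox", "mailto:alice@example.org"),
]
party_b = [
    ("http://example.org/people/alice", "http://other.example/vocab#employer", "ACME"),
]

# Merging RDF graphs is just set union of triples.
graph = set(party_a) | set(party_b)

def query(graph, s=None, p=None, o=None):
    """A SPARQL-like triple pattern match; None acts as a variable."""
    return [(ts, tp, to) for (ts, tp, to) in graph
            if (s is None or ts == s)
            and (p is None or tp == p)
            and (o is None or to == o)]

# Everything known about Alice, from both sources at once.
about_alice = query(graph, s="http://example.org/people/alice")
print(len(about_alice))   # -> 3
```

This trivial ease of aggregation is precisely what makes the technology double-edged from a privacy perspective.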

From a privacy perspective, Semantic Web technology will enable ever easier and ever more powerful aggregation of personal information. Where a lack of integration might in the past have helped individual privacy, the Semantic Web promises to overcome that gap.

The power of Semantic Web technologies cuts both ways, however: It also eases the tasks of identity management by providing a framework that can be used to express not just personal information, but also privacy practices, policies, and preferences. As broader and more effective data integration becomes possible, distributed and scalable compliance monitoring becomes possible, leading to improved accountability of those who process personal information.

2.1.5 Privacy technologies for the Web

Very few Privacy-Enhancing Technologies (PET) are standardised today. There is a large variety of tools (see chapter 8) that do not interoperate. Knowledge put into one tool cannot be reused in another tool. This also makes it very hard for e-commerce sites to obtain predictable results when designing their portals. P3P is the only major exception to this rule. It is discussed together with other relevant standards in more detail separately in this document, specifically P3P (see chapter 4) and APPEL (see chapter 4). APPEL, as opposed to P3P, has never reached the status of a Recommendation.

2.2 Service Oriented Architectures

The basis for this text is an updated version of Geuer-Pollmann/Claessens [Geuer-Pollmann/Claessens]. The term ‘Web services’ is found nearly anywhere in the enterprise platforms and networking domains. Generally speaking, ‘Web service’ refers to the transfer of XML via Internet protocols such as HTTP or SMTP. The ‘Simple Object Access Protocol’ (SOAP Version 1.2, 2007 [SOAP12]) is an XML-based protocol which defines how structured and typed information can be exchanged between peers in a distributed and decentralised environment.

In order to provide security, reliability, transaction capabilities and rich metadata support for Web services, additional specifications exist on top of the XML/SOAP stack. Figure 1 provides an overview of important Web service specifications and how they relate to each other. XML lays the basis for all the standards. Other base technologies in this group are SOAP, which encodes service invocations, its transport protocols HTTP/UDP, and basic XML cryptography schemas. The next layer, called 'Messaging', groups standards that provide advanced communication features such as flow control, transactions, establishment of secure communication channels, and trust establishment. The 'Infrastructure and Profiles' layer contains standards which typically affect the interaction between multiple services, such as interoperability, trust establishment between services/parties across organisational borders, and distributed management. The 'Metadata' group is somewhat orthogonal to the latter three layers. It groups standards that deliver information on how to find and invoke a service, e.g. address lookup, specific metadata, security policy, and service interface.

The standards in Figure 1 are colour-coded, indicating the defining organisation and the status of the specification. The core technologies in the XML space are all defined by the World Wide Web Consortium (W3C). Many of the higher-layer specifications are driven by multiple industry players, including Microsoft, IBM, BEA Systems, SAP and others. These specifications help to make progress in the interoperability between the different industry platforms, most notably Microsoft’s .NET [MS .NET] platform and IBM’s WebSphere [IBM WebSphere] software. The results are often donated to standards organisations like OASIS [OASIS], which then care for their future evolution.

Figure 1: Overview of existing Web service specifications and their relations.

2.2.1 OASIS WS-Security

The WS-Security [WS-Security] specification defines mechanisms for integrity and confidentiality protection, and data origin authentication for SOAP messages and selected parts thereof. The cryptographic mechanisms are utilised by describing how XML Signature and XML Encryption are applied to parts of a SOAP message. That includes processing rules so that a SOAP node (intermediaries and ultimate receivers) can determine the order in which parts of the message have to be validated or decrypted. These cryptographic properties are described using a specific header field, the wsse:Security header. This header provides a mechanism for attaching security-related information to a SOAP message, whereby multiple wsse:Security headers may exist inside a single SOAP message. Each of these headers is intended for consumption by a different SOAP intermediary. This enables intermediaries to encrypt or decrypt specific parts of a message before forwarding it, or enforces that certain parts of the message are validated before the message is processed further. Besides the cryptographic processing rules for handling a message, WS-Security defines a generic mechanism for associating security tokens with the message. ‘Associating a security token’ means that one or more tokens are included in wsse:Security headers in the message and that a referencing mechanism is introduced to refer to these tokens. Tokens generally are either identification or cryptographic material, or expressions of capabilities (e.g. signed authorisation statements). For instance, the certificate for signature validation may be added to the header. That may be done either by placing it into the signature itself (which makes reuse complicated and fragile) or by directly making it a child of the wsse:Security header and referencing it from the signature. The latter approach has the advantage that other signatures or security operations may directly refer to that token.
WS-Security, available in version 1.1 since February 2007, defines a simple username token, a container for arbitrary binary tokens (base64-encoded), a container for XML-formatted tokens, and an encrypted data token.
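
The basic pattern — an envelope whose header carries a security block holding one or more tokens that other parts of the message can reference — can be sketched with Python's ElementTree. This is a structural sketch only: the wsse namespace URI below is a placeholder, not the real WS-Security namespace, and the payload is invented:

```python
import xml.etree.ElementTree as ET

SOAP = "http://www.w3.org/2003/05/soap-envelope"
WSSE = "urn:example:wsse"   # placeholder for the real WS-Security namespace URI

envelope = ET.Element(f"{{{SOAP}}}Envelope")
header = ET.SubElement(envelope, f"{{{SOAP}}}Header")
body = ET.SubElement(envelope, f"{{{SOAP}}}Body")

# The security header carries the token; several such headers may exist,
# each addressed to a different SOAP intermediary.
security = ET.SubElement(header, f"{{{WSSE}}}Security")
token = ET.SubElement(security, f"{{{WSSE}}}UsernameToken")
ET.SubElement(token, f"{{{WSSE}}}Username").text = "alice"

ET.SubElement(body, "Payload").text = "order #42"

print(ET.tostring(envelope, encoding="unicode")[:60])
```

A real WS-Security message would additionally carry signature and encryption elements in the same header, referencing the token.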

Additional specifications define various ‘token profiles’ that introduce special token formats. For instance, the X.509 Certificate Token Profile 1.1 [X.509 Certificate TP 1.1] defines how X.509 certificates, certificate chains or PKCS#7 certificate revocation lists may be used in conjunction with WS-Security. The ‘Username Token Profile 1.1’ extends the existing username token by adding literal plaintext passwords, hashed passwords, time-variant parameters (nonces) and creation timestamps. The Rights Expression Language (REL) Token Profile 1.1 [REL TP 1.1] links WS-Security to ISO/IEC 21000-5. The Kerberos Token Profile 1.1 [Kerberos TP 1.1] defines how Kerberos tickets are embedded into SOAP messages, and the SAML Token Profile 1.1 [SAML TP 1.1] defines how SAML 1.1 and 2.0 assertions (see also SAML 2.0 Bindings [SAML 2.0 Bindings]) can be included.

WS-Security is one of the basic security specifications in the Web service world. Therefore it is definitely relevant for PrimeLife.

2.2.2 OASIS WS-SecureConversation

The WS-Security specification introduced the concept of message-level security. Using only WS-Security to encrypt and sign Web service messages, however, incurs considerable key-management overhead. For instance, if a Web service requires each message to be encrypted using a 2048-bit RSA operation, and 1000 service invocations may happen during the next 3 minutes, it becomes obvious that this concept does not scale very well. At the transport layer, HTTP 1.1 permits keeping an existing SSL/TLS connection open so that subsequent requests to a Web server may be sent via the already established secured connection. WS-SecureConversation [WS-SecureConversation] brings this concept into the Web services world. This is done by introducing mechanisms to establish and share so-called ‘security contexts’. Based on established security contexts or arbitrary already existing shared secret keys, WS-SecureConversation provides mechanisms to derive shared key material (read: session keys). Security contexts can be established in three different ways. First, a security context token (SCT) may be retrieved using the mechanisms of WS-Trust. In that case, the requestor retrieves the SCT from some security token service that is trusted by the Web service. The second way is that the requestor creates its own SCT and sends that SCT to the Web service. The Web service, however, may not trust the requestor to create an appropriate SCT and may reject the self-created SCT. A third option is that both the requestor and the Web service mutually agree on a security context using a challenge-and-response process. An established SCT is afterwards used to derive session keys. These session keys may then be used for subsequent message encryption and message authentication codes (symmetric ‘signatures’) with WS-Security.
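
The derivation step can be sketched as follows: repeated HMAC applications expand a context secret into per-purpose session keys, loosely modelled on the P_SHA-1 construction that WS-SecureConversation borrows from TLS. The labels, nonce and key sizes below are invented for illustration:

```python
import hashlib
import hmac

def derive_key(secret: bytes, label_and_seed: bytes, length: int) -> bytes:
    """P_SHA-1-style expansion: A(i) = HMAC(secret, A(i-1)),
    output = HMAC(secret, A(1)+seed) | HMAC(secret, A(2)+seed) | ..."""
    a = label_and_seed
    out = b""
    while len(out) < length:
        a = hmac.new(secret, a, hashlib.sha1).digest()
        out += hmac.new(secret, a + label_and_seed, hashlib.sha1).digest()
    return out[:length]

# One established security context, several independent session keys.
context_secret = b"shared secret from the established SCT"
sign_key = derive_key(context_secret, b"signature" + b"nonce-1", 16)
enc_key = derive_key(context_secret, b"encryption" + b"nonce-1", 16)

print(sign_key != enc_key)   # -> True: distinct keys per purpose
print(len(sign_key))         # -> 16
```

Cheap symmetric operations then replace the per-message RSA operations, which is the scaling argument made above.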

In the scope of PrimeLife, as in most other Web service based solutions, WS- SecureConversation is a reasonable addition to WS-Security.

2.2.3 OASIS WS-Trust

The WS-Trust [WS-Trust 1.3] specification introduces the concept of ‘security token services’ (STS). A security token service is a Web service that can issue and validate security tokens. For instance, a Kerberos ticket granting server would be an STS in the non-XML world. A security token service offers functionality to issue new security tokens, to renew existing tokens that are expiring and to check the validity of existing tokens. Additionally, a security token service can convert one security token into a different security token, thus brokering trust between two trust domains. For example, a Web service describes required security tokens for Web service calls using WS-SecurityPolicy/PolicyAttachment. A requestor may want to call that specific Web service but may not have the right security tokens indicated by the policy. The Web service may require SAML credentials from a particular trust domain whereas the requestor only has an X.509 certificate from its own domain. By requesting the ‘right’ matching token (credential) from the security token service, the requestor may get back a token from the STS that can be included when calling the Web service in question. The decision what exactly the ‘right’ token is can be made either by the requestor or by the STS. The requestor may inspect the Web service’s policy and specifically ask the STS: "I have the attached X.509 certificate and need a SAML token". The other option is that the requestor includes the tokens it possesses and states what Web service it intends to call: "I possess the following tokens and I would like to call the Web service http://foo/bar. Please give me whatever token may be appropriate." WS-Trust provides a rich interface that permits the implementation of various use cases. For instance, the requestor may include time-variant parameters as entropy for a token generation process. The token service may return secret key material to the requestor (so-called proof-of-possession tokens) along with the requested security token, so that the requestor can prove that it possesses the security token. For instance, the requested security token may be a certificate whereas the proof-of-possession token is the associated private key. The security token service may also return multiple keys, like a certificate along with its validation chain, or it may create key exchange tokens with which the requestor can encrypt key material for the intended Web service.
A requestor can also express requirements on algorithms and key strengths for required tokens. WS-Trust defines protocols, including challenge-and-response protocols, to obtain the requested security tokens, thus enabling the mitigation of man-in-the-middle and message replay attacks. The WS-Trust specification also covers the case where a requestor needs a security token to implement some delegation of rights to a third party. For instance, a requestor could request an authorisation token for a colleague that is valid for a given time interval. WS-Trust utilises WS-Security for signing and encrypting parts of SOAP messages as well as WS-Policy/SecurityPolicy to express and determine what particular security tokens may be consumed by a given Web service.
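
The token exchange at the heart of WS-Trust can be sketched as a toy security token service that swaps a token of one type for one of another, provided the incoming token validates. Token formats, issuer names and trust rules here are all invented; real WS-Trust exchanges are XML request/response messages (RST/RSTR) that may also carry proof-of-possession keys:

```python
class ToyTokenService:
    """Caricature of an STS: validates an incoming token and issues
    a token of the type the target Web service requires."""

    def __init__(self, trusted_issuers):
        self.trusted_issuers = trusted_issuers

    def issue(self, presented_token, wanted_type):
        kind, issuer, subject = presented_token
        if issuer not in self.trusted_issuers:
            raise PermissionError("presented token not trusted")
        # Broker trust: same subject, new token type,
        # now vouched for by this STS.
        return (wanted_type, "toy-sts", subject)

sts = ToyTokenService(trusted_issuers={"corp-ca"})

# "I have this X.509-like token and need a SAML-like one."
x509_like = ("x509", "corp-ca", "alice")
saml_like = sts.issue(x509_like, wanted_type="saml")
print(saml_like)   # -> ('saml', 'toy-sts', 'alice')

# Tokens from unknown issuers are refused.
try:
    sts.issue(("x509", "unknown-ca", "mallory"), wanted_type="saml")
    rejected = False
except PermissionError:
    rejected = True
```

The requestor then presents the newly issued token when calling the Web service whose policy demanded it.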

WS-Trust is a basic building block that can be used to rebuild many of the already existing security protocols and make them fit directly in the Web services world by using Web service protocols and data structures. It is thus essential for service composition in PrimeLife.

2.2.4 W3C WS-Policy

The Web Services Policy Framework (WS-Policy) [WS-Policy 1.5] provides a general-purpose model to describe Web service related policies. A policy can describe properties, requirements and capabilities. For example, a policy may mandate that a particular Web service only provides services between 8:00 AM and 5:00 PM or that service requests must be signed using an X.509 certificate (of course not with the certificate itself but with its associated private key). Policies also allow the definition of different available options, so that machines can figure out, based on their own policy and a service’s policy, which requests may be accepted and which may not. WS-Policy by itself only provides a framework to describe logical relationships between policy assertions, without specifying any assertion. WS-PolicyAttachment [WS-PolicyAttachment 1.2] attaches policies to different subjects. ‘Web service related’ means that policies apply to service endpoints or to XML data. A policy can be attached to an XML element (by embedding the policy itself or a link to the policy inside the element) or by linking from the policy to the subject that is described by the policy. WS-PolicyAttachment also defines how policies can be referenced from WSDL documents and how policies can be attached to UDDI entities and stored inside a UDDI repository. WS-MetadataExchange [WS-MetadataExchange 1.1] defines protocols to retrieve metadata associated with a particular Web service endpoint. For example, a WS-Policy document can be retrieved from a SOAP node using WS-MetadataExchange.
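
The "different available options" in a policy can be sketched as a list of alternatives, each alternative being a set of assertions; two parties can interoperate if some alternative of one side is fully supported by the other. The assertion names below are invented, and this simplifies WS-Policy's normal form (an ExactlyOne operator over All operators) to plain Python sets:

```python
# A policy in normal form: choose exactly one alternative,
# each alternative being a set of assertions that must all hold.
service_policy = [
    frozenset({"sign:x509", "encrypt:aes256"}),
    frozenset({"sign:saml"}),
]

client_supports = {"sign:x509", "encrypt:aes256", "encrypt:aes128"}

def compatible_alternatives(policy, supported):
    """Alternatives whose every assertion the other side can satisfy."""
    return [alt for alt in policy if alt <= supported]

matches = compatible_alternatives(service_policy, client_supports)
print(len(matches))   # -> 1: the client can use the X.509 alternative
```

This set-based matching is the essence of how a machine decides, from its own policy and a service's policy, which requests will be accepted.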

2.2.5 OASIS WS-SecurityPolicy

WS-SecurityPolicy [WS-SecurityPolicy 1.3] defines certain security-related assertions that fit into the WS-Policy framework. These assertions are utilised by WS-Security, WS-Trust and WS-SecureConversation. The ‘SecurityToken’ assertion tells a requestor what security tokens are required to call a given Web service (‘security tokens’ are described in the sections on WS-Security and WS-Trust). Integrity and confidentiality assertions identify the message parts that have to be protected and define which algorithms are permitted. Visibility assertions identify which particular message parts have to remain unencrypted in order to allow SOAP nodes along the message path to operate on these parts. The ‘MessageAge’ assertion enables entities to specify after what time a message is to be treated as expired.

2.2.6 PrimeLife Perspective

Web services are an important building block to realise Web-based business processes. Hence, their implications in terms of security and privacy have to be considered to meet the project goals. In PrimeLife, Activity 6 will analyse this area in more detail and will investigate new approaches to address the main privacy issues.

Chapter 3 Architectures and Frameworks

This section presents work of the Working Group (WG) 5 "Identity Management and Privacy Technologies" of ISO/IEC JTC 1/SC 27 "IT Security techniques", which is described in more detail in the section about the Specification Developing Organisations (see chapter 8). PrimeLife is establishing a liaison with WG 5 due to its global outreach and its topical portfolio overlapping significantly with that of PrimeLife. The terminology in this section follows that of the respective standards.

3.1 Identity Management

3.1.1 IdM Framework (24760)

This standard aims to provide a framework for the definition of identity and the secure, reliable, and private management of identity information. This framework should be applicable to individuals as well as organisations of all types and sizes, in any environment and regardless of the nature of the activities they are involved in.

Identity Management (IdM) is the secure management of identities, the identification process during which an entity may be authenticated, and the information associated with the identification of an entity within some context. The entity might be anything that can be uniquely recognised, a person, an animal, a device, an object, a group, an organisation, an information object, etc. Entities may have multiple identities that may be used in different contexts, often called Partial Identities.

The context for the identification process might be within an organisation’s boundaries, or federated across organisations. This standard will cover the life cycle of identities and identity information as they are established, modified, suspended, terminated or archived. Information associated with identities may change over time and must therefore be carefully managed. Some associations of an entity might be informal and change frequently. Other associations might be formal, specific relationships, such as people, policy-based organisational roles, and financial accounts that remain stable over time. Identity attributes are often securely stored within tokens, directories, access devices, or database management systems.

Identities may be associated with policy-based roles, and these roles may be associated with duties and responsibilities, and privileges and permissions to access resources. An Identity Management System (IdMS) also needs to interact with other information systems that require or generate identity information.

This project is in the "Working Draft" stage.

3.1.2 A Framework for Access Management (29146)

This standard aims to provide a framework for the definition of Access Management and the secure management of the process to access information. This framework is applicable to any kind of users, individuals as well as organisations of all types and sizes, and should be useful to organisations at any location and regardless of the nature of the activities they are involved in.

Access Management (AcM) is the secure management of the processes to access information and the information associated with the accountability of an entity within some context. The entity might be anything that can be uniquely recognised: a person, an animal, a device, an object, a group, an organisation, a piece of information, etc. Entities may have multiple identities that may be used in different contexts. Identities may be federated or not. The processes include, without limitation, identification; authorisation; entitlement and privilege management; authentication; and the review of information usage. The intent is not to define in detail the technical aspects of each of these processes but to identify the overall process of access to the information. The different services composing an Access Management framework are typically

• an Identity Management service
• an Entitlement, Privilege and Authorisation Management service
• an Authentication Management service, and
• a Usage Control and Monitoring service.

Other services may be identified during the development of the standard. A typical model for an Access Management framework is composed of several services that could be decoupled or integrated in a solution suite. The standard must clarify the relation between the framework and the services mentioned, and their interactions. The context for access might be within an organisation's boundaries or federated across organisations. This standard describes the life cycle of access and the security services associated with that access as they are established, modified, suspended and terminated. Information associated with accesses may change over time and must therefore be carefully managed. The framework must provide the means to administer the access definitions of all users and to publish the definitions to the information systems that it may serve. Access definitions may be associated with policy-based rules and roles, and these roles may be associated with duties, responsibilities, privileges and permissions to access resources. An Access Management System ensures the secure management of access definitions, which also includes the delivery of credentials to users entitled to roles and the secure maintenance thereof.

This project is in the "New Project" stage and a first Working Draft is expected for Summer 2008.

3.1.3 Entity authentication assurance (29115)

This project aims at describing the guidelines or principles that must be considered in entity authentication assurance and the rationale for why it is important to an authentication decision, especially:

• a framework for assessing "how close" an entity is to the claimed one throughout an identity's life cycle;
• guidelines for how the strength of the authentication can be measured; and
• the basis for a set of entity authentication assurance measures that are general and applicable to the entire life cycle of an identity, including a wide range of authentication mechanisms.

This project is to take into account the following requirements:

• authentication metrics
• authentication mechanisms
• authentication protocols
• characteristics of the device used to authenticate
• location of the individual being authenticated
• communications paths
• relative ease of authentication manipulation by malicious behaviour
• corrections and modification of errors
• identifier types
• identity proofing
• privacy

This project is in the "Working Draft" stage.

3.2 Privacy

3.2.1 A Privacy Framework (29100)

This project aims at providing a framework for defining privacy safeguarding requirements as they relate to PII (Personally Identifiable Information) processed by any information and communication system in any jurisdiction. The framework is to be applicable on an international level and addresses system-specific issues at a high level. It is general in nature and puts organisational, technical, procedural and regulatory aspects in perspective. It is the purpose of this international standard to provide guidance concerning information and communication system requirements for processing PII by setting a common privacy terminology, defining privacy principles when processing PII, categorising privacy features and relating all described information privacy aspects to existing security guidelines. The framework can serve as a basis for desirable additional privacy standardisation initiatives, for example for a technical reference architecture, for the implementation and use of specific privacy technologies, for an overall privacy management, for the assurance of privacy compliance for outsourced data processes, for privacy impact assessments or for specific engineering specifications.

The Privacy Framework is being developed for those individuals with an interest in the standardisation of privacy safeguarding controls as they relate to PII processed by enterprise ICT systems. This may include individuals involved in specifying, procuring, architecting, designing, developing, testing, administering and operating ICT systems. Recognising the growing need to incorporate privacy requirements and privacy safeguarding controls in system development life cycles or, more specifically, in security development life cycles, this International Standard addresses the target audience of ICT system developers in a separate section, providing a framework and guidelines for an approach to building privacy-enhancing functionality into systems already during development.

This project is in the "Working Draft" stage.

3.2.2 A Privacy Reference Architecture (29101)

This project aims at providing a privacy reference architecture model that describes best practices for a consistent, technical implementation of privacy safeguarding requirements as they relate to the processing of personally identifiable information in information and communication systems. It is to cover the various stages in data life cycle management and the required privacy functionalities for PII in each data life cycle, as well as positioning the roles and responsibilities of all involved parties. The privacy reference architecture aims at presenting a best practice, privacy-enhanced architecture model and provides guidance for planning and building system architectures that facilitate the proper handling of PII across system platforms. It sets out the necessary prerequisites to allow the categorisation of data and control over specific sets of data within various data life cycles. It is the purpose of this project to provide guidance concerning a consistent and effective technical implementation of privacy safeguarding requirements within information and communication systems. Therefore it establishes a privacy reference architecture that enables system architects to build necessary privacy safeguarding measures into the system in a cohesive way across system platforms and to combine them with existing security measures, all to improve the proper handling of PII overall. Additionally, the privacy reference architecture gives best practices in advancing the use of privacy-enhancing technologies.

Interested parties that would benefit from using the concepts of the privacy reference architecture include representatives from organisations designing, developing, implementing, and operating information and communication systems. Most likely, these are representatives from various IT organisation departments such as from development, support, and operations or business units or quality assurance and data protection units that have a specific interest in applying consistent architectural decisions to accomplish compliance with specific privacy requirements, rules and regulations.

This project is in the "Working Draft" stage.

Chapter 4 Policy and rule languages

This chapter provides an initial, descriptive review of a number of policy and rule languages, both established standards and research languages. This review will be further refined as the PrimeLife project's research work on policy languages proceeds. Based on that review, and on the project's research results, suitable avenues for standardisation of such results will be proposed.

4.1 Extensible Access Control Markup Language (XACML)

XACML is a general-purpose access control policy language. It provides an XML syntax both for a policy language and an access control decision language.

The policy language allows for descriptions of general access control requirements, while the decision language enables users to create queries about whether a given action on a certain resource should be allowed or not, and to interpret the result. The response always consists of one of the following answers: Permit, Deny, Indeterminate (an error occurred or a decision cannot be made because some required information is missing) or Not Applicable (the request cannot be answered by this service because no applicable policy has been found).

4.1.1 An XACML scenario

In a typical XACML scenario, a user wants to perform some action on a resource. In this context, a resource is anything to which access can be controlled, such as an XQuery module or a Java method; the requested action may be, respectively, a query execution or a method invocation.

The user issues a request to the device protecting the resource (e.g. a filesystem or a Web server), which is called a Policy Enforcement Point (PEP). The PEP creates a request which consists of four collections of attributes, describing the Subjects (or users) making the request, the Resource being accessed, the Action to be performed on the resource, and the Environment, comprising additional information related to the request but not specifically linked to the previous three entities. The Environment attribute collection is optional.

The PEP sends this request to a Policy Decision Point (PDP), which processes it by looking for some policy that applies to it.

An XACML Policy consists of a single access control policy in the form of a collection of rules. A policy specifies the conditions under which access to the requested resource is to be allowed or denied. Each policy, and each rule within the policy, contains a set of applicability predicates used to determine whether the policy (or the rule) applies to a given decision request.

The applicability predicates are organised in an item called Target. A Target is a set of conditions on the Subject, the Resource, the Action, and the Environment that must be satisfied for a policy or rule to apply to a given request. Boolean functions are used to compare values found in a request with those included in the Target. If all the conditions of a Target are met, then the relevant policy or rule applies to the request. Target information is not only useful for checking applicability; it also provides a policy indexing service, in that policies may be searched by a PDP on the basis of their Target constraints. If a policy has no Target, it is always applicable. If a particular type is missing (for example, there are no Subjects), the policy applies to all instances of that type (i.e. to all subjects). If there is more than one group of predicates of a given type, all predicates in at least one group for each type must be true.
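The matching scheme just described (for each attribute type, at least one group of predicates must hold in full) can be sketched in a few lines of Python. The dictionary-based representation of targets and requests is our own simplification, not XACML syntax:

```python
# Sketch of XACML Target applicability: for each attribute type (Subjects,
# Resources, Actions, Environments) the target holds groups of predicates.
# The target matches when, for every type present in the target, at least
# one group has all of its predicates true. An absent type always matches.

def target_matches(target, request):
    for attr_type, groups in target.items():      # e.g. "Subjects": [...]
        attrs = request.get(attr_type, {})
        if not any(all(pred(attrs) for pred in group) for group in groups):
            return False
    return True

# Hypothetical example: a target matching subjects in the developers group.
target = {"Subjects": [[lambda a: a.get("group") == "developers"]]}
request = {"Subjects": {"group": "developers"}}
# target_matches(target, request) -> True
```

An empty target (no types at all) matches every request, mirroring the "no Target means always applicable" rule above.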

A Target element may be followed by at most one Condition, comprising a single predicate or a Boolean combination of predicates. XACML does not make any distinction among attributes in specifying whether they should appear in the Target or in the Condition predicates. If the Condition is missing, it is treated as implicitly true. If either the Target predicates or the Condition predicates evaluate to false, the policy or rule is considered not applicable to the request, and a Not Applicable value is returned. When a rule applies to a request, the value of its Effect attribute (Permit or Deny) is returned as the result of the evaluation. As a policy typically contains multiple rules, XACML specifies a set of Combining Algorithms to compose possibly contradictory results into a unique response. For instance, when a policy relies on the Deny Overrides algorithm, a single Deny evaluation makes the final combined result Deny. Conversely, with a Permit Overrides algorithm a single Permit sub-result is enough to yield a Permit final result.
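As an illustration, the behaviour of the two combining algorithms mentioned above can be sketched as follows. This is a deliberate simplification: real XACML combining algorithms also track the provenance of Indeterminate results and collect obligations.

```python
# Simplified sketch of two XACML rule-combining algorithms over the
# per-rule results (strings as in the XACML decision vocabulary).

def deny_overrides(results):
    """A single Deny makes the combined result Deny."""
    if "Deny" in results:
        return "Deny"
    if "Indeterminate" in results:
        return "Indeterminate"
    if "Permit" in results:
        return "Permit"
    return "NotApplicable"

def permit_overrides(results):
    """A single Permit makes the combined result Permit."""
    if "Permit" in results:
        return "Permit"
    if "Indeterminate" in results:
        return "Indeterminate"
    if "Deny" in results:
        return "Deny"
    return "NotApplicable"
```

For example, deny_overrides(["Permit", "Deny"]) yields "Deny", while permit_overrides on the same inputs yields "Permit".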

The PDP is thus able to produce an answer on whether access should be granted (that is, a decision is taken). This answer is returned to the PEP, which can then allow or deny access to the requester (that is, the decision is enforced).

PEP and PDP are here treated as two logically distinct units, but they might both be included in a single application, or be distributed across several servers.

Here follows a simplified example that illustrates the concepts above.

4.1.2 An example of XACML policy

A Request element generated to meet the request from Seth of the developers' group to read a document on the server would look like this:

    <Request>
      <Subject>
        <Attribute AttributeId="urn:oasis:names:tc:xacml:1.0:subject:subject-id"
                   DataType="urn:oasis:names:tc:xacml:1.0:data-type:rfc822Name">
          <AttributeValue>[email protected]</AttributeValue>
        </Attribute>
        <Attribute AttributeId="group"
                   DataType="http://www.w3.org/2001/XMLSchema#string">
          <AttributeValue>developers</AttributeValue>
        </Attribute>
      </Subject>
      <Resource>
        <Attribute AttributeId="urn:oasis:names:tc:xacml:1.0:resource:resource-id"
                   DataType="http://www.w3.org/2001/XMLSchema#anyURI">
          <AttributeValue>http://server.example.com/docs/guide.html</AttributeValue>
        </Attribute>
      </Resource>
      <Action>
        <Attribute AttributeId="urn:oasis:names:tc:xacml:1.0:action:action-id"
                   DataType="http://www.w3.org/2001/XMLSchema#string">
          <AttributeValue>read</AttributeValue>
        </Attribute>
      </Action>
    </Request>

A Policy that applies to such request follows:

    <Policy PolicyId="ExamplePolicy"
            RuleCombiningAlgId="urn:oasis:names:tc:xacml:1.0:rule-combining-algorithm:permit-overrides">
      <Target>
        <Resources>
          <Resource>
            <ResourceMatch MatchId="urn:oasis:names:tc:xacml:1.0:function:anyURI-equal">
              <AttributeValue DataType="http://www.w3.org/2001/XMLSchema#anyURI">
                http://server.example.com/docs/guide.html</AttributeValue>
              <ResourceAttributeDesignator
                  DataType="http://www.w3.org/2001/XMLSchema#anyURI"
                  AttributeId="urn:oasis:names:tc:xacml:1.0:resource:resource-id"/>
            </ResourceMatch>
          </Resource>
        </Resources>
      </Target>
      <Rule RuleId="ReadRule" Effect="Permit">
        <Target>
          <Actions>
            <Action>
              <ActionMatch MatchId="urn:oasis:names:tc:xacml:1.0:function:string-equal">
                <AttributeValue DataType="http://www.w3.org/2001/XMLSchema#string">read</AttributeValue>
                <ActionAttributeDesignator
                    DataType="http://www.w3.org/2001/XMLSchema#string"
                    AttributeId="urn:oasis:names:tc:xacml:1.0:action:action-id"/>
              </ActionMatch>
            </Action>
          </Actions>
        </Target>
        <Condition FunctionId="urn:oasis:names:tc:xacml:1.0:function:string-equal">
          <Apply FunctionId="urn:oasis:names:tc:xacml:1.0:function:string-one-and-only">
            <SubjectAttributeDesignator
                DataType="http://www.w3.org/2001/XMLSchema#string"
                AttributeId="group"/>
          </Apply>
          <AttributeValue DataType="http://www.w3.org/2001/XMLSchema#string">developers</AttributeValue>
        </Condition>
      </Rule>
    </Policy>

This policy, called ExamplePolicy, targets all requests for actions on the resource whose URI is http://server.example.com/docs/guide.html. It comprises a single rule (ReadRule) targeting all requests for actions whose action-id attribute is the string read. If the requesting subject satisfies the condition of having a group attribute whose value is the string developers, then the Permit effect is returned as the result of the evaluation of the rule, and hence of the policy. The PDP then sends the PEP the following Response item:

    <Response>
      <Result>
        <Decision>Permit</Decision>
        <Status>
          <StatusCode Value="urn:oasis:names:tc:xacml:1.0:status:ok"/>
        </Status>
      </Result>
    </Response>

XACML does not provide any description of the vocabularies used in the policies, whose definition lies outside the scope of this specification. Policies must include all the information needed to identify and evaluate each attribute used in the policy.

4.1.3 Relations to other proposals and to the PrimeLife project

XACML has attracted some criticism regarding its low performance, because of the overhead (processing time and memory) of the XML format, and the lack of easy integration into existing entitlement engines. Still, detailed comparisons in the literature [TREPALXACML] show that XACML provides a more comprehensive access control policy language than its most significant competitor EPAL (see Section 4.5), as well as a fully-featured privacy policy language.

4.1.4 Current status of the XACML proposal

The latest XACML version, 2.0, was ratified by the OASIS standards organisation on 1 February 2005. Version 3.0 is currently in preparation.

There are a number of open source implementations of the XACML standard:

• Sun's implementation, version 1.2 (14/02/2006) [Sun XACML]
  ◦ Full support of XACML 2.0, no support for SAML
  ◦ Requires the Java 2 Platform, Standard Edition, version 1.4.0
  ◦ License: BSD License (copyright years 2003-2004)
• Enterprise-Java-XACML from Google Code, (beta) version 0.0.14 (08/02/2008) [Enterprise-Java-XACML]
  ◦ Not based on Sun Microsystems' implementation
  ◦ Full support of XACML 2.0, intended support of XACML 3.0
  ◦ License: Apache License 2.0
• SICSACML: XACML 3.0 [SICSACML]
  ◦ Based on Sun Microsystems' implementation
  ◦ Java implementation of the XACML 3.0 draft, released as a patch for Sun's implementation; it implements a PDP for XACML 3.0
  ◦ License: BSD License

Further information about the state of XACML is available from the OASIS XACML Technical Committee's home page [OASIS XACML].

4.2 The Rule Interchange Format (RIF)

The Rule Interchange Format is a family of XML languages being developed by the Rule Interchange Format (RIF) Working Group [RIF] at W3C to allow computer-processable rules to be transferred between rule systems. RIF is a work in progress, and what is described here is a vision for how RIF is expected to materialise over the coming months and years, with basic features completed first. All the features described here are subject to change as the Working Group proceeds.

4.2.1 RIF Dialects

The RIF family of languages (each called a "dialect") is organised to match the different kinds of technologies used to work with rules. It starts with a common language, RIF Core, which corresponds to the shared subset (the intersection) of major rule languages. Rules written in this XML language can be translated with relative ease to nearly all other major rule languages. Simple rules can be written in RIF Core, or translated to RIF Core, but many practical rule bases will only be transferable using some other dialect. Expressively, RIF Core is datalog (the language of positive Horn clauses without function terms), along with some primitive datatypes (such as integers and strings) and the common operations on those datatypes (such as addition and string concatenation). The syntax of RIF is designed to be quite general and straightforward to parse and generate, so it is naturally verbose. Part of an example rule about the perishability of items, from one of the RIF working drafts, is given here:

    ...
    <Var>item</Var>
    <Var>deliverydate</Var>
    <Var>scheduledate</Var>
    <Var>diffduration</Var>
    <Var>diffdays</Var>
    ...
    <Atom>
      <op><Const type="rif:iri">cpt:perishable</Const></op>
      <args><Var>item</Var></args>
    </Atom>
    ...

Above RIF Core, RIF splits into two main branches, "production rules" and "logic rules". Production rules take the form "if (some condition) becomes true then do (something)", or if-condition-then-action. This is the style of rule handled by the major business rules products. Logic rules take the form "if (some condition is true) then (some other condition must be true)", or if-condition-then-condition. This is the style of rule handled by pure Prolog, by first-order logic theorem provers, and by many systems with varying degrees of acceptance over the past 40+ years. Along the logic branch, the RIF Basic Logic Dialect (RIF BLD) provides a common subset language. It extends RIF Core by adding function terms and equality, along with argument naming and membership/subclass structures. It does not, however, include any form of negation, since logic languages diverge sharply around the types of negation they implement, primarily (monotonic) classical negation as opposed to some form of (non-monotonic) negation-as-failure. In the future, the Working Group expects to provide dialects aligned with these branches.
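The distinction between the two branches can be illustrated with a toy sketch (plain Python, not RIF): a production rule executes an action when its condition holds, while logic rules only derive new facts, here by naive forward chaining.

```python
# Toy illustration (not RIF syntax) of the two rule styles RIF targets.

def apply_production_rule(condition, action, state):
    """Production style: if <condition> holds, do <action> (a side effect)."""
    if condition(state):
        action(state)

def forward_chain(facts, logic_rules):
    """Logic style: derive facts until a fixpoint is reached.
    Each rule maps the current fact set to a set of derivable facts."""
    facts = set(facts)
    changed = True
    while changed:
        changed = False
        for rule in logic_rules:
            new = rule(facts) - facts
            if new:
                facts |= new
                changed = True
    return facts

# "if perishable(x) then refrigerated(x)" expressed as a logic rule:
rules = [lambda fs: {("refrigerated", x) for (p, x) in fs if p == "perishable"}]
# forward_chain({("perishable", "item1")}, rules) additionally derives
# the fact ("refrigerated", "item1").
```

The production-rule helper mutates state, whereas forward chaining is purely declarative; this mirrors the split between business-rule engines and logic systems described above.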

4.2.2 Use Cases

The use cases [RIF Use Cases] and applications for RIF cover much of the space of distributed information systems. Wherever there is information processing, the option of using a rule system (instead of imperative programs) arises, and in some application areas rule systems have become widely adopted. High-profile areas include credit scoring (Fair Isaac, the company behind FICO credit scores, is a major rule system vendor and a founding participant in the RIF Working Group), regulatory compliance in banking, and health care delivery. To date, most major rule systems have been closed-architecture, single-provider systems. With RIF, we begin to see the possibility of rules being developed on one system and then easily moved to another, allowing customers to avoid vendor lock-in, developing a stronger market, and encouraging more investment in long-term rule bases. Rule interoperability enables many more applications, though, when distributed systems can exchange rules. For example, vendors can publish complex, dynamic pricing structures (as rules), and customers can then (computationally) determine the most efficient purchases to initiate. Complex negotiations, with ensuing efficiencies, become possible all along the supply chain, as trading partners are able to selectively expose their business logic (their rule bases) and search for synergies. In the life sciences, rule interchange can be pivotal in both research and health care delivery. In research, data integration is an enormous problem because of the vast variety of medical research; effectively mining that data, to find the bits relevant to a particular task, is essential. Rule systems (and related Semantic Web technologies, like OWL) can greatly ease the integration effort. On the clinical side, rule systems can help physicians make diagnoses and orchestrate treatments (including detecting likely errors). Since many of these benefits increase with the scale of the market for rules (more users of rule bases, more providers of rule bases), a common interchange format should significantly improve the end benefits for research and for patients.

4.3 P3P

The Platform for Privacy Preferences Project [P3P] enables Web sites to express their privacy practices in a standard format that can be retrieved automatically and interpreted easily by user agents. P3P user agents will allow users to be informed of site practices (in both machine- and human-readable formats) and to automate decision-making based on these practices when appropriate. Thus users need not read the privacy policies at every site they visit.

Although P3P provides a technical mechanism for ensuring that users can be informed about privacy policies before they release personal information, it does not provide a technical mechanism for making sure sites act according to their policies. Products implementing P3P may provide some assistance in that regard, but this was left to specific implementations. However, P3P provides the basis for tools capable of alerting and advising the user. Wherever notices are required by laws or self-regulatory programmes, P3P can provide a very user-friendly way to deliver them. In addition, P3P does not include mechanisms for transferring data or for securing personal data in transit or storage, but it can be used by tools that decide on data flow.

P3P has three components:

• a protocol: the P3P protocol allows user agents to discover privacy metadata about a given Web site, either via a well-known location, an HTTP header, or a link tag inside the head element of an HTML page.
• a vocabulary: the P3P vocabulary makes it possible to express data handling practices, such as the identification of the party making the statement, the retention period, secondary uses, disclosure to third parties, dispute resolution mechanisms, and more.
• a data schema language: the base data schema is an internationalised schema for expressing personal data. It is used to express the object of the statement element. This allows a level of detail that can go down to one policy per data item, e.g. to treat the given name differently from the family name.
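As a sketch of the protocol component, a user agent looking for a site's policy reference file under the well-known location could proceed as follows. The helper name is our own; real agents also inspect the P3P HTTP response header and HTML link tags.

```python
# Sketch of P3P policy-reference discovery via the well-known location
# convention (/w3c/p3p.xml at the site root).
from urllib.parse import urljoin

def policy_reference_url(site_url):
    """Return the well-known location of a site's P3P policy reference file."""
    return urljoin(site_url, "/w3c/p3p.xml")

# policy_reference_url("http://www.example.com/docs/page.html")
#   -> "http://www.example.com/w3c/p3p.xml"
```

A complete agent would fetch this file, follow the policy references it contains, and then match the retrieved policy against the user's preferences.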

4.3.1 Status

The P3P 1.0 Specification [P3P 1.0 Spec] became a W3C Recommendation on 16 April 2002, after 5 years of intense development with numerous obstacles. The initial idea of a wallet service and a negotiation protocol [Removing Data Transfer from P3P] proved to be too ambitious, and P3P was toned down to a pure policy language able to express data collection and usage practices in a machine-readable syntax.

The first, rather rudimentary, implementation was Microsoft's Internet Explorer 6.0. It only analysed the HTTP headers containing the tokens called a Compact Policy [Compact Policy] in the P3P Specification. Based on the privacy practices expressed in the tokens, cookies were blocked or allowed. Many Web sites reacted quickly and installed P3P policies. Privacy Bird [Privacy Bird] is a plug-in that matches P3P policies against user preferences and has a good reporting tool. Whenever there is a mismatch, the bird turns an angry red, and the policy report shows exactly where the mismatch between the user's preferences and the site's policy lies. The most complete tool implemented was the P3P implementation [JRC P3P Resource Centre] of the JRC [JRC], a Java-based proxy implementation, but its development has stopped.

Experience with P3P and the resulting feedback triggered two further workshops: one on incremental improvements [Future of P3P Workshop 2002] and another on the long-term vision [Future of P3P Workshop 2003]. The first workshop triggered further work on P3P 1.1 [P3P 1.1 Spec], which added user agent guidelines and improved the protocol by allowing sites to identify related sites. P3P 1.0 did not say anything about the user agent. After testing, it appeared that people wanted a standard answer to a standard situation; the user agent guidelines reflected this. Even today, the challenge of an efficient privacy dashboard remains open, let alone its integration into Web browsers.

But in the aftermath of 9/11, governments started to increase data collection massively. The privacy debate shifted back to where it came from, namely privacy as a right against the government. This debate took nearly all interest away from privacy in private services, and development slowed down. P3P 1.1 [P3P 1.1 Spec] was published as a Working Group Note.

4.3.2 Conclusion

P3P work remains very important for PrimeLife, as it was the first attempt to express data handling in a structured, machine-readable way. It shows the way forward and resulted not only in many Web site implementations, but also in a flood of research publications that tried to further the approach taken by P3P. Although it was not addressed in the P3P Specification, the use of policies for data governance was always one aspect of the work, and IBM realised this approach with the Tivoli Privacy Manager, which handles data in the backend using a P3P engine. Today, the level of P3P implementation on Web servers remains high.

PrimeLife can take advantage of the existing large scale implementation of P3P on Web sites to acquire metadata and draw conclusions. Many of the P3P challenges are still unresolved and can be further advanced by the research done in PrimeLife.

4.4 APPEL

APPEL [APPEL] specifies a language for describing collections of preferences regarding P3P policies, to be exchanged between P3P agents. Using this language, a user can express preferences in a set of preference rules (called a ruleset), which can then be used by the user agent to make automated or semi-automated decisions regarding the acceptability of machine-readable privacy policies from P3P-enabled Web sites.
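The intended evaluation model (an ordered list of rules, with the first matching rule determining the behaviour) can be sketched as follows. The rule representation is a simplification of ours, not APPEL syntax:

```python
# Sketch of APPEL-style ruleset evaluation: rules are ordered
# (behaviour, matcher) pairs; the first rule whose matcher accepts the
# site's policy determines the agent's behaviour.

def evaluate_ruleset(rules, policy):
    for behaviour, matches in rules:
        if matches(policy):
            return behaviour
    return "block"   # rulesets normally end with a catch-all rule anyway

# Hypothetical preferences: accept (after prompting) policies that do not
# mention telemarketing among their purposes, block everything else.
rules = [
    ("request", lambda p: "telemarketing" not in p["purposes"]),
    ("block",   lambda p: True),
]
# evaluate_ruleset(rules, {"purposes": ["admin"]}) -> "request"
```

The order-dependence visible here is also the source of the unpredictability discussed below: the outcome hinges on how the matcher inspects the policy's syntax, not its meaning.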

At some point, the P3P Specification Working Group considered that a common interchange language for specifying user preferences, understood by all P3P implementations, was a precondition for user acceptance and adoption of this technology. Several other efforts have addressed a similar problem for other communities, for example PICS Rules and Profiles 0.94. An interchangeable format for preference rules would allow data protection professionals to disseminate minimum guidelines and default privacy protection levels to users who have neither the time nor the knowledge to create them themselves.

4.4.1 Shortcomings

In matching P3P policies, there is a huge range of options which an agent can look for. APPEL gives a suggested specification for a matching algorithm and an interchangeable XML rule format, which is in fact the only existing interoperable format for preference files. However, implementing a user interface for the full range of possibilities within APPEL results in an extremely complex interface. Only one such utility was ever built, as part of the JRC P3P project [JRC P3P Resource Centre], and its interface proved too complex even for experts.

Additionally, APPEL showed some unpredictable behaviour: a P3P policy could be written in two different ways with the same meaning, yet an APPEL rule could evaluate the two versions differently, leading to unexpected results. This was due to the fact that APPEL was the first attempt to construct a rule language based on RDF technologies. There were suggestions to improve the user interface by moving P3P to a formalised ontology, but this was not taken up by the P3P Specification Working Group due to a lack of support from the community and the implementers. In the end, APPEL was published as a Working Group Note on 15 April 2002. It is used in most P3P implementations (e.g. Privacy Bird [Privacy Bird]) as an import format and subsequently translated into the internal format of the application.

The world moved on. There was a lot of interest in APPEL and in rules in general as part of Semantic Web [Semantic Web] technology. The engineers realised that they needed a new, clean approach that is not tied to privacy only. After long scientific discussions, W3C chartered the Rule Interchange Format (RIF) Working Group [RIF], which is still running. A privacy ontology was developed by the PRIME project [PRIME]. RIF is only a framework for rule sets. There is the option for PrimeLife to combine the privacy ontology developed by PRIME with the framework set by RIF in order to develop a new RIF-compliant rule language. The PRIME obligation language is an obvious candidate from which to learn and to refine the approach to rules.

4.4.2 Conclusion

APPEL is a very important historical step towards rule languages, but it was slightly premature. APPEL helps in understanding the fundamental issues concerning semantics and syntax raised by a privacy preference language. But there is no point in taking up APPEL as is, as there are now new languages and technologies that allow a much cleaner approach to privacy rules.

4.5 Enterprise Privacy Authorisation Language (EPAL)

Enterprise Privacy Authorisation Language [EPAL] is a framework for managing collected personal data that aims at enabling enterprises to formalise and enforce privacy practices.

Privacy practice prescriptions are embodied in the form of a policy.

4.5.1 Structure of an EPAL policy

An EPAL policy is given in the form of an XML data structure and includes three main sections.

• The Policy Information identifies the policy, providing information about the Issuer, the Version Number, the Start Date, the End Date, the Replacement Policy Name, and the Replacement Policy Version.
• A Vocabulary Reference provides a pointer, in the form of a URI, to an EPAL vocabulary that describes all the components that can be used in the rules of the following section. These components deal with the entities typically involved in a transaction: Data Users, Data Categories, Actions, Purposes, Conditions, Obligations, and Context Models.
• The Ruling Set includes the rules that define whether a Data User is allowed or denied to perform an Action on a Data Category for a certain Purpose under specific Conditions. The rules are ordered with descending precedence, that is, if a rule applies (i.e. allows or denies a request), subsequent rules are ignored. To be applicable to a request, a rule must include the same elements (such as Data Users, Actions, ...) as the request, and all of its Conditions must be met.

Every EPAL policy is characterised by an optional Global Condition and a mandatory Default Ruling. The Default Ruling ("allow", "deny", or "not-applicable") is returned as the result of the policy evaluation if the Global Condition is false or no rule within the policy is applicable.
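The evaluation semantics just described (rules in descending precedence, first applicable rule wins, default ruling otherwise) can be sketched as follows, using an assumed dictionary representation rather than EPAL XML:

```python
# Sketch of EPAL ruling-set evaluation: the first applicable rule decides,
# and the default ruling applies when the global condition fails or no
# rule matches. Field names are our own representation, not EPAL syntax.

def evaluate(policy, request):
    if not policy["global_condition"](request):
        return policy["default_ruling"]
    for rule in policy["rules"]:          # ordered by descending precedence
        if rule["applies"](request):
            return rule["ruling"]          # "allow" or "deny"
    return policy["default_ruling"]

# Hypothetical policy in the spirit of the COPPA example below: collection
# of a child's data is allowed only when parental consent is recorded.
policy = {
    "global_condition": lambda r: True,
    "default_ruling": "deny",
    "rules": [
        {"applies": lambda r: r["action"] == "collect"
                              and r.get("parent-consent"),
         "ruling": "allow"},
    ],
}
# evaluate(policy, {"action": "collect", "parent-consent": True}) -> "allow"
```

Without consent the catch-all default ruling "deny" is returned, which mirrors EPAL's descending-precedence semantics.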

4.5.2 An EPAL policy example

A simplified example of an EPAL policy follows:

    <epal-policy ...>
      <policy-information>
        <issuer>
          <name>Guenter...</name>
          <organization>IBM Research</organization>
          ...
        </issuer>
        ...
      </policy-information>
      <condition id="is-child">
        <short-description>This condition is true if the data-subject is a
          child according to COPPA (i.e., age &lt;= 13).</short-description>
        ... <value>13</value> ...
      </condition>
      <rule id="..." ruling="allow">
        <short-description>Parent consent for collection.</short-description>
        ... <value>true</value> ...
      </rule>
      ...
    </epal-policy>

The policy above allows children's data to be collected and stored provided parental consent has been given. All elements whose definition is not provided in this epal-policy element are defined in vocabularies referenced in the policy itself.

4.5.3 An EPAL query example

A typical EPAL authorisation query looks as follows; this is an example to which the policy above applies.

    <epal-query>
      ...
      <container refid="...">
        <attribute refid="...">
          <value>0123456789</value>
        </attribute>
      </container>
      ...
    </epal-query>

The refid attribute of the Container element refers to the container of the policy that is instantiated for the authorisation evaluation. Moreover, the query contains one or more attribute elements with the actual attribute values to be used to evaluate the relevant conditions. Some policies, via an Obligation element, may also state that when a certain access is allowed, some specific additional steps must be taken by the requestor.

4.5.4 A typical EPAL scenario

In a typical EPAL scenario, a customer of the enterprise views the privacy policy statement (specified e.g. with P3P, see Section 4.3), accepts it, and sends in personally identifiable information. The consent and the relevant policy are logged, and the privacy management enforcement monitors ensure that the enterprise's employees are only allowed data accesses that conform to the privacy policy.

4.5.5 Relations to other standards and to the PrimeLife project

As illustrated above, EPAL offers the possibility of integration with the P3P standard, which can be used to receive privacy policy preferences from end users that are then stored and enforced in an enterprise's IT system relying on EPAL. However, this approach should not be taken as paradigmatic: the EPAL policy language has greater expressiveness, so with only a P3P-based end user interface EPAL would not be exploited to its full potential.

An important contribution of this proposal is its focus on the issue of enforcement, through the PEP abstraction. This is a significant step on a critical path that PrimeLife must also deal with.

4.5.6 Status of the EPAL proposal

EPAL was submitted to W3C in 2003 for consideration as a privacy policy language standard.

4.6 CARML

CARML (Client Attribute Requirements Markup Language) is a specification language that aims at defining application identity requirements, that is, what identity information an application needs and how the application will use it.

A CARML document is an XML document that enables applications working on identity-related data to declare their data requirements and intended usage of personally identifiable information.

The CARML specification provides a standard way for a client application to request data from a service provider, but it does not guarantee that the client application will receive what is requested.

It is assumed that applications only require a fixed set of interactions dealing with identity data, as follows:

• information about a user is looked up by means of one or more indexes, which typically consist of a subject name derived from an authentication process, or an attribute (e.g. a social security number)
• tests are performed to check whether a subject has a property (with unknown value), a property with a value that will be known at runtime, or a property with a fixed value
• the values of a sequence of named properties are retrieved
• attributes in the form of name/value pairs associated with a subject can be modified.
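The four interaction kinds can be summarised as a toy client interface; all class and method names here are our own illustration against an in-memory store, not CARML vocabulary:

```python
# Toy interface mirroring the four identity-data interactions CARML
# assumes, against an in-memory attribute store (index -> {attr: value}).

class IdentityService:
    def __init__(self, store):
        self.store = store

    def lookup(self, index):                     # 1. look up by an index
        return self.store.get(index)

    def test(self, index, name, value=None):     # 2. property tests
        attrs = self.store.get(index, {})
        return name in attrs if value is None else attrs.get(name) == value

    def get(self, index, names):                 # 3. retrieve named properties
        attrs = self.store.get(index, {})
        return [attrs.get(n) for n in names]

    def set(self, index, name, value):           # 4. modify an attribute
        self.store.setdefault(index, {})[name] = value

svc = IdentityService({"alice@example.com": {"Country": "US"}})
# svc.test("alice@example.com", "Country", "US") -> True
```

A CARML document declares which of these interactions an application will perform and over which attributes, so that the identity service can be provisioned and audited accordingly.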

CARML allows for the definition of a localised data dictionary for applications, but it is mostly expected that developer tools using CARML would promote the usage of schemata and dictionaries already specified by an enterprise.

4.6.1 Elements of the CARML language

CARML relies on the SAML syntax as described in SAML V2.0 [Semantic Web] for several of its elements.

For subject indexes, a SubjectIndexes element is introduced, including one IndexNameIdentifier element or one or more IndexAttribute elements. In the following example, the client declares that a pair of values (e-mail address and Country) will be used as indexes: the client will provide an e-mail address, and the service provider is to assume that a static attribute Country="US" is to be used:

    <SubjectIndexes>
      <IndexNameIdentifier ...>
        urn:oasis:names:tc:SAML:1.1:nameid-format:emailAddress
      </IndexNameIdentifier>
      <IndexAttribute ...>US</IndexAttribute>
    </SubjectIndexes>

When an application needs to send boolean questions to a service provider, those questions are specified in the form of Property elements. Property elements may include a description that aids the identity service managers in defining the appropriate values. The following example includes a sequence of properties: the first asks whether the user has the property AboveEighteen, the second states that an EmploymentLevel will be provided at runtime, and the third that the application wishes to check whether the user has a Department property set to the value Information Technology.

    <Property ...>AboveEighteen</Property>
    <Property ...>EmploymentLevel</Property>
    <Property ...>Department ... Information Technology</Property>


Finally, CARML defines an Attribute element that lists the attributes (specified on the basis of the SAML attribute schema) the client application is requesting to be returned with the subject by the service provider, like in the following example:

For each attribute-related request, the reply may include no values, one or more values, or an exception indicating unauthorised requests or filtering due to policy conditions. Attribute declarations can include a Modifiable permission flag to enable users to modify the relevant values, as in the following example, in which Language is modifiable while Country is read-only:

    <Attribute ... Modifiable="true">Language</Attribute>
    <Attribute ... Modifiable="false">Country</Attribute>

SubjectIndexes, Property, and Attribute elements are included in a NamedInteraction element. An IRData element contains one or more NamedInteraction elements.

4.6.2 Status of the CARML proposal

The latest working draft on CARML was published in 2006.

4.6.3 AAPML

AAPML (Attribute Authority Policy Markup Language) is an XACML (see Section 4.1) profile which aims at enabling owners of identity-related data to specify, in the form of policies, the conditions under which information in their control may be used by other applications.

4.7 Identity Governance Framework

The Identity Governance Framework [IGF] is an open initiative which aims at tackling the issues related to the management of identity-related information across enterprise IT systems. This initiative includes proposals of specifications for a common framework to define usage policies (AAPML Specification [AAPML Spec]), attribute requirements (CARML Specification [CARML Spec]), and the relevant developer APIs. These proposals are meant to enable businesses to guarantee documentation, control, and auditing with respect to the acquisition, use, storage, and propagation of identity-related data through applications and systems.

Oracle announced the initiative together with the founding participants (Computer Associates, Layer 7 Technologies, HP, Novell, Ping Identity, Securent, Sun Microsystems) in November 2006. The initiative was submitted royalty-free in February 2007 to the Liberty Alliance consortium, which aims at building a more trusted Internet by addressing the technology, business, and privacy aspects of digital identity management, and whose Management Board includes representatives from AOL, Ericsson, Fidelity Investments, France Telecom, HP, Intel, Novell, NTT, Oracle, and Sun Microsystems.

4.8 PRIME Policy Languages

The PRIME project is a large-scale research effort aimed at developing an identity management system able to protect users' personal information and to provide a framework that can be smoothly integrated with current architectures and online services. In this context, an important service for helping users to keep control over their personal information is access control enriched with the ability to support privacy requirements. To fully address the requirements posed by a privacy-aware access control system, the following types of privacy policies have been defined in the context of PRIME.

• Access control policies. They govern access to and release of data/services managed by the party (as in traditional access control). Access control policies define authorisation rules concerning access to data/services; authorisations correspond to the traditional (positive) rules usually enforced in access control systems. An access control rule is an expression of the form:
  <subject> with [<subject-expression>] can <actions> on <object> with [<object-expression>] for <purposes> if [<conditions>]
• Release policies. They govern the release of properties/credentials/personally identifiable information (PII) of the party and specify under which conditions they can be released. Release policies define the party's preferences regarding the release of its PII by specifying to which party, for which purpose/action, and under which conditions a particular set of PII can be released. Although different in semantics, access control and release policies share the same syntax.
• Data handling policies. They define how personal information will be (or should be) dealt with at the receiving parties (e.g., information collected through an online service may be combined with information gathered by other services for commercial purposes). Users exploit these policies to define restrictions on secondary use of their personal information; in this way, users can manage the information also after its release. Data handling policies will be attached to the PII they protect and transferred as sticky policies to the counterparts. A DHP rule is an expression of the form:
  <recipients> can <actions> for <purposes> if [<conditions>] provided [<provisions>] follow [<obligations>]
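To make the access control rule form concrete, a rule of the above shape could be evaluated as sketched below; the dictionary fields and helper name are our own, not PRIME syntax:

```python
# Sketch of evaluating a PRIME-style access control rule of the form
# "<subject> with [...] can <actions> on <object> with [...]
#  for <purposes> if [<conditions>]".

def rule_permits(rule, request):
    return (rule["subject_expr"](request["subject"])      # with [subject-expr]
            and request["action"] in rule["actions"]      # can <actions>
            and rule["object_expr"](request["object"])    # on <object> with [...]
            and request["purpose"] in rule["purposes"]    # for <purposes>
            and all(c(request) for c in rule["conditions"]))  # if [<conditions>]

# Hypothetical rule: adults may read a medical record for treatment purposes.
rule = {
    "subject_expr": lambda s: s.get("age", 0) >= 18,
    "actions": {"read"},
    "object_expr": lambda o: o == "medical-record",
    "purposes": {"treatment"},
    "conditions": [],
}
req = {"subject": {"age": 30}, "action": "read",
       "object": "medical-record", "purpose": "treatment"}
# rule_permits(rule, req) -> True
```

Release policies could be evaluated with the same structure, with the party's own PII in the object position, which reflects the shared syntax noted above.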

A prototype providing functionalities for integrating the evaluation and enforcement of access control, release, and data handling policies has been developed in the context of the PRIME project.

4.8.1 Rough use cases

The reference scenario is a distributed infrastructure that includes three parties: i) users are human entities that request online services; ii) the service provider is the entity that provides online services to users and collects personal information before granting access to its services; iii) external parties are entities (e.g., business partners) with which the service provider may want to share or trade users' personal information. The functionalities offered by a service provider are defined by a set of objects/services. This scenario considers a user that needs to access a service. The user can be registered and characterised by a unique user identifier (user id, for short) or, when registration is not mandatory, characterised by a persistent user identifier (pseudonym). Three major use cases are listed in the following.

• E-commerce. A major factor in the evolution of the Web has been the widespread diffusion of e-commerce, that is, the ability to purchase, sell, and distribute goods and services to customers. A primary concern in the development of e-commerce has been to provide a secure global infrastructure through solutions for secure data exchange and systems for protecting e-services from unauthorised access. In recent years, however, the focus has shifted from the protection of server-side resources to the protection of user privacy. If users do not have confidence that their private data are managed in a privacy-respecting way by the server, they will refuse to participate in e-commerce. In this scenario, it is mandatory to give users the possibility of protecting their privacy and their sensitive data while still accessing online services.

• Online healthcare system. Healthcare systems support interactions among patients, medical and emergency personnel, insurance companies, and pharmacies. These systems allow for anonymous access to general information and advice, and enforce access control to individual patient records according to general rules, context (e.g., treatment, emergency), and the patient's specific choices (e.g., primary care physician, health insurance). In this context, it is important to provide patients with enhanced privacy functionality for defining restrictions regulating the access to and management of their data.

• Location-Based Services (LBS). Technical improvements in location technologies make it possible to gather location information with high accuracy and reliability. The physical location of individuals is thus rapidly becoming available as a class of personal information that can be processed to provide a new wave of online and mobile services, such as location-based access control (LBAC) services. In addition to LBAC services, many mobile network providers offer a variety of location-based services such as point-of-interest proximity, friend-finder, or location information transfer in case of an accident (e.g., calls to the European emergency number 112). Such services naturally raise privacy concerns: users consider their physical location and movements highly privacy-sensitive, and demand solutions able to protect such information in a variety of environments.

4.8.2 Distinctive features of PRIME languages

Access Control/Release Model and Language

• XML-based syntax. The language provides an XML-based syntax for the definition of powerful and interoperable access control and release policies.

• Attribute-based restrictions. The language supports the definition of powerful and expressive policies based on properties (attributes) associated with subjects and objects.

• Credential definition and integration. The language supports requests for certified data, issued and signed by authorities trusted for making the statement, and uncertified data, signed by the owner itself.

• Anonymous credentials support. The language supports the definition of conditions that can be satisfied by means of zero-knowledge proofs.

• Support for context-based conditions and metadata. The language allows the definition of conditions based on the physical position of users and on context information, and integrates with metadata identifying and possibly describing entities of interest.

• Ontology integration. Policy definition is fully integrated with subject and object ontologies in defining access control restrictions. The language also takes advantage of integration with a credential ontology that represents relationships among attributes and credentials.

• Interchangeable policy format. Parties need to specify protection requirements on the data they make available using a format that is both human- and machine-readable, and easy to inspect and interchange.

• Interactive enforcement. Rather than providing a simple yes or no decision, policy evaluation provides a way of interactively applying criteria to retrieve the correct sensitive information, possibly managing complex user interactions such as the acceptance of written agreements and/or online payment.

• Variables support. Currently, the access control/release language supports two placeholders, one for the subject and one for the object. This solution represents a good trade-off between expressivity and simplicity, but can easily be extended to support the definition of variables.

Data Handling Model and Language

• Attribute-based restrictions and XML-based syntax. As for the access control/release language, the data handling language supports the definition of powerful and expressive XML-based policies based on properties associated with subjects and objects.

• Customised policies. Data handling policies are defined through a negotiation between the user and the service provider. When a user requests a service, predefined policy templates are provided by the service provider as a starting point for creating data handling policies. The templates are then customised to meet different privacy requirements: a user can customise the templates directly, or be supported by a customisation process that automatically applies some of the user's privacy preferences. If the customised data handling policies are accepted by the service provider, the personal information provided by the user is labeled with them. This represents the most flexible and balanced strategy for the definition of data handling policies.

• Stand-alone policies. Data handling policies are defined as independent rules. Personal data are then tagged with such data handling policies, which physically follow the data when they are released to an external party, thus building a chain of control originating from the data owner.

4.8.3 Relation to standards

XACML v2.0

XACML version 2.0 was ratified by the OASIS standards organisation on 1 February 2005. Similarly to the PRIME languages, XACML proposes an XML-based language allowing the specification of attribute-based restrictions. The main differences with respect to the PRIME languages are as follows.

• XACML does not explicitly support privacy features.

• Although XACML supports the exchange of digital credentials, it does not provide requests for certified credentials.

• XACML does not support or integrate location-based conditions and ontologies.

P3P/APPEL

P3P allows Web sites to declare their privacy practices in a standard, machine-readable XML format. Designed to address the need of users to assess whether the privacy practices adopted by a service provider comply with their privacy requirements, P3P has been developed by the World Wide Web Consortium (W3C). Users specify their privacy preferences through a policy language, called A P3P Preference Exchange Language (APPEL), and enforce privacy protection by means of an agent. Similarly to the PRIME languages, P3P proposes an XML-based language for regulating the secondary use of data disclosed for the purpose of access control enforcement. It provides restrictions on recipients, on data retention, and on purposes. The main differences with respect to the PRIME languages are as follows.

• P3P does not support the negotiation of privacy practices. In fact, users can only accept the server's privacy practices or stop the transaction. The opt-in/opt-out mechanisms result in limitations.

• P3P does not support the definition of policies based on attributes of the recipients.

• P3P does not provide protection against chains of releases (i.e., releases to third parties).

4.9 WS-Policy

WS-Policy defines a framework for expressing the capabilities and the requirements of entities in the form of policies in XML-based Web service systems.

A policy is defined as a collection of one or more policy assertions. These policy assertions may deal with lower-level communication properties, like which transport protocol is to be used, whether an authentication scheme should be maintained, as well as higher-level service characteristics, such as a privacy policy or Quality of Service (QoS). The semantics of individual assertions (ranging from simple strings to complex combinations of items with attributes) is domain-dependent and lies beyond the scope of the WS-Policy framework, which treats them as opaque. WS-Policy aims at providing constructs to indicate how a policy assertion or a set of assertions apply in a Web services environment.

4.9.1 Structure of a policy

A policy is built from individual assertions grouped by policy operators, which are specific XML tags with a wsp prefix referring to the namespace http://www.w3.org/ns/ws-policy of the WS-Policy specification [WS-Policy 1.5]. Two operators in the form of XML tags are used to make statements about policy combinations: wsp:ExactlyOne asserts that exactly one child node must be satisfied; wsp:All asserts that all child nodes must be satisfied.

Here is an example of a policy:
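A representative policy of this form (the sp security-policy assertions shown here are illustrative, modelled on the examples in the WS-Policy specification):

```xml
<wsp:Policy
    xmlns:wsp="http://www.w3.org/ns/ws-policy"
    xmlns:sp="http://docs.oasis-open.org/ws-sx/ws-securitypolicy/200702">
  <wsp:ExactlyOne>
    <!-- either one of the two algorithm-suite assertions satisfies the policy -->
    <sp:Basic256Rsa15/>
    <sp:TripleDesRsa15/>
  </wsp:ExactlyOne>
</wsp:Policy>
```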


The sp prefix corresponds to the namespace of the domain in which the assertions are defined (i.e., WS-SecurityPolicy). As different assertions appear within an ExactlyOne operator, they constitute different alternatives. A Web service invocation must then use one of the asserted algorithm suites to comply with, or support, the policy above. More generally, a policy assertion made by a service provider is supported by a service requester when the requester satisfies the requirement expressed by the assertion; a policy alternative is supported by a requester when all the assertions in the alternative are supported; finally, a policy is supported when the requester supports at least one of the alternatives in the policy.

The flexible syntax might lead to the formulation of rather complex policies.

4.9.2 Normal form of a policy

A normal form is defined to simplify the manipulation and clarify the understanding of policies. The structure of a normal form reflects the steps the basic policy processing operation is comprised of: all the alternatives are enumerated within a wsp:ExactlyOne operator, and each alternative enumerates in turn all the assertions in a wsp:All operator.

The policy above in a normal form would then be as follows:
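Under the same illustrative assertions as above, the normal form enumerates one wsp:All per alternative inside a single wsp:ExactlyOne:

```xml
<wsp:Policy
    xmlns:wsp="http://www.w3.org/ns/ws-policy"
    xmlns:sp="http://docs.oasis-open.org/ws-sx/ws-securitypolicy/200702">
  <wsp:ExactlyOne>
    <wsp:All>
      <sp:Basic256Rsa15/>
    </wsp:All>
    <wsp:All>
      <sp:TripleDesRsa15/>
    </wsp:All>
  </wsp:ExactlyOne>
</wsp:Policy>
```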

4.9.3 Compact form of a policy

As policies in a normal form can be very verbose, constructs are provided to express them in a compact form. If required by the policy processing software, the normal form can always be obtained by means of a recursive procedure which is illustrated in the WS-Policy specification.

Compact expression of policies may be achieved in different ways, such as optional assertions, policy assertion nestings, policy operator nestings, and policy inclusions.

The WS-Policy specification provides an optional attribute, wsp:Optional, to indicate that a policy assertion is optional. This is semantically equivalent to expressing two policy alternatives, one with and one without the assertion. Setting the value of this attribute to false is equivalent to omitting the attribute, and thus having a mandatory assertion.
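As a sketch (sp:Assertion stands in for an arbitrary domain assertion), the equivalence reads:

```xml
<wsp:Policy>
  <sp:Assertion wsp:Optional="true"/>
</wsp:Policy>

<!-- is equivalent to -->

<wsp:Policy>
  <wsp:ExactlyOne>
    <wsp:All>
      <sp:Assertion/>
    </wsp:All>
    <wsp:All/>  <!-- the empty alternative: assertion omitted -->
  </wsp:ExactlyOne>
</wsp:Policy>
```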

4.9.4 Nested policies

The WS-Policy syntax allows for an assertion to include a nested policy expression, as in the following schema:

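In outline (the element names are placeholders), an assertion carrying a nested policy expression has the shape:

```xml
<wsp:Policy>
  <sp:OuterAssertion>
    <!-- a nested policy expression qualifying the enclosing assertion -->
    <wsp:Policy>
      <sp:NestedAssertion/>
    </wsp:Policy>
  </sp:OuterAssertion>
</wsp:Policy>
```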

Policy assertions with nested policy expressions are normalised recursively.

Policies can be compactly expressed also by nesting operators wsp:Policy, wsp:All, and wsp:ExactlyOne within one another. Inference rules exploiting these operators' properties (e.g.: commutativity, associativity, idempotency, and distributivity) allow for the transformation of compact expressions into normal form.

For instance, wsp:All distributes over wsp:ExactlyOne: a wsp:All wrapping a wsp:ExactlyOne is equivalent to a wsp:ExactlyOne whose alternatives each wrap one of the original children in a wsp:All.
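With placeholder assertions sp:A and sp:B, the distribution can be sketched as:

```xml
<wsp:All>
  <wsp:ExactlyOne>
    <sp:A/>
    <sp:B/>
  </wsp:ExactlyOne>
</wsp:All>

<!-- is equivalent to -->

<wsp:ExactlyOne>
  <wsp:All>
    <sp:A/>
  </wsp:All>
  <wsp:All>
    <sp:B/>
  </wsp:All>
</wsp:ExactlyOne>
```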

4.9.5 References to other policies

The WS-Policy framework allows for the sharing of assertions across different policy expressions by means of the wsp:PolicyReference element. This element comes with a URI attribute that provides a reference to a policy expression. For policy expressions within the same XML document, the reference should point to an expression identified by an ID, while for external policy expressions there is no requirement that the reference be resolvable, as retrieval mechanisms are external to the WS-Policy specification. When a wsp:PolicyReference element refers to a wsp:Policy element, it prescribes replacing the wsp:PolicyReference element with a wsp:All element whose children are the same as the children of the referenced policy. In other words,

when processed, a reference element is substituted by the referenced policy, wrapped in a wsp:All element. Here follows an example with a reference within a document.
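A reconstruction of such an example; the wsu utility namespace and the "Protection" identifier come from the original listing, while the contained assertion is illustrative:

```xml
<wsp:Policy
    xmlns:wsu="http://docs.oasis-open.org/wss/2004/01/oasis-200401-wss-wssecurity-utility-1.0.xsd"
    wsu:Id="Protection">
  <sp:SignedParts>
    <sp:Body/>
  </sp:SignedParts>
</wsp:Policy>

<wsp:Policy>
  <!-- processed as if the assertions of the policy above appeared here, inside a wsp:All -->
  <wsp:PolicyReference URI="#Protection"/>
</wsp:Policy>
```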

The specification prescribes that a policy must not refer to itself, either directly or indirectly, by means of a wsp:PolicyReference element.

4.9.6 Intersection of policies

Generally, and more specifically also in the context of Web services, interactions can take place only when all involved parties agree on at least one policy alternative.

The WS-Policy specification allows for an intersection process that compares two policies looking for common, or mutually compatible, alternatives. As the semantics of the compared assertions lies beyond the scope of the domain-neutral WS-Policy framework, domain knowledge is clearly required to complete the intersection process.

Thus, the specification provides only an algorithm that approximates compatibility in a domain-independent fashion. The two policies are first normalised.

In a first comparison phase, each alternative from the first policy is compared to all alternatives from the second policy vocabulary-wise: to be compatible, two alternatives must at least share the same vocabulary, that is, they must include assertions of the same type. Two alternatives with different vocabularies are considered incompatible and discarded in this phase.

The intersection of two compatible alternatives is an alternative containing all the assertions from both intersected alternatives. If policy A contains an alternative that is compatible with an alternative from policy B, then the two policies are compatible, and their intersection is built as the set of the intersections between all pairs of compatible alternatives (choosing one alternative from each policy).

In the following example, policy P1 and policy P2 present only one pair of compatible alternatives (alternative A2 from P1 and alternative A3 from P2), whose assertions constitute the intersected policy:

Policy P1:

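An illustrative rendering of policy P1 (the /S:Envelope/S:Body XPath expressions appear in the original example; the assertion types around them are assumed):

```xml
<wsp:Policy>  <!-- Policy P1 -->
  <wsp:ExactlyOne>
    <wsp:All>  <!-- alternative A1 -->
      <sp:SignedParts>
        <sp:Body/>
      </sp:SignedParts>
    </wsp:All>
    <wsp:All>  <!-- alternative A2 -->
      <sp:SignedElements>
        <sp:XPath>/S:Envelope/S:Body</sp:XPath>
      </sp:SignedElements>
      <sp:EncryptedElements>
        <sp:XPath>/S:Envelope/S:Body</sp:XPath>
      </sp:EncryptedElements>
    </wsp:All>
  </wsp:ExactlyOne>
</wsp:Policy>
```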

Policy P2:

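Policy P2, rendered under the same assumptions; alternative A3 shares A2's vocabulary (the same assertion types), whereas A4 does not:

```xml
<wsp:Policy>  <!-- Policy P2 -->
  <wsp:ExactlyOne>
    <wsp:All>  <!-- alternative A3: same vocabulary as A2 -->
      <sp:SignedElements>
        <sp:XPath>/S:Envelope/S:Body</sp:XPath>
      </sp:SignedElements>
      <sp:EncryptedElements>
        <sp:XPath>/S:Envelope/S:Body</sp:XPath>
      </sp:EncryptedElements>
    </wsp:All>
    <wsp:All>  <!-- alternative A4: different vocabulary -->
      <sp:EncryptedParts>
        <sp:Body/>
      </sp:EncryptedParts>
    </wsp:All>
  </wsp:ExactlyOne>
</wsp:Policy>
```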

Intersection of policies P1 and P2:
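Under the same assumptions, the intersection consists of a single alternative collecting all the assertions of both A2 and A3:

```xml
<wsp:Policy>  <!-- intersection of P1 and P2 -->
  <wsp:ExactlyOne>
    <wsp:All>
      <!-- all assertions of A2 ... -->
      <sp:SignedElements>
        <sp:XPath>/S:Envelope/S:Body</sp:XPath>
      </sp:SignedElements>
      <sp:EncryptedElements>
        <sp:XPath>/S:Envelope/S:Body</sp:XPath>
      </sp:EncryptedElements>
      <!-- ... followed by all assertions of A3 -->
      <sp:SignedElements>
        <sp:XPath>/S:Envelope/S:Body</sp:XPath>
      </sp:SignedElements>
      <sp:EncryptedElements>
        <sp:XPath>/S:Envelope/S:Body</sp:XPath>
      </sp:EncryptedElements>
    </wsp:All>
  </wsp:ExactlyOne>
</wsp:Policy>
```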

The interpretation of the combined alternatives is beyond the scope of the specification, as it strongly depends on the assertions' semantics. The rationalisation of the new, intersected alternatives into something meaningful to the infrastructure requires domain knowledge. The new policy might require further processing, which might show that an alternative is contradictory and thus not meaningful to the underlying infrastructure responsible for performing the required behaviours. In such a case the invocation of the intersection function fails.

4.9.7 Relations to other proposals and to the PrimeLife project

WS-Policy provides a framework for expressing alternative sets of policy assertions from various domains, such as security and reliable messaging, that are supported by a service. Still, there are no WS-Policy assertions defined for authorisation, access control, or privacy policies. WS-XACML (see Section 4.10) is a proposal aiming at fulfilling this need.

WS-Policy offers an example of abstraction that separates domain-dependent assertions from a domain-independent framework for policy management, which can serve as a significant guideline for PrimeLife research.

4.9.8 Status of the WS-Policy proposal

WS-Policy became a W3C Recommendation in September 2007, standardised by the W3C Web Services Policy Working Group [WS-Policy WG]. The WS-Policy Primer [WS-Policy Primer] is available as a companion document.

4.10 OASIS WS-XACML

WS-XACML [WS-XACML Spec] is a Web service profile that defines a standard way to use XACML for authorisation, access control, and privacy policy in a Web Services environment. The document is currently an OASIS working draft.

The profile describes how client and service can mutually match their requirements and capabilities; WS-XACML defines a new XACML assertion to express them. Requirements are expectations that each side has of its communication peer, while capabilities define what each side can fulfil. Hence, there are four sets of assertions: capabilities and requirements for the client side and for the service side. WS-XACML offers a basic algorithm to match these four sets of policies: it matches the requirements of the client side against the capabilities of the service and, vice versa, the requirements of the service against the capabilities of the client.

The standard gives some very interesting examples which fit well with planned work in PrimeLife. The service could require a particular authorisation attribute; the client can then avoid trying to invoke the service if it does not possess this attribute. An extension of this example is a service that accepts only attribute claims signed by a trusted third party. The service could demand that the client act in a particular role; the client knows this in advance and can initiate the role switch before invoking the service. Of course, the client can also impose obligations on the service. The standard mentions an example policy demanding that the service not share the private information a client has to submit when invoking the service. Given that a client can choose between various instances of a particular service, it can decide which one is willing to fulfil its requirements best. In a more flexible scenario, one service could offer various resources coming with different privacy policies. Although WS-XACML does not offer a pattern for negotiation between client and service, it is easily imaginable to build such a negotiation on top of WS-XACML. WS-XACML is particularly useful in the scope of P3P privacy policies: it allows expressing P3P policy preferences and matching them using the new assertion for requirements and capabilities.

In summary, WS-XACML addresses an important practical issue of XACML. XACML itself defines just the assertions and how to combine them; the WS-XACML draft offers a profile describing how to use, compare, and match them on both sides of a service interaction. However, we currently see four shortcomings of WS-XACML: it is not yet an official standard, it lacks a negotiation protocol to balance requirements and capabilities, its matching algorithms are very basic, and, to the best of our knowledge, not a single implementation is available.

4.11 Security Policy Assertion Language (SecPAL)

SecPAL (Becker et al., 2006 [Becker et al.]) is a declarative security policy language developed to meet the access control requirements of large-scale Grid computing environments. It is a declarative, logic-based language that builds on a large body of work showing the value of such languages for flexibly expressing security policies. It was designed to be comprehensive, and provides a uniform mechanism for expressing trust relationships, authorisation policies, delegation policies, identity and attribute assertions, capability assertions, revocations, and audit requirements. This provides tangible benefits by making the system understandable and analysable. It also improves security assurance by avoiding, or at least severely curtailing, the need for semantic translation and reconciliation between disparate security technologies.

Chapter 5 Authentication Infrastructure

5.1 The ITU-T X.509 Standard

X.509, also known as ISO/IEC 9594-8, is an ITU-T recommendation that represents the normative standard for Public Key Infrastructures (PKI). It addresses some of the security requirements in the areas of authentication and other security services through the provision of a set of frameworks upon which full services can be based.

Specifically, this standard defines frameworks for:

• Public-key certificates • Attribute certificates • Authentication services

The public-key certificate framework defined in this standard includes a definition of the information objects for Public Key Infrastructure (PKI) and Certificate Revocation List (CRL).

The attribute certificate framework defines the information objects for Privilege Management Infrastructure (PMI) and Attribute Certificate Revocation Lists (ACRL). This specification provides, for instance: the framework for issuing, managing, using, and revoking certificates; an extensibility mechanism that defines formats for both certificate types and for all revocation list schemes; a set of standard extensions expected to be generally useful across a number of applications of PKI and PMI; and schema components, such as object classes, attribute types, and matching rules, for storing PKI and PMI objects in the directory. Beyond these frameworks, other elements of PKI and PMI are expected to be defined by other standards bodies (e.g., ISO TC 68, IETF), such as key and certificate management protocols, operational protocols, and additional certificate and CRL extensions.

The authentication scheme defined in this standard is generic and may be applied to a variety of applications and environments. The directory makes use of public-key and attribute certificates, and the framework for using these facilities is also defined in the standard. Public-key technology, including certificates, is used by the directory to enable strong authentication, signed and/or encrypted operations, and the storage of signed and/or encrypted data in the directory. Attribute certificates can be used by the directory to enable rule-based access control. Although the framework for these is provided in this specification, the full definition of the directory's use of these frameworks, and of the associated services provided by the directory and its components, is supplied in the complete set of directory specifications.

This standard, in the authentication services framework, also:

• specifies the form of authentication information held by the directory;

• describes how authentication information may be obtained from the directory;

• states the assumptions made about how authentication information is formed and placed in the directory;

• defines three ways in which applications may use this authentication information to perform authentication, and describes how other security services may be supported by authentication.

It describes two levels of authentication: simple authentication, using a password as a verification of claimed identity; and strong authentication, involving credentials formed using cryptographic techniques. While simple authentication offers some limited protection against unauthorised access, only strong authentication should be used as the basis for providing secure services. It is not intended to establish this as a general framework for authentication, but it can be of general use for applications which consider these techniques adequate.

5.1.1 X.509 Certificate and Certification Process

X.509 is part of the hierarchical X.500 standard and thus assumes a strict hierarchical system of certificate authorities (CAs) for issuing certificates. This is in contrast to Web of trust models, like PGP, where anyone (not just special CAs) may sign others' key certificates and thus attest to their validity. The X.500 standard defines how global directories should be structured. Directories are a way to organise a database, a sort of evolution of a phone book. X.500 directories are hierarchical, with different levels for each category of information, such as country, state, and city.

The X.500 standard was intended to provide a worldwide unique identifier structure. The X.500 system has never been fully implemented, so the IETF's Public-Key Infrastructure (PKIX) Working Group has made extensive updates to the standard in order to make it work with the looser organisation of the Internet. In the X.509 system, a CA issues a certificate binding a public key to a particular name, the Distinguished Name defined by X.500. Depending on the issuing authority, the binding can also be between a public key and an e-mail address or a DNS entry.

An organisation can issue root certificates to all its employees so that they can use the company PKI system. Browsers such as Microsoft Internet Explorer, Netscape/Mozilla, and Opera come with root certificates pre-installed, so SSL certificates from larger vendors who have paid for the privilege of being pre-installed will work instantly; in essence, the browser's programmers determine which CAs are trusted third parties. While their root certificates can be disabled, users rarely do so.

X.509 also includes standards for Certificate Revocation List implementations, an often overlooked necessity in PKI systems. The X.509 standard defines what information can go into a certificate, and describes how to write it down (the data format).

All X.509 certificates (up to version 3) have the following data, in addition to the signature:

Version This identifies which version of the X.509 standard applies to this certificate, which affects what information can be specified in it. Thus far, three versions are defined.

Serial Number The entity that created the certificate is responsible for assigning it a serial number to distinguish it from other certificates it issues. This information is used in numerous ways, for example when a certificate is revoked its serial number is placed in a Certificate Revocation List (CRL).

Signature Algorithm Identifier This identifies the algorithm used by the CA to sign the certificate.

Issuer Name The X.500 name of the entity that signed the certificate. This is normally a CA. Using this certificate implies trusting the entity that signed this certificate. (Note that in some cases, such as root or top-level CA certificates, the issuer signs its own certificate.)

Validity Period Each certificate is valid only for a limited amount of time. This period is described by a start date and time and an end date and time, and can be as short as a few seconds or almost as long as a century. The validity period chosen depends on a number of factors, such as the strength of the private key used to sign the certificate or the amount one is willing to pay for a certificate. This is the expected period that entities can rely on the public value, if the associated private key has not been compromised.

Subject Name The name of the entity whose public key the certificate identifies. This name uses the X.500 standard, so it is intended to be unique across the Internet. This is the Distinguished Name (DN) of the entity, for example, CN=Joe Bloggs, OU=Research, O=SAP, C=France. These fields refer to the subject's Common Name, Organisational Unit, Organisation, and Country.

Subject Public Key Information This is the public key of the entity being named, together with an algorithm identifier which specifies which public key crypto system this key belongs to and any associated key parameters.

5.1.2 Evolution of the X.509 standard

X.509 Version 1 It has been available since 1988, is widely deployed, and is the most generic.

X.509 Version 2 The second version introduced the concept of subject and issuer unique identifiers to handle the possibility of reuse of subject and/or issuer names over time. Most certificate profile documents strongly recommend that names not be reused, and that certificates should not make use of unique identifiers. Version 2 certificates are not widely used.

X.509 Version 3 The most recent version (1996) supports the notion of extensions, whereby anyone can define an extension and include it in the certificate. Some common extensions in use today are: KeyUsage (limits the use of the keys to particular purposes such as "signing-only") and AlternativeNames (allows other identities to also be associated with this public key, e.g., DNS names, e-mail addresses, IP addresses). Extensions can be marked critical to indicate that the extension must be checked and enforced/used. For example, if a certificate has the KeyUsage extension marked critical and set to "keyCertSign", then if this certificate is presented during SSL communication, it should be rejected, as the certificate extension indicates that the associated private key should only be used for signing certificates and not for SSL use.

All the data in a certificate is encoded using two related standards called ASN.1/DER. Abstract Syntax Notation One (ASN.1) describes the data; the Distinguished Encoding Rules (DER) describe a single way to store and transfer that data. This combination has been described simultaneously as "powerful and flexible" and as "cryptic and awkward". The IETF PKIX Working Group is in the process of defining standards for the Internet Public Key Infrastructure.

5.2 PKIX

PKIX is a Working Group established in the autumn of 1995 with the intent of developing the Internet standards needed to support an X.509-based PKI. The scope of PKIX work has since expanded beyond this initial goal: PKIX not only profiles ITU PKI standards, but also develops new standards appropriate to the use of X.509-based PKIs in the Internet. PKIX has produced several informational and standards-track documents in support of the original and revised scope of the WG.

The first of these standards, RFC 2459, profiled X.509 version 3 certificates and version 2 CRLs for use in the Internet. Profiles for the use of Attribute Certificates (RFC 3281), LDAP v2 for certificate and CRL storage (RFC 2587), the Internet X.509 Public Key Infrastructure Qualified Certificates Profile (RFC 3039/3739), and the Internet X.509 Public Key Infrastructure Certificate Policy and certification Practices Framework (RFC 2527/3647 - Informational) are in line with the initial scope.

The Certificate Management Protocol (CMP) (RFC 2510), the Online Certificate Status Protocol (OCSP) (RFC 2560), the Certificate Request Message Format (CRMF) (RFC 2511), the Internet X.509 Public Key Infrastructure Time-Stamp Protocol (RFC 3161), Certificate Management Messages over CMS (RFC 2797), and the use of FTP and HTTP for transport of PKI operations (RFC 2585) are representative of the expanded scope of PKIX, as these are new protocols developed in the Working Group, not profiles of ITU PKI standards.

5.2.1 X.509 Attribute Certificate and Privilege Management Infrastructure

The X.509v3 2000 recommendation includes an architecture for privilege management based on a form of attribute certificate. The attribute certificate framework defined there provides a foundation upon which Privilege Management Infrastructures (PMI) can be built. These infrastructures can support applications such as access control.

The binding of a privilege to an entity is provided by an authority through a digitally signed data structure called an attribute certificate or through a public-key certificate containing an extension defined explicitly for this purpose. The format of attribute certificates is defined here, including an extensibility mechanism and a set of specific certificate extensions.

Revocation of attribute certificates may or may not be needed. For example, in some environments the attribute certificate validity periods may be very short (e.g. minutes), negating the need for a revocation scheme. If, for any reason, an authority revokes a previously issued attribute certificate, users need to be able to learn that revocation has occurred so they do not use an untrustworthy certificate.

Revocation lists are one scheme that can be used to notify users of revocations. The format of revocation lists is also defined in this specification, including an extensibility mechanism and a set of revocation list extensions. Additional extensions are defined here. In both the certificate and revocation list case, other bodies may also define additional extensions that are useful to their specific environments.

A system using an attribute certificate needs to validate it prior to using that certificate for an application. Procedures for performing that validation are also defined here, including verifying the integrity of the certificate itself, its revocation status, and its validity with respect to the intended use.
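The validation steps just outlined can be sketched in a few lines of Python. All names here (`AttributeCert`, `is_valid`) are illustrative only and not part of any real X.509 API; in particular, cryptographic signature verification is reduced to a boolean stand-in.

```python
from dataclasses import dataclass
from datetime import datetime, timezone

# Toy sketch of the attribute-certificate validation steps described above.
# A real implementation would verify an actual digital signature and fetch
# an up-to-date revocation list.

@dataclass
class AttributeCert:
    serial: int
    holder: str
    not_before: datetime
    not_after: datetime
    signature_ok: bool  # stands in for cryptographic signature verification

def is_valid(cert: AttributeCert, revoked_serials: set[int],
             at: datetime) -> bool:
    if not cert.signature_ok:           # 1. integrity of the certificate
        return False
    if cert.serial in revoked_serials:  # 2. revocation status
        return False
    return cert.not_before <= at <= cert.not_after  # 3. validity period
```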

This framework includes a number of optional elements that are appropriate only in some environments. Although the models are defined as complete, this framework can be used in environments where not all components of the defined models are used. For example there are environments where revocation of attribute certificates is not required. Privilege delegation and the use of roles are also aspects of this framework that are not universally applicable. However these are included in this specification so that those environments that do have requirements for them can also be supported. The directory uses attribute certificates to provide rule-based access control to Directory information.

5.2.2 Relationship to PrimeLife

As X.509v3 is traditionally used to structure public-key certificates, it seems natural to offer X.509v3 versions of the issuer keys used in anonymous credential systems. These keys are needed for the user of an anonymous credential to check the validity of the private certificate generated in interaction with the issuer. A party relying on the result of a transaction with this user will also need these issuer keys to check the correctness of the proof. Representing issuer keys in X.509v3 should be possible without massive impact on the X.509 standard, as it would only require a way to represent non-standard key material in the structure. The signature in the X.509 certificate could be a classical one, for example RSA. Handling X.509v3 certificates is implemented in all major browsers and operating systems, but it remains to be investigated how these would react to an unknown key format.

Taking it one step further, leveraging existing X.509v3 handling implementations, it is also possible to represent the private certificate of the user in X.509v3 format. However, this would introduce additional complexity:

• the issuer verification key should be present in the certificate, so an unsupported key format is needed (same as above);
• the process of generating the signature and the representation of the signature value in the X.509v3 certificate is regulated by CMS (Cryptographic Message Syntax, PKCS#7). Solving the problem of the signature value representation, and of the signature algorithm identifier, might be possible, but it might be necessary to change the PKCS#7 processing rules to be able to produce a usable X.509v3 representation of a private certificate. Those changes would violate the standard, which would force us to write our own PKCS#7 software, or to massively change an existing open source implementation. In this case, standard PKCS#7 implementations would not be able to parse our changed format, even if they allow for plug-in implementations of new signature algorithms;

• building a private certificate is the result of an interaction between issuer and user, and therefore the issuer does not generate the resulting X.509v3 certificate. Apart from the fact that this might seem odd to those used to X.500, it is necessary to investigate how certain data fields have to be completed (one example is the Subject DN).

A third opportunity is to represent the result of a credential show in CMS/PKCS#7 format or as an X.509v3 attribute certificate. In the CMS case, this would be done in a similar fashion as mentioned in "Assertion-Based Signatures" (XMLDSIG, see chapter 5). Most of the problems originate from the fact that the signature algorithm employed does not have a classical structure. As the X.509v3 case is very similar with regard to limitations, it would require a way to represent the proved assertions in the X.509 attribute data model.

5.3 XML Signature

5.3.1 Specification Overview

The XML Signature specification [XML Sig] describes a format that can be used to encapsulate cryptographic signatures of arbitrary content in XML, and to sign XML documents, or subsets of such documents. The specification is designed to be highly extensible: Signed information can be subject to arbitrary transformations before signing. Algorithms - both cryptographic and otherwise - are identified by way of URIs and therefore exchangeable.

A typical signature consists of a SignedInfo element that describes the data that were signed, and the steps applied to it in order to generate the signature. Specifically, SignedInfo identifies the canonicalisation method that is applied to transform the element itself into an octet stream, the signature method, and any number of Reference elements. Each of these elements identifies a resource (or document subset), a chain of transformations, a digest method, and a digest value. To compute the final signature, SignedInfo is canonicalised, hashed and signed according to the given signature algorithm. Information about the signing key can be part of a KeyInfo element.

The processing model for XML Signature requires that each Reference element be dereferenced, transformed, and hashed. The validity of an XML Signature therefore implies that the referenced resource (as transformed) was unchanged.
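A minimal sketch of this Reference processing model follows, with URI dereferencing and the XML transform chain reduced to Python stand-ins; `validate_reference` and the in-memory `resources` dictionary are illustrative only, not part of any XML Signature API.

```python
import hashlib
from typing import Callable

# Sketch of the Reference processing model described above: each Reference
# is dereferenced, its transform chain is applied in order, the result is
# digested, and the digest is compared with the stored DigestValue.

def validate_reference(resources: dict[str, bytes],
                       uri: str,
                       transforms: list[Callable[[bytes], bytes]],
                       digest_value: bytes) -> bool:
    data = resources[uri]              # dereference the referenced resource
    for t in transforms:               # apply the transform chain in order
        data = t(data)
    # digest and compare with the DigestValue recorded in the Reference
    return hashlib.sha256(data).digest() == digest_value
```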

To enable different models and broader semantics, XML Signature supports the Object element as an extensibility point which may, in particular, hold a Manifest which is by itself a collection of Reference elements. Processing for Reference elements within a Manifest differs from processing for those which are direct children of SignedInfo: They need not be dereferenced; the precise meaning of a signature that involves a Manifest will depend on the application context. (For example, a manifest might be used to hold digest values of multiple representations of the same resource; in this case, it might be enough for one of several digest values to match.)

To hold further signature semantics, the optional SignatureProperty element may occur within an Object element; this element is specifically designed to hold additional information about the creation of its target signature. It can, e.g., be used to hold information about time stamps or signature generation hardware.

5.3.2 Status

XML Signature is a broadly implemented specification, used as a basic signing primitive by, among others, SAML and WS-Security. The XAdES [XADES] specification for advanced electronic signatures profiles and extends XML Signature.

The original XML Signature specification was jointly developed by W3C and the IETF, and first released as a W3C Recommendation in 2002. At the time of writing of this document, W3C is reviewing a lightly changed version for publication as a Second Edition of the XML Signature Recommendation. Republication of this document within the IETF framework is expected.

The W3C Workshop on Next Steps for XML Signature and XML Encryption [XML Sig/Enc Workshop] in September 2007 led to a proposal for further work on XML Signature. This new work [XML Security], launched in May 2008, will focus on issues with the specification that have been identified as a result of six years of research and deployment experience, in particular peculiarities of its referencing and transform model, the choice of mandatory cryptographic algorithms, and the performance impact of XML canonicalisation.

5.3.3 PrimeLife Impact

In PrimeLife we are interested in capturing the privacy enhancing aspects of signature related primitives. Such primitives include group signatures, but also e-cash and ring signatures. However, our main interest in PrimeLife is in anonymous credentials, which have a wide variety of features that allow them to emulate other privacy enhancing mechanisms such as group signatures and e-cash. They are also a major building block in realising privacy preserving attribute based access control. While our discussion could be kept more general, we focus on anonymous credentials as they cover most of the cases.

There are several ways in which signatures and anonymous credentials are related. Those relations are elaborated on in the following paragraphs.

XMLDSIG as Format for Private Certificates

The show of a credential is a proof of knowledge of a signature (this signature is sometimes called the private certificate). The signature needs to be obtained in an interactive protocol, but it could be stored locally in XMLDSIG format.

As only knowledge of the signature is proven there is not much need to exchange these signatures directly with a verifying entity. So this would be mostly an archiving and data exchange standard for synchronisation between different devices of the same user.

XMLDSIG as Format for Anonymous Credential-based Signatures

Proofs of knowledge of some secret material (in our case a signature on the user's secret key and different attributes) can be turned into non-interactive proofs using the Fiat-Shamir heuristic. Many known signature schemes follow this approach, e.g. Schnorr signatures, group signatures, and many ring signatures.
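As an illustration, a toy Schnorr signature obtained through the Fiat-Shamir heuristic can be sketched as follows; the interactive verifier's random challenge is replaced by a hash of the commitment and the message. The group parameters below are tiny demonstration values, far too small for any real use, and the function names are ours.

```python
import hashlib
import secrets

# Toy Schnorr signature via the Fiat-Shamir heuristic. We work in the
# subgroup of order Q = 1019 of Z_P* with P = 2039 = 2*Q + 1 and
# generator G = 4 (a quadratic residue, hence of prime order Q).

P, Q, G = 2039, 1019, 4

def _challenge(r: int, msg: bytes) -> int:
    # Fiat-Shamir: the challenge is a hash of commitment and message
    h = hashlib.sha256(str(r).encode() + msg).digest()
    return int.from_bytes(h, "big")

def keygen() -> tuple[int, int]:
    x = secrets.randbelow(Q - 1) + 1      # secret key
    return x, pow(G, x, P)                # (secret, public)

def sign(x: int, msg: bytes) -> tuple[int, int]:
    k = secrets.randbelow(Q - 1) + 1      # commitment randomness
    r = pow(G, k, P)                      # commitment
    e = _challenge(r, msg)                # hash replaces the verifier
    return e, (k + e * x) % Q             # (challenge, response)

def verify(y: int, msg: bytes, sig: tuple[int, int]) -> bool:
    e, s = sig
    # reconstruct the commitment: g^s * y^{-e} = g^k
    r = (pow(G, s, P) * pow(y, -e % Q, P)) % P
    return _challenge(r, msg) == e
```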

Some of these Fiat-Shamir heuristic based signatures are conventional signatures that fit into the current XMLDSIG framework (i.e., all signature schemes where the messages do not need to be input in the clear but rather as a hash). Others, such as assertion-based signatures that prove assertions about the attributes of credentials (as in anonymous credentials, group signatures, and ring signatures), do not fit as easily into the framework (as they require the message in the clear), and this would deserve further evaluation.

XMLDSIG as Format for Credential Shows

The proof of knowledge of a signature on certain attributes can be seen as 'passing on' the original signature on these attributes to a verifier. This passing on has certain desirable privacy-preserving properties: unlinkability to the issued signature, selective disclosure of attributes, and conditional showing of attributes (essentially all the features that anonymous credentials provide). In this interpretation no new message is signed by the user. The user only transforms the signature obtained from the credential issuer into a new format that reveals less information about data that was already signed, hiding for instance his identity.

This again could be expressed in XMLDSIG and deserves further consideration. A critical property of XML Signature is the fact that signature algorithms proper are applied to an XML description of the signed material; this description includes a digest value of the signed data. To apply advanced signing algorithms (i.e. to selectively reveal attributes), it is often desirable to be able to directly interact with the signed material.

Such direct interactions with the signed material include showing only parts of the signed material, or predicates or functions computed from signed values. This partial showing of signed material involves the owner of the signature as an active entity. As such the owner can choose what to reveal about the signed material and what to keep secret. However, he can only generalise the data but not change the assertions made by the issuing organisation.

Chapter 6 User Control Infrastructure

6.1 Smart Cards: User-controlled Token for Privacy Protection

6.1.1 Introduction

In a single sentence a smart card can be defined as

a personalised physical token under control of the user that can be used to perform cryptographic computations and keep data confidential.

In a more elaborate description, the important aspects of a smart card and its common use are:

• it is associated with a unique human user, the cardholder;
• the cardholder has physical possession of the card;
• some commercial entity, the card issuer, has prepared the card for use and delivered it to the cardholder;
• some commercial entity, the application issuer, has initialised the card with personalised data, e.g. account number;
• the application issuer defines the services that a card can deliver to the card holder and configures the card accordingly;
• a cardholder controls the use of the card and its functions;
• use of the card is localised to a terminal that has the facility to render the services the card has been configured for.

In many practical cases of deployed smart cards the application issuer and the card issuer are the same. A multi-application card, where multiple independent application issuers maintain separate "card applications" on a shared card is technically possible, yet has not seen any real deployments.

Deployments

Smart cards have been widely deployed (over 3 billion in 2007, as estimated by Eurosmart [Eurosmart]), and deployments are still increasing. By far the largest deployment is in mobile telephones, where the card is used as the subscriber identity module (SIM). Banking cards are being migrated from pure magnetic strip cards to smart cards with the magnetic strip as backup. Citizen ID cards and healthcare insurance cards have been or will be issued in many countries.

Trusted device

A smart card is operated as a trusted device in the application managed by the application issuer to deliver security functions. The trust is based on the tamper resistance of the smart card chip, on the reliability of the card operating software, and on the cryptographically maintained chain of trust between the card manufacturer (with any pre-issuance card processing agents), the card issuer, and the application issuer. As a basis for trust, many smart card products have undergone security evaluation according to the Common Criteria [CC], and the card industry has produced a number of protection profiles to guide these security evaluations.

Cardholder control

The cardholder has control over the card:

• by inserting the card in a reader, or for contactless cards by holding the card in front of it;
• by entering a PIN or password;
• by providing a biometric sample.

With the first two control factors the card holder indicates her intent to use the card. A biometric sample indicates presence of the card holder at the location of the biometric sensor. Applications in the card can be configured for the type of user control required for the functions it provides:

• PIN or biometric entry a single time when the card service session starts;
• PIN or biometric entry each time a particular service function is requested;
• different PINs or passwords may be configured for different functions;
• different biometric aspects may be configured for different functions;
• any combination of the above.

In practice, a smart card has a single PIN, and any application on the card that needs a PIN uses that single one. Similarly, if the card supports biometric on-card verification, any application on the card may use it. Since most deployed smart cards support a single application, these simple cardholder-control configurations are usually sufficient. Privacy aspects of biometric control methods are further investigated in the next section of this chapter.

Contactless cards, which by definition allow reading at a distance, are sensitive to snooping, and physical control of their use would require the card to be carried around in an RF shield (blocking Radio Frequency electromagnetic radiation).

Form Factor

In a formal view, based on the international standard ISO/IEC 7810 a smart card can be in the form of the conventional credit card, or alternatively, in the smaller card used for SIMs. The mobile phone, with an inserted SIM card, can be seen as providing an alternative form factor for the user controlled trusted functions. (The one-sentence definition above could be easily applied to a mobile phone). For interactions with the Web, and Web 2.0 service architectures targeted in PrimeLife, the mobile phone will be a convenient platform for user control as it has internet connectivity and provides a user interface.

The NFC [NFC] technology takes this extended form factor further and actually transforms the mobile phone into a contactless card. Since there now is a user interface, the snooping abuse inherent to contactless cards can be reduced, e.g. by an on/off input.

6.1.2 Standardisation

International standardisation for smart card technology is done in ISO/IEC [ISO/IEC WG4], in CEN TC 244 and in ETSI/3GPP [ETSI/3GPP]. Standardisation is concentrated in three areas:

• physical communication and application infrastructure;
• cryptographic algorithms;
• applications.

Work on communication and application infrastructure is done in SC17/WG4 (see chapter 8) (cards with contacts) and SC17/WG8 (contactless cards). The main standard documents are:

• ISO/IEC 7816-3, specifying the basic command-response interaction with a card and low-level details for operating and communicating over contacts;
• ISO/IEC 7816-4, specifying a structure for data stored on a card and commands to access stored data;
• ISO/IEC 7816-8, specifying commands to perform cryptographic operations;
• ISO/IEC 14443-*, specifying operating and communicating over an RF field;
• ISO/IEC 24727-*, specifying an API for interacting with a card as a device that carries user identification data.

Work on cryptographic algorithms is done in TC224/WG16. Work on applications for cards is done in ETSI/3GPP (SIM, USIM, SIM toolkit), SC17/WG10 (driving licence), TC224/WG15 (citizen ID card), and by the industry groups EMV, for payment cards, and Global Platform [Global Platform], for application management.

6.1.3 Architectures

At a technical level the card can be modeled in three alternative and complementary views:

• as a storage device with a hierarchically organised data structure, enhanced with access control and transport security;
• as a cryptographic co-processor with a tamper-resistant key store;
• as a general-purpose, programmable computer.

Privacy protection and PETs in general are not explicitly addressed in most activities in current standardisation. Where addressed, the concern is the protection of privacy-sensitive data in transfer from card to host. Specification of privacy protection rules and the interpretation of privacy security attributes is left to the application outside the card.

The standards do not specify access conditions for reading unique identifiers in the card, like a chip serial number, an application serial number or a public key certificate. In the ISO/IEC 14443 series, however, a fixed chip identifier initially broadcast to establish a connection has been replaced by a random one.

6.1.4 Strategy and Actions

Two areas of activities in PrimeLife may need to interact with standardisation for smart cards:

• in the development of algorithms for user control,
• in the development of user interfaces for user control.

Possible actions:

• participation in SC27/WG5 to additionally address the adaptation and design of protocols with a role for the smart card in giving the user control in protecting her privacy;
• participation in SC17/WG4 to introduce PET protocols and algorithms in the areas of:
◦ protecting user data exchanged with the card for PIN or biometric identification
◦ privacy protecting enhancements for the requests and procedures for entity authentication and card management
◦ restriction on access to unique card identifying data
• identify protocols that are suitable for deployment on cards or require support by smart card functions and organise timely technical input to the work on standards from other work packages;
• plan to integrate a smart card in the M4 demonstrator milestone with a mobile phone and SIM card application as user control user interface.

6.2 Biometrics Standardisation and Privacy

Biometrics is the science of measuring physical or behavioural characteristics that are suitable for the identification of individuals. There is a variety of applications using biometric user authentication today, including national ID cards, digital signatures and physical or logical access control. A biometric trait, such as a fingerprint, iris, dynamic signature or face, is stored in the system and, typically, a claimant presents his or her biometric characteristic again to be compared with the previously recorded reference data. From a cryptographic point of view, biometric data must be regarded as public, not secret: biometric credentials are unique identifiers, not secrets. From a privacy viewpoint, however, biometric data must be considered confidential. Biometric data is critical in terms of privacy: it is not merely related information, like a bank account number or street address, but is tied to the individual, usually for the whole life. Biometric information can contain unwanted additional personally identifiable information like gender, age, susceptibility to some diseases or even sexual preferences.

The open mass applications applying biometrics have led to a need for standardisation of biometrics. Industry, government and academic delegates attend the standardisation consortia. This document briefly describes the standardisation landscape for biometrics and points to aspects that are relevant in terms of privacy.

6.2.1 Biometrics Standardisation

Standardisation of biometrics started with national groups strongly led by NIST (the National Institute of Standards and Technology) in the United States. Several industries are also dealing with biometrics standardisation, e.g. ICAO (the International Civil Aviation Organization). The most important standardisation activity today is ISO/IEC JTC1 SC37 (biometrics). It has established liaisons with all major standardisation bodies and is organised into six Working Groups:

• WG1: Harmonised Biometric Vocabulary and Definitions
• WG2: Biometric Technical Interfaces
• WG3: Biometric Data Interchange Format
• WG4: Profiles for Biometric Applications
• WG5: Biometric Testing and Reporting
• WG6: Cross-Jurisdictional and Societal Aspects

Another important group is ISO/IEC JTC1 SC17 WG11 applied biometrics on cards. It deals with on-card comparison of biometrics as will be explained below.

6.2.2 Architectures

A biometric system operates in two phases: enrolment and verification/identification. A biometric reference data set is recorded in the enrolment phase and stored in a database or on a portable data carrier. The actual application assumes that reference data was previously recorded. In a physical access control system, users would present biometric traits, e.g. their faces, to the system to be compared with the reference data. If the two samples are considered similar, access is granted. From a privacy point of view, it is of utmost importance where the storage and comparison actually take place. In a networked world, where one would not trust a communication partner or the operating system on a host computer, one can still trust the smart card carried in a wallet. It enhances privacy when an application using a global database is reorganised to store all biometric data on a portable data carrier. The same is true for changing from an online verification, where data has to be transmitted, to offline verification in a tamper-proof embedded system. A special case of such an embedded system is a smart card, and the technology to perform a biometric comparison within a smart card is named Match-on-Card, as addressed in SC17 WG11.
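The verification step described above can be reduced to a very small sketch: a freshly captured sample, represented as a feature vector, is compared with the stored reference, and access is granted when the two are sufficiently similar. The representation, distance measure and threshold below are illustrative stand-ins for a real biometric comparator; in a Match-on-Card setting this comparison would run inside the smart card, so the reference never leaves the tamper-resistant token.

```python
import math

# Toy biometric verification: Euclidean distance between feature vectors
# as a stand-in for a real biometric matching algorithm.

def matches(reference: list[float], sample: list[float],
            threshold: float) -> bool:
    dist = math.dist(reference, sample)   # dissimilarity of the two samples
    return dist <= threshold              # grant access if similar enough
```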

6.2.3 Strategy and Actions

To enhance privacy in today and future applications, standardisation activities should be observed and influenced where necessary. Depending on the particular application, databases and insecure data management should be avoided. Protecting biometric data by cryptographic means and storage as well as comparison in portable data carriers should be promoted.

The most important groups to address are the following:

• SC37 WG2: care needs to be taken that the most common interfaces respect the needs for privacy and do not exclude Privacy-Enhancing Technologies.

• SC37 WG3: the data formats should also be carefully inspected, and comments made to ease the use of offline storage and verification.
• SC37 WG4: today there are only a few application profiles, and the existing profiles are not well designed to protect the privacy of individuals. Further profiles should be encouraged to change this dramatically.
• SC17 WG11: the standards produced by this group deal exactly with privacy-enhancing technology. They should be supported to reach a stable document status and promoted in other liaisons.
• Other activities, such as testing, should be observed in a passive role.

Chapter 7 Identity Systems

In this section we provide an overview of different identity systems as well as an insight into identity federation protocols. The reason for presenting these topics in one section is that there are specifications and implementations released under identical names (e.g. OpenID).

7.1 OpenID

Main OpenID specifications:

• OpenID Attribute Exchange 1.0 [OpenID 1.0] • OpenID Authentication 2.0 [OpenID 2.0]

Additional information is available from the OpenID Foundation [OpenID Foundation].

7.1.1 Background

The OpenID protocol traces some of its roots back to Web blog commenting use cases: the fundamental idea is that, instead of asking commenters to coin a user name / password pair, commenters should identify themselves by providing the URI of their own blog. The protocol is based on simple data formats and is layered on top of HTTP.

OpenID therefore only requires implementation effort on the relying party and OpenID provider sides. On the user's side, an ordinary Web browser is sufficient.

7.1.2 Protocol Flow

The OpenID protocol is executed between the relying party (RP), the OpenID provider (OP), and the user.

In a typical scenario, the user will visit the RP's Web site, where a login-like form will ask him to enter an OpenID identifier (a URI or XRI, in typical use cases). Upon submission, the RP will then use this identifier to discover the OP's URI. RP and OP perform an anonymous Diffie-Hellman key exchange (called association in OpenID parlance); the key that is negotiated is used later to authenticate further messages that are exchanged. The user's Web browser is then redirected to the OpenID provider, with an authentication request.

The OpenID provider will at this step perform user authentication (which might be as simple as verifying the presence of a cookie, or as complex as the execution of another federation protocol), and possibly interact with the user to enable him to authorise the overall OpenID transaction. The details of this step are out of scope for the core OpenID specifications.

If user authentication (and authorisation of the transaction) is successful, the OpenID provider redirects the user's browser back to the relying party; any assertions that are passed back to the relying party carry a message authentication code which is keyed with the shared secret established during the association phase.
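The association and signing steps above can be sketched as follows. This is a deliberate simplification: real OpenID 2.0 uses the Diffie-Hellman secret to encrypt a separately generated MAC key rather than as the key itself, mandates specific big-number encodings, and specifies a much larger default group; here the shared secret is hashed directly into the MAC key to keep the idea visible, and all function names are ours.

```python
import hashlib
import hmac
import secrets

# Toy Diffie-Hellman parameters; far too small for real use.
P, G = 2039, 2

def dh_keypair() -> tuple[int, int]:
    x = secrets.randbelow(P - 2) + 1
    return x, pow(G, x, P)                       # (private, public)

def mac_key(own_private: int, other_public: int) -> bytes:
    shared = pow(other_public, own_private, P)   # same value on both sides
    return hashlib.sha256(str(shared).encode()).digest()

def sign_fields(key: bytes, fields: dict[str, str]) -> str:
    # OpenID key-value form: one "key:value" line per signed field
    kv = "".join(f"{k}:{v}\n" for k, v in fields.items())
    return hmac.new(key, kv.encode(), hashlib.sha256).hexdigest()
```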

The OpenID protocol exchange will scale badly to use cases in which individual HTTP requests are to be authenticated; in typical deployments, the OpenID protocol will be executed one (or a few) time(s) while the user interacts with the relying party. Authentication state is then attached to the user's session.

7.1.3 Message Formats

The abstract syntax for OpenID messages consists of a collection of simple tag-value pairs. There is no provision for more deeply structured data.

The messages are transmitted through HTTP (or HTTPS); there are several concrete syntaxes depending on the environment in which the message is passed.

During the association phase, the relying party will submit the OpenID message as the body of an HTTP POST request, and receive the answer in a simple text-based format.

For messages that are passed through the user's browser (in particular the authentication request and response), messages are always encoded in HTTP requests, either in the body of HTTP POST requests (i.e., through form submission) or in query parameters of an HTTP GET request.

Messages can be "signed" by using a message authentication code keyed with the shared secret established during the association phase.

The message format is extensible with additional tags; the authentication request and response can therefore be used to pass personal information about the user from the OpenID Provider to the relying party.

7.1.4 Trust and Privacy Properties

The OpenID protocol provides a framework for certain assertions about a user's association with a URI. It does not provide an independent cryptographic proof to the relying party that the user has indeed executed a certain protocol with the OpenID Provider.

The establishment of trust between the relying party and the OpenID provider is out of the protocol's scope. In deployments, the requisite policies range from accepting identities from any OpenID provider, through OpenID provider blacklists, to approaches where few (or only one) previously known OpenID providers are trusted.

Privacy concerns focus on the ability of the user's OpenID provider to link user transactions with different relying parties. It should also be noted that OpenID can be used to pass along personal information; how the release of this information is authorised is up to the individual OpenID Provider's choice.

Criticisms of OpenID centre on certain aspects of the protocol design, and on risks that a malicious relying party might be able to successfully impersonate the user's OpenID provider.

7.1.5 Specification Development

The OpenID specifications were developed through an informal collaboration of interested parties. Since then, the OpenID foundation has been formed as a framework for future specification development; the foundation also manages intellectual property rights around OpenID.

7.1.6 Open Source Implementations

Numerous open source implementations of OpenID are available in different languages such as C#, Java, Perl or PHP. A list of OpenID libraries [OpenID libraries] is hosted at the OpenID wiki. In addition to the current implementations of OpenID, there has also been an Apache project, which has been merged into OpenID.

• Heraldry (06/09/2007) [Heraldry]
◦ supports the OpenID protocol and planned to support the CardSpace protocol,
◦ merged into OpenID,
◦ License: Apache 2.0.
• OpenID4Java (Sxip) [OpenID4Java]
◦ Sxip also provides the Sxipper [Sxipper] Firefox plugin, which is explained in more detail in Selected Open Source Projects (see chapter 8),
◦ Language: Java,
◦ License: Apache 2.0.
• Netmesh Infogrid [Infogrid]
◦ Netmesh is the founder of LID and co-founder of Yadis (see chapter 7),
◦ Languages: Java, PHP, Perl,
◦ License: Sleepycat-like open source/commercial license [Netmesh license].

A Firefox plugin provided by VeriSign might prove useful for OpenID users. The so-called SeatBelt [SeatBelt] plugin detects whether a relying party supports OpenID. If the RP supports OpenID, the user is asked whether he wants to sign in with his OpenID, and he can even choose between the different OpenIDs he controls via the toolbar button.

7.1.7 PrimeLife Perspective on OpenID

From a PrimeLife perspective, OpenID is a platform for decentralised identity federation and personal information transmission that merits further investigation, in particular if its deployment is successful. The relative simplicity of many core aspects of OpenID eases deployment, but will also be a challenge, as it might make the integration of advanced privacy-enhancing technologies into this protocol more difficult.

7.2 Higgins

The Higgins identity management project [Higgins] is an open source Internet identity framework designed to integrate identity, profile, and social relationship information across multiple sites, applications, and devices. Higgins is not a protocol; it is software infrastructure to support a consistent user experience that works with all popular digital identity protocols, including WS-Federation (see chapter 7), WS-Trust, SAML (see chapter 7), LDAP, Microsoft CardSpace (see chapter 7) and so on. The main contributing partners within the Higgins project are Parity Inc, Novell, IBM, and Serena, with interested support from Computer Associates, Oracle, and Google.

The end-user paradigm of Higgins is based on the i-card metaphor: the user manipulates visual representations, i-cards, which represent the user's identities with identity-granting institutions (identity providers). When accessing the provider of a Web service or a Web site (the relying party), the i-card GUI lets the user select an i-card and thus the identity provider and the personal information released to the relying party.

The Higgins user interfaces allow for the issuance of i-cards, their management, and their usage when accessing Web sites and services. The project currently supports various user interface implementations available on diverse platforms (Apple OS, Linux, Windows). These user interfaces are implemented either as a pure Web-based architecture, as browser extensions, or as a combination of a browser extension and a self-contained GUI application.

The data model of Higgins is based on the Higgins Global Graph (HGG) data model and the Higgins Identity Attribute Service (IdAS). The HGG distinguishes amongst:

• Identity contexts: the data space for digital identities, such as a directory, a social network infrastructure, etc.
• Entities: contained in contexts, entities represent real-world abstractions such as users, groups, organisations, etc.
• Attributes: entities have a set of attributes, which can be simple or complex (e.g., composite values), such as given name, date of birth, nationality, etc.

Developers use a Java based framework that provides an interoperability and portability abstraction layer over existing “silos” of identity data. IdAS makes it possible to “mash-up” identity and social network data across highly heterogeneous data sources including directories, relational databases, and social networks. Support for OWL/RDF based ontologies is intended to allow for mapping between semantically equivalent identity attributes.

The overall, high-level, architectural schematic is shown in the figure below. The preliminary step (1) fashions an identity, represented as an i-card and stored in the user’s i-card store (commonly on disk or some other storage device). The identity provider (IP) relies on the IdAS to obtain the various identity attributes defining the user’s identity.

In order to access a Web service or a Web site (the relying party), the user contacts the site (for example via his or her browser) (step 2). The site redirects the request to the user and lets him or her select an appropriate i-card using the Higgins I-card GUI (also known as the identity selector service (ISS)). Once selected, the identity provider associated with the selected i-card is invoked in order to generate a secure token (implemented as a SAML assertion, for example) (step 3). This secure token is relayed to the relying party which verifies the presented access token and either grants or denies access (step 4) to the Web service or resource.
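The four steps above can be sketched in code. The class and method names below (`IdentityProvider`, `RelyingParty`, `issue_token`) are illustrative stand-ins, not the actual Higgins APIs; the sketch only shows how an i-card ties a provider to a subset of disclosable claims.

```python
# Minimal sketch of the Higgins i-card flow; all names are illustrative.

class IdentityProvider:
    def __init__(self, attributes):
        self.attributes = attributes  # attributes served via IdAS

    def issue_token(self, claims):
        # Step 3: build a secure token (e.g. a SAML assertion) containing
        # only the requested subset of identity attributes.
        return {c: self.attributes[c] for c in claims if c in self.attributes}

class RelyingParty:
    def __init__(self, required_claims):
        self.required_claims = required_claims

    def verify(self, token):
        # Step 4: grant access only if every required claim is present.
        return all(c in token for c in self.required_claims)

# Step 1: an i-card links a provider with the claims it can assert.
idp = IdentityProvider({"given_name": "Alice", "nationality": "CH"})
i_card = {"provider": idp, "claims": ["given_name", "nationality"]}

# Step 2: the user selects the i-card; the selector requests a token
# with only the claims this relying party actually needs.
rp = RelyingParty(required_claims=["given_name"])
token = i_card["provider"].issue_token(["given_name"])

assert rp.verify(token)            # access granted
assert "nationality" not in token  # undisclosed attribute stays private
```

Note that the selective disclosure in step 2 is exactly what makes the i-card metaphor interesting from a privacy perspective: the token carries a chosen subset of the card's attributes, not the full card.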

Figure 2: High-level Higgins architecture

Higgins is compatible with Microsoft CardSpace and allows users to import CardSpace's Information Cards and use them to access CardSpace-compliant Web sites.

The Higgins project released version 1.0 of the environment in March 2008. Further work is being investigated in the following areas:

• Policies: Higgins currently reuses the Microsoft CardSpace claims language to express secure token requirements; more sophisticated policy and claims syntax and semantics are of interest.
• Ontologies: ontologies on attributes to allow the mapping between required policy claims and IdAS-supplied identity attributes.
• Additional identity schemes: in addition to the support of Microsoft CardSpace, other identity schemes such as OpenID or IBM's Identity Mixer technology are considered for integration into the Higgins framework.
• IdAS access authorisation: a unified and generalised access authorisation layer for the Higgins IdAS environment is discussed within the project.

7.3 CardSpace

CardSpace [CardSpace] is the identity selector provided by Microsoft. Hence, it is not a standardisation effort but a product. Nevertheless, we describe it here because it completes the picture of commonly used identity systems. CardSpace is shipped with Windows Vista and the .NET Framework 3.0 and later. It provides four major features:

• support for any digital identity system
• consistent user control of digital identity
• replacement of password-based Web login
• improved user confidence in the identity of remote applications.

CardSpace is built on top of the Web Services Protocol Stack. It uses WS-Security, WS-Trust, WS-MetadataExchange and WS-SecurityPolicy, which means that it can easily be integrated with other WS-* applications. In CardSpace, a so-called Information Card contains all claims which are associated with an identity of a user. If a Web site is to accept Information Cards for authentication, the developer needs to add an OBJECT tag to the HTML code of the Web site. This tag declares what claims the Web site needs for authentication. The developer then has to decrypt and evaluate the token that CardSpace sends to the Web site.
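For illustration, such a tag might look as follows. The MIME type is the fixed Information Card type; the token type and claim URIs shown are examples of commonly requested values, and the element name is taken from typical samples:

```html
<!-- Sketch of a relying party's Information Card OBJECT element;
     parameter values are example values, not a fixed requirement. -->
<object type="application/x-informationcard" name="xmlToken">
  <param name="tokenType"
         value="urn:oasis:names:tc:SAML:1.0:assertion" />
  <param name="requiredClaims"
         value="http://schemas.xmlsoap.org/ws/2005/05/identity/claims/givenname
                http://schemas.xmlsoap.org/ws/2005/05/identity/claims/emailaddress" />
</object>
```

When the page is submitted, the encrypted token issued by the selected identity provider is posted in the named form field, which is what the developer must then decrypt and evaluate.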

We typically rely on a number of different digital identity systems, each of which may use a different underlying technology. To think about this diversity in a general way, it is useful to define three distinct roles:

• User: the entity that is associated with a digital identity
• Identity provider: an entity that provides a digital identity for a user
• Relying party: an application that in some way relies on a digital identity to authenticate a user, and then makes an authorisation decision

Given these three roles, it is not difficult to understand how CardSpace can support any digital identity system. A user might rely on an application that supports CardSpace, such as a Web browser, to access any of several relying parties. She might also be able to choose from a group of identity providers as the source of the digital identity she presents to those relying parties. Whatever choice she makes, the basic exchange among these parties comprises three steps:

First, the application gets the security token requirements of the relying party that the user wishes to access. This information is contained in the relying party's policy, and it includes things such as what security token formats the relying party will accept, and exactly what claims those tokens must contain. Once it has received the details of the security token this relying party requires, the application passes this information to CardSpace, asking it to request a token from an appropriate identity provider. After this security token has been received, CardSpace gives it to the application, which passes it on to the relying party. The relying party can then use this token to authenticate the user or for some other purpose. Working with CardSpace does not require relying parties or identity providers to implement any proprietary protocols.

CardSpace implements an intuitive user interface for working with digital identities. Each digital identity is displayed as an Information Card. Each card represents a digital identity that the user can potentially present to a relying party. Along with the visual representation, each card also contains information about a particular digital identity. This information includes which identity provider to contact to acquire a security token for this identity, what kind of tokens this identity provider can issue, and exactly what claims these tokens can contain. By selecting a particular card, the user is actually choosing to request a specific security token with a specific (sub-)set of claims created by a specific identity provider. In fact, the user need not disclose the full information associated with an Information Card, and she can verify what will be revealed to the relying party.

There are different interoperability initiatives which are implementing CardSpace on the relying party side. Those initiatives are discussed in more detail in the discussion of Selected Open Source Projects (see chapter 8). The projects using CardSpace in one way or another are Bandit [Bandit], the Concordia Project [Concordia], Higgins (see chapter 7), and OSIS [OSIS at Identity Commons].

7.4 WS-Federation

WS-Federation (see WSFed Technical Committee [WSFed TC]) introduces mechanisms to manage and broker trust relationships in a heterogeneous and federated environment. This includes support for federated identities, attributes and pseudonyms. 'Federation' refers to the concept that two or more security domains agree to interact with each other, specifically letting users from one security domain access services in the other security domain. For instance, two companies that have a collaboration agreement may decide that employees from the other company may invoke specific Web services. These scenarios with access across security boundaries are called 'federated environments' or 'federations'. Each security domain has its own security token service(s), and each service inside these domains may have individual security policies. WS-Federation uses the WS-Security, WS-SecurityPolicy and WS-Trust specifications to specify scenarios that allow requesters from one domain to obtain security tokens in the other domain, thus subsequently getting access to the services in the other domain.

To illustrate this concept with an example, imagine that a user Alice from company A intends to access Bob's Web service in company B. Alice and Bob do not have any prior relationship, but both companies have agreed to federate certain services, and the decision is that particular users from company A may access dedicated services inside company B. By some means, Alice knows the endpoint reference of Bob's service. Using the basic mechanisms defined in WS-PolicyAttachment, WS-MetadataExchange, and WS-SecurityPolicy, Alice retrieves the security policy of Bob's service and detects that the security token service STSB of company B issues tokens to access this service. Alice issues a security token request to the security token service STSA of company A, claiming to need a token to access STSB. Company A and company B are federated, therefore STSA is able to issue a security token for Alice. Of course, that may depend on whether Alice belongs to the group of A's employees that are permitted to access Bob's services. In the next step, Alice requests a token for accessing Bob's service from STSB and proves her authorisation by presenting the token issued by STSA. After validating the token issued by STSA, STSB issues a security token for access to Bob's service (assuming that Bob's Web service belongs to the group that company B offers to company A). In the last step, Bob's Web service is invoked by Alice. During that final request, Alice presents the token issued by STSB.
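The token chain in this example can be sketched as follows. The `STS` class and the dictionary-based tokens are illustrative simplifications, not WS-Trust message formats; the point is only that STSB accepts STSA's token in place of a local login.

```python
# Sketch of the federated token chain: Alice -> STSA -> STSB -> service.
# All names and data structures are illustrative, not WS-Trust syntax.

class STS:
    def __init__(self, name, trusted_issuers, local_subjects):
        self.name = name
        self.trusted_issuers = trusted_issuers  # federated partner STSs
        self.local_subjects = local_subjects    # users of this domain

    def issue(self, subject=None, presented_token=None):
        if presented_token is not None:
            # Federated case: accept a token from a trusted partner STS.
            if presented_token["issuer"] not in self.trusted_issuers:
                raise PermissionError("untrusted issuer")
            return {"issuer": self.name, "subject": presented_token["subject"]}
        # Local case: the subject must belong to this security domain.
        if subject not in self.local_subjects:
            raise PermissionError("unknown subject")
        return {"issuer": self.name, "subject": subject}

sts_a = STS("STSA", trusted_issuers=[], local_subjects=["Alice"])
sts_b = STS("STSB", trusted_issuers=["STSA"], local_subjects=[])

token_a = sts_a.issue(subject="Alice")        # Alice authenticates at home
token_b = sts_b.issue(presented_token=token_a)  # STSB trusts STSA's token
assert token_b == {"issuer": "STSB", "subject": "Alice"}
```

Bob's service would finally accept `token_b` because its own STS issued it; Alice never needs an account in company B.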

Besides this introductory example, WS-Federation shows how such a federation could work across multiple security domains or how delegation could be used. Delegation means that a user may delegate certain access rights on one federated resource to a different federated resource. WS-Federation adds security to service aggregation. Additionally, WS-Federation defines mechanisms to handle pseudonyms (aliases used at different services and federations) and management mechanisms for the pseudonyms, including single sign-in and sign-out (sign-out refers to the removal of pseudonym related information at different services).

WS-Federation is helpful for establishing trust relationships and is therefore essential for Web service work in PrimeLife.

7.5 SAML

The Security Assertion Markup Language (SAML) is an XML standard that defines a framework for exchanging security information, such as authorisation, authentication and attribute statements. It was developed by standards organisation OASIS (see chapter 8) (the Organisation for the Advancement of Structured Information Standards).

7.5.1 Background

SAML 2.0 comes from the combined effort of OASIS, the Liberty Alliance and the Shibboleth Project. These standards bodies enhanced SAML 1.1 to create SAML 2.0.

SAML 2.0 was ratified as an official OASIS industry standard in 2005 and is now backed by multiple vendors and organisations around the world as the industry standard for deploying and managing open identity-based applications. SAML 2.0 represents a step toward the convergence of identity standards, and all future enhancements to Liberty Federation will be based on SAML 2.0.

The Liberty Alliance added support for SAML 2.0 to Liberty Web Services in 2005 and at that time incorporated SAML 2.0 testing into its Liberty Interoperable conformance programme. WS-Security also supports SAML.

7.5.2 Architecture

SAML is defined in terms of:

• Protocols: how to define a request or a response.
• Assertions: how to define the information about authentication, attributes and authorisation. An assertion contains a package of information that supplies one or more statements made by a SAML authority. SAML defines three different kinds of assertion statements: authentication (a subject was authenticated by a specified method at a certain time), attribute (the subject is associated with a set of attributes) and authorisation (a request to allow the subject to access the specified resource).
• Bindings & profiles: how SAML protocols are mapped onto transport layers (bindings) and how they are combined to support a specific use case (profiles). For example, the Web Browser Single Sign-On Profile describes how SAML authentication assertions are issued and communicated between an identity provider and a service provider to enable single sign-on for a browser user.
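A typical SAML 2.0 assertion carrying an authentication statement might look like the following sketch, where the issuer, subject, identifiers and timestamps are placeholder values:

```xml
<!-- Sketch of a SAML 2.0 authentication assertion; all values are
     placeholders, and the XML Signature element is omitted. -->
<saml:Assertion xmlns:saml="urn:oasis:names:tc:SAML:2.0:assertion"
                ID="_a75adf55" Version="2.0"
                IssueInstant="2008-05-30T09:22:05Z">
  <saml:Issuer>https://idp.example.org</saml:Issuer>
  <saml:Subject>
    <saml:NameID>alice@example.org</saml:NameID>
  </saml:Subject>
  <saml:AuthnStatement AuthnInstant="2008-05-30T09:22:00Z">
    <saml:AuthnContext>
      <saml:AuthnContextClassRef>
        urn:oasis:names:tc:SAML:2.0:ac:classes:PasswordProtectedTransport
      </saml:AuthnContextClassRef>
    </saml:AuthnContext>
  </saml:AuthnStatement>
</saml:Assertion>
```

The AuthnStatement element carries the "who was authenticated, how, and when" information that a service provider evaluates before establishing a session.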

7.5.3 Protocol Flow

As an example, let us consider the basic template for achieving SSO using SAML2, in particular the service provider (SP) initiated protocol.

In this scenario, a user visits an SP's Web site for the first time to access some resource. The SP determines the location of an endpoint at an identity provider (IdP) for the authentication request protocol and sends a SAML2 authentication request to the IdP through the user agent (e.g., using the HTTP Redirect binding). The authentication request is read by the IdP. The IdP first checks whether the user is already logged in; if not, the IdP asks the user to provide authentication material (credentials), for example a login and password. If the credentials are valid, the IdP sends a SAML2 Response message to the SP through the user agent, e.g., using the SAML2 HTTP POST binding. This message may indicate an error or include an authentication assertion. The SP receives and checks the SAML2 Response message; it may respond to the user agent with its own error, or establish a security context for the user and provide the requested resource.
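The flow above can be sketched as follows. The dictionaries stand in for the SAML2 AuthnRequest and Response XML messages, and all class and field names are illustrative.

```python
# Sketch of SP-initiated SSO: SP -> IdP (AuthnRequest), IdP -> SP
# (Response with assertion). Message dictionaries are illustrative.

class IdP:
    def __init__(self, passwords):
        self.passwords = passwords
        self.sessions = set()  # users with an existing login session

    def handle_authn_request(self, request, user, password=None):
        if user not in self.sessions:  # not logged in: check credentials
            if self.passwords.get(user) != password:
                return {"in_response_to": request["id"], "status": "error"}
            self.sessions.add(user)
        return {"in_response_to": request["id"], "status": "success",
                "assertion": {"subject": user}}

class SP:
    def request_resource(self, idp, user, password=None):
        request = {"id": "req-1"}  # stands in for a SAML2 AuthnRequest
        response = idp.handle_authn_request(request, user, password)
        if response["status"] != "success":
            return None            # SP reports its own error to the agent
        return f"resource for {response['assertion']['subject']}"

idp = IdP({"alice": "s3cret"})
sp = SP()
assert sp.request_resource(idp, "alice", "wrong") is None
assert sp.request_resource(idp, "alice", "s3cret") == "resource for alice"
# Single sign-on: the existing IdP session is reused, no password needed.
assert sp.request_resource(idp, "alice") == "resource for alice"
```

The last line is the single sign-on effect: once the IdP holds a session for the user, further service providers obtain assertions without a fresh login.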

7.5.4 Open Source Implementations

• OpenSAML implementation from Internet2 [OpenSAML]
◦ Implements SAML 2.0
◦ C++ and Java libraries
◦ License: Apache 2 License
• Enterprise Sign On Engine (ESOE) as SSO solution [ESOE]
◦ Implements SAML 2.0 and Lightweight XACML (LXACML)
◦ Java libraries
◦ License: Apache 2 License
• Liberty Alliance Single Sign-On (LASSO), free Liberty Alliance implementation [LASSO]
◦ Also supports SAML 2.0
◦ C libraries
◦ License: GNU GPL and commercial license for proprietary use
• Lightbulb (subproject of OpenSSO) [Lightbulb]
◦ Implements SAML 2.0
◦ PHP
◦ License: Common Development and Distribution License
• Shibboleth from Internet2 [Shibboleth]
◦ Privacy-extended SAML implementation
◦ C and Java libraries
◦ License: Apache Software License
• ZXID [ZXID]
◦ Implements SAML 2.0 as a C library
◦ C libraries (supports Perl, PHP, Java)
◦ License: Apache 2 License

7.6 Liberty Identity Federation

The Liberty Identity Federation Framework (Liberty ID-FF) is one of the three modules of the Liberty architecture. It defines a set of protocols, bindings, and profiles for identity federation, cross-domain authentication, and session management.

7.6.1 History and relationship with SAML

Previous versions of the Liberty ID-FF (up to 1.2) were built on the SAML 1.0/1.1 specifications. More recently, the Liberty ID-FF (v1.2) was integrated into the SAML 2.0 specification. Additionally, SAML 2.0 also incorporates components from the Shibboleth initiative.

Figure 3: ID-FF SAML convergence

7.6.2 Liberty profiles

The Liberty ID-FF describes various profiles. A Liberty profile is basically a combination of content specifications and transport mechanisms to support specific functions. The existing Liberty profiles, grouped according to their functions, are the following:

• Single Sign-On and Federation: the profiles by which a service provider obtains an authentication assertion from an identity provider, facilitating single sign-on and identity federation.
• Name Registration: the profiles by which service providers and identity providers specify the name identifier to be used when communicating with each other.
• Federation Termination Notification: the profiles by which service providers and identity providers are notified of federation termination.
• Single Logout: the profiles by which service providers and identity providers are notified of authenticated session termination.
• Identity Provider Introduction: the profile by which a service provider discovers which identity providers a user agent may be using.
• Name Identifier Mapping: the profile by which a service provider may obtain a Name Identifier with which to refer to a user agent at a SAML Authority.
• Name Identifier Encryption: the profile by which one provider may encrypt a Name Identifier to permit it to pass through a third party without revealing the actual value until received by the intended provider.

A full description of these profiles can be found in the Liberty ID-FF Profiles specification [ID-FF Profiles].

A particularly relevant use case is single sign-on; the corresponding protocol and a specific profile are described below.

7.6.3 Profiles of the Single Sign-On and Federation Protocol

The Single Sign-On and Federation Protocol defines the messages exchanged between service providers and identity providers for supporting single sign-on. The mapping of these messages to particular transfer protocols (e.g., HTTP) and protocol flows is described in the Single Sign-On and Federation profile, which lists various possible solutions. An example follows.

7.6.4 Single Sign-On Protocol Flow Example: Liberty Artifact Profile

The user or user agent connects to a service provider (SP). To log in, the user has to select the preferred identity provider (IdP), for example from a list presented on the service provider's login page (alternatively, in some implementations, the SP itself may discover the preferred IdP by other means). Then, the user's browser is redirected to the IdP, with an embedded parameter indicating the originating SP, and the user may log in to the IdP by providing the necessary credentials.

The IdP verifies the login credentials and, if successful, redirects the user agent to the originating SP with a single-use, encrypted credential, called an artifact, included in the URI. The artifact is a user handle, which can be used by the SP for querying the IdP to receive a full SAML assertion. The main advantage of this approach is that the artifact is small enough to fit into the URI.

In the next step, the SP uses the artifact to query the IdP about the user. In its response, the IdP provides the assertions for the user, and the SP may then establish a secure context and respond with the requested resource.
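The artifact exchange can be sketched as follows. The artifact format and assertion content are simplified placeholders; in particular, real Liberty artifacts follow a defined structure rather than being random strings.

```python
# Sketch of the artifact profile: the IdP hands the browser a small,
# single-use handle; the SP dereferences it over a back channel.
import secrets

class ArtifactIdP:
    def __init__(self):
        self.pending = {}  # artifact -> full SAML assertion

    def login(self, user):
        artifact = secrets.token_urlsafe(8)  # small enough for a URI
        self.pending[artifact] = {"subject": user, "authenticated": True}
        return artifact                      # sent back via browser redirect

    def resolve(self, artifact):
        # Single use: the artifact is consumed on first dereference.
        return self.pending.pop(artifact, None)

idp = ArtifactIdP()
artifact = idp.login("alice")            # step: redirect carries the artifact
assertion = idp.resolve(artifact)        # step: SP's back-channel query
assert assertion == {"subject": "alice", "authenticated": True}
assert idp.resolve(artifact) is None     # the artifact cannot be replayed
```

The single-use property matters: an artifact leaking from a browser history or log is worthless once the SP has dereferenced it.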

7.6.5 Liberty and CardSpace

CardSpace (see chapter 7) is a Microsoft .NET component designed to manage digital identity. Unlike Liberty, it is not a set of standards but a product. CardSpace is mainly built on WS-* standards, but also supports some SAML features.

There are some differences between the Liberty and CardSpace approaches. Liberty specifications are defined to be used with general Web browsers, whereas CardSpace adds new functionality to the user's Web browser and to the underlying operating system (it is currently shipped with Windows Vista and the .NET Framework 3.0 and later). In practice, CardSpace moves some of the identity management logic from the identity provider to the user's computer. This results in a loss of generality, but enables the use of proof-of-possession keys, which may increase the security of token delivery. Regarding authentication, the CardSpace selector has a fixed list of authentication methods, whereas in the Liberty framework identity providers can present a customised authentication page.

In short, although the two protocols try to solve the same general problem, CardSpace has a more client-centred approach, relying on a smart, non-standard client, whereas Liberty provides a distributed and open, but more complex, framework.

7.6.6 PrimeLife and Liberty

Liberty standards are increasingly being adopted in many industries, and they play a major role in identity federation technologies. The PrimeLife consortium should evaluate them and compare them to other solutions (e.g., WS-Federation) for possible adoption.

7.6.7 Open Source Implementations

• FederID [FederID]
◦ Integrates Authentic [Authentic], LASSO [LASSO], LemonLDAP::NG [LemonLDAP] and InterLDAP [InterLDAP]
◦ Languages: Java, Perl, Python
◦ License: AGPLv3, GPL
• OpenLiberty-J [OpenLiberty-J]
• LASSO [LASSO]
◦ Implements ID-WSF, ID-FF 1.2
◦ C library
◦ License: GNU GPL
• Conor Cahill's ID-WSF tools [ID-WSF tools]
◦ Implements ID-WSF
◦ C client and Java server toolkit
◦ License: BSD license

7.7 Yadis

Yadis [Yadis Spec] is an HTTP-based protocol for authentication service discovery. It ensures compatibility between different authentication services. Currently, three such services are supported: OpenID, LID and i-Names. The goal of the initiative is to make the authentication system used transparent to the user: a user sends a Yadis-compatible identifier to the Yadis-enabled relying party (RP), which in turn is able to resolve the authentication system used by the identity provider.

7.7.1 Protocol flow

The Yadis authentication service discovery is initiated by the user supplying an identifier to the RP. This identifier must be resolvable to a URI. The process of resolving the authentication service is executed in at most three steps. The resolution process results in the RP holding a Yadis document, which allows it to use an authentication mechanism specified therein.

As the initial step, the RP issues an HTTP request to the indicated URL. This request can be either a GET or a HEAD request. The answer to the request can be a Yadis document or a Yadis Resource Descriptor URI. In case an HTTP HEAD request was issued, the response may contain no Resource Descriptor URI, in which case the RP is obliged to issue an HTTP GET request.

If the RP did not receive the Yadis document after the first request, it must request the document at the location indicated by the Yadis Resource Descriptor URI, which results in the retrieval of the Yadis document. Alternatively, if an HTTP HEAD request was issued in the first step and neither a Yadis document nor a Yadis Resource Descriptor URI was received, the second request is an HTTP GET request, whose response is handled as described for the first step.

The third step is only necessary if an unsuccessful HTTP HEAD request was the first message. As either the Yadis document or the Yadis Resource Descriptor URI must have been retrieved in the second step, this third step consists of retrieving the Yadis document at the location indicated by the Yadis Resource Descriptor URI. The resolution process terminates as soon as a Yadis document has been received.
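The three resolution steps can be sketched as follows, assuming a hypothetical `fetch` function that abstracts the HTTP layer and returns a Yadis document, a Yadis Resource Descriptor URI, or nothing (as may happen for a HEAD request):

```python
# Sketch of Yadis resolution in at most three requests. A document is
# modelled as a dict, a descriptor URI as a string; `fetch(method, url)`
# is a hypothetical stand-in for the HTTP layer.

def resolve(fetch, url, method="HEAD"):
    result = fetch(method, url)        # step 1: initial HEAD (or GET)
    if result is None:                 # HEAD revealed nothing:
        result = fetch("GET", url)     # step 2: retry as GET
    if isinstance(result, str):        # a Resource Descriptor URI:
        result = fetch("GET", result)  # step 2 or 3: fetch the document
    return result                      # the Yadis document

# Hypothetical server behaviour: HEAD reveals nothing, GET on the
# identifier yields a descriptor URI, GET on that URI yields the document.
responses = {
    ("HEAD", "https://example.org/alice"): None,
    ("GET", "https://example.org/alice"): "https://example.org/alice.xrds",
    ("GET", "https://example.org/alice.xrds"): {"services": ["openid"]},
}
fetch = lambda method, url: responses.get((method, url))
assert resolve(fetch, "https://example.org/alice") == {"services": ["openid"]}
```

In this worst case all three requests are needed; when the first GET already returns the document, resolution finishes in a single round trip.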

7.7.2 The Yadis document

An exemplary Yadis document specifying two available authentication services looks as follows (the XML structure is reconstructed here following the Yadis 1.0 specification):

  <?xml version="1.0" encoding="UTF-8"?>
  <xrds:XRDS xmlns:xrds="xri://$xrds" xmlns="xri://$xrd*($v*2.0)">
    <XRD>
      <Service priority="10">
        <Type>http://lid.netmesh.org/sso/2.0</Type>
      </Service>
      <Service priority="20">
        <Type>http://lid.netmesh.org/sso/1.0</Type>
      </Service>
    </XRD>
  </xrds:XRDS>

According to the specification, there can be several Extensible Resource Descriptor (XRD) elements, with the semantics that only the last one is taken into account. Additionally, there can be several other elements, which may be disregarded by the RP. The XRD element may contain one or several Service entries. The absence of a Service element indicates that the Yadis URI is not meant to be used with a Yadis service. Otherwise, the Service elements list the available services for authentication, where the ordering of the elements is not relevant. The preference can be indicated using the optional priority attribute, where smaller numbers refer to higher priorities; services without a priority attribute have the lowest preference level. The Service element must contain one or more Type elements, which are URIs or XRIs referring to the service specification document.
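The priority rule can be sketched as follows; services are represented as plain dictionaries, which is an illustrative simplification of the XML Service elements:

```python
# Sketch of Yadis service preference: smaller priority numbers win,
# and a missing priority attribute sorts last.

def order_services(services):
    # Services without a priority get the lowest preference level.
    return sorted(services, key=lambda s: s.get("priority", float("inf")))

services = [
    {"type": "http://lid.netmesh.org/sso/1.0"},                  # no priority
    {"type": "http://lid.netmesh.org/sso/2.0", "priority": 10},  # preferred
]
preferred = order_services(services)[0]
assert preferred["type"] == "http://lid.netmesh.org/sso/2.0"
```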

7.7.3 Trust and privacy properties

As the protocol only serves for service discovery, there are no privacy options in Yadis. The Yadis document, however, allows for elements other than the mandatory XRD element. In the current specification the interpretation of such elements by the RP is optional.

7.7.4 Opportunities for PrimeLife

Yadis is a small protocol enabling the transparent use of different authentication systems. The current limitation of Yadis is that the identifier used by the authentication mechanism must be resolvable to a URI. If that limitation does not restrict the authentication mechanism proposed by the PrimeLife project, a collaboration with the Yadis project could speed up the market adoption significantly. The implementation overhead seems to be minimal.

The Yadis protocol specification was released as version 1.0 and dates from March 2006.

7.7.5 Specification development

Yadis was developed by members of the OpenID and the LID community. In October 2005 the i-Names initiative joined the project.

7.7.6 Open Source Implementations

The project maintains a list of Yadis Implementations [Yadis Implementations].

Chapter 8 Specification Developing Organisations

8.1 W3C

The World Wide Web Consortium [W3C] is an international consortium with over 400 member organisations from more than 40 countries. W3C members include vendors of technology products and services, content providers, corporate users, research laboratories, standards bodies, and governments, all of whom work to reach consensus on a direction for the Web.

W3C's main role is to standardise Web technologies by creating and managing Working Groups that produce specifications (called Recommendations) describing the building blocks of the Web. W3C makes them freely available to all, as part of the open Web platform. Working Group participants are individuals from member companies, or invited experts selected for their expertise in the area. W3C also opens Interest Groups to bring together people who wish to evaluate potential Web technologies and policies. An Interest Group is a forum for the exchange of ideas. The W3C technical team contributes to and coordinates the activities in the various fields: Web Architecture, Protocols, Web applications, Ubiquitous Web, Interaction, Web Services, Semantic Web, Privacy and Security, Web Accessibility, Internationalisation.

Particularly relevant W3C work for the PrimeLife project includes:

• The Platform for Privacy Preferences (P3P 1.0 [P3P 1.0 Spec], P3P 1.1 [P3P 1.1 Spec])
• APPEL [APPEL]
• XML Signature (XML Signature Syntax and Processing Recommendation [XML Sig], in collaboration with IETF (see chapter 8))
• XML Encryption (XML Encryption Syntax and Processing Recommendation [XML Enc], Decryption Transform for XML Signature [XML Sig Transform])
• Web Services Policy (Web Services Policy Framework [WS-Policy 1.5], Web Services Policy Attachment [WS-PolicyAttachment 1.5])

Currently, three Groups are specifically involved in Privacy and Security for the Web:

• Web Security Context Working Group (WSC [WSC])
• XML Security Specifications Maintenance Working Group (XMLSec [XMLSec])
• Policy Languages Interest Group (PLING [PLING])

Chapter 1 also describes a number of key technologies for the Web, at the core of W3C work since its creation.

8.2 IETF

The Internet Engineering Task Force [IETF] is a community of international network experts. Participation is individual, but most participants are sponsored by organisations or companies: network operators, vendors, research centres, etc. There is no formal IETF membership; anyone can register for and attend the IETF meetings.

The Requests For Comments [RFC] produced by the IETF community fall within the following scope: "protocols and practices for which secure and scalable implementations are expected to have wide deployment and interoperation on the Internet, or to form part of the infrastructure of the Internet."

The IETF operates Working Groups in 8 Areas, supervised by two entities: the Internet Engineering Steering Group (IESG) [IESG] and the Internet Architecture Board (IAB) [IAB]. These Areas are:

• Applications (APP), including the Web
• General (GEN)
• Internet (INT), core technologies of the Internet
• Operations and Management (OPS)
• Real-time Applications and Infrastructure (RAI)
• Routing (RTG)
• Security (SEC)
• Transport (TSV)

The relevant IETF work for PrimeLife pertains to the Security Area [IETF SEC], though the Applications Area may not be totally irrelevant to the project's target. The Security Area consists of 16 Working Groups and covers protocol-level security mechanisms, such as TLS, SASL, Kerberos, X.509, S/MIME and DKIM.

8.3 OASIS

The Organisation for the Advancement of Structured Information Standards [OASIS] perceives itself as "a not-for-profit consortium that drives the development, convergence and adoption of open standards for the global information society." The organisation was founded in 1993 and currently has "more than 5,000 participants representing over 600 organisations and individual members in 100 countries."

The OASIS specifications which are of most interest for PrimeLife are WS-Trust [WS-Trust 1.3], WS-Federation (see chapter 7) (work in progress) and WS-SecurityPolicy [WS-SecurityPolicy 1.3]. They are publicly available and have broad industry support. The core specifications have been around since 2002. The base specifications, such as XML Signature, are robust, mature and well adopted.

These specifications depend on open standards such as SOAP Version 1.2 [SOAP12], XML Signature [XML Sig], XML Encryption [XML Enc], etc. They are stable specifications with rich implementation support from multiple vendors. They are defined as XML schemas, and many of them are ratified by OASIS. Compatibility is likely given their widespread adoption.

The IPR on this set of specifications is primarily governed by the OASIS IPR policy [OASIS IPR]. In general, the mentioned OASIS specifications are well founded and established both in industry and academia.

8.4 Liberty Alliance

Liberty Alliance [Liberty] is a business alliance, formed in 2001, with the goal of defining and driving open technology standards, privacy and business guidelines for federated identity management. Its 160 members are vendor companies, consumer companies, government and education organisations. In addition, Special Interest groups are open communities, and OpenLiberty.org releases open source code for security and privacy.

The mission behind this alliance is to

• provide open standards and business guidelines for federated identity management spanning all network devices,
• provide open and secure standards for single sign-on with decentralised authentication and open authorisation,
• allow users to maintain personal information more securely, and on their terms.

The Liberty architecture is composed of 3 independent modules:

• Liberty Identity Federation Framework (ID-FF) (see chapter 7) is the basis of Liberty Single Sign-On and Federation framework.

It enables identity federation and management through features such as identity/account linkage, simplified sign-on, and simple session management. It was originally based on SAML 1.1 and has since been integrated into SAML 2.0.

• Liberty Identity Web Services Framework (ID-WSF) [ID-WSF] is Liberty's federation framework for Web services, allowing providers to share users' identities in a permission-based model. This framework offers features like Permission-Based Attribute Sharing, Identity Service Discovery (to discover identity and attribute providers), the Interaction Service (a mechanism to obtain permissions from a user) and the associated security profiles.
• Liberty Identity Services Interfaces Specifications (ID-SIS) [ID-SIS] defines service interfaces for each identity-based Web service so that providers can exchange different parts of an identity in an interoperable way.

The ID-SIS is a set of specifications for interoperable services built on top of ID-WSF.

The Liberty Alliance also evaluates adoption of its specifications (see Case studies [Liberty Adoption]).

8.5 TCG

The Trusted Computing Group (TCG) [TCG] is an industry standardisation body that aims to develop and promote open industry standards for trusted computing hardware and software building blocks, enabling more secure data storage, online business practices, and online commerce transactions while protecting privacy and individual rights.

The core hardware module specified is the so-called TPM, the Trusted Platform Module. The TPM is a microcontroller that stores keys, passwords and digital certificates. It is typically affixed to the motherboard of a PC, but can potentially be used in any computing device that requires these functions. The nature of this silicon makes the information stored in it more secure against external software attack and physical theft. Security processes, such as digital signing and key exchange, are protected through the secure TCG subsystem. Access to data and secrets in a platform can be denied if the boot sequence is not as expected. Critical applications and capabilities such as secure email, secure Web access and local protection of data are thereby made much more secure.
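The boot-sequence check rests on the TPM's Platform Configuration Registers (PCRs): each boot stage measures the next and extends a PCR, so any tampering changes the final value. The following sketch illustrates a TPM 1.2-style extend operation; the boot components are invented for illustration.

```python
import hashlib

def extend_pcr(pcr: bytes, measurement: bytes) -> bytes:
    """TPM 1.2-style extend: new PCR = SHA-1(old PCR || SHA-1(measurement))."""
    return hashlib.sha1(pcr + hashlib.sha1(measurement).digest()).digest()

# Measure a hypothetical boot chain: each stage hashes the next before running it.
pcr0 = bytes(20)  # PCRs start zeroed at platform reset
for component in [b"BIOS", b"bootloader", b"kernel"]:
    pcr0 = extend_pcr(pcr0, component)

# A verifier that knows the expected components can recompute the value;
# any change in any stage yields a different PCR, so access can be denied.
expected = bytes(20)
for component in [b"BIOS", b"bootloader", b"kernel"]:
    expected = extend_pcr(expected, component)
assert pcr0 == expected
```

Because the extend operation is one-way and order-dependent, a platform cannot "unmeasure" a component once it has run, which is what anchors the trust chain in hardware.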

The TPM is queried by the surrounding modules so that the decision process always involves the hardware-secured trust chain:

Figure 4: TCG Architecture

This generic architecture can be used in many use cases and circumstances. The following roadmap describes the main use cases targeted by TCG, but there are many more, as any kind of electronic device can benefit from the trust chain.

Figure 5: TCG map of documents

As can be seen from this roadmap, TCG targets different computing platforms, including PCs, PDAs and mobile phones. It helps prevent those in possession of a device, or those attacking it, from taking control of or spying on data secured by TCG technology. In fact, the TPM controls secured memory that is only accessible to those holding the right keys and information. Building the chain up from the hardware ensures that the chain of trust is not interrupted.

One main application of TCG technology is securing storage devices. As the storage device itself is secured by cryptographic means, the policy control engine can enforce decisions on disk/storage access. Initially developed mainly for rights management (DRM), this can also be used to enforce constraints coming from identity management.

Another interesting application for PrimeLife is the suite of network access control specifications. It uses the TPM and the controlled, protected memory of a device to determine the device's status and report it back to a console. Security policies can then determine whether to allow such a device onto a given network. Again, this leverages the TPM and the trust chain established cryptographically from the hardware up to the application.

Trusted Computing is an important tool in the PrimeLife toolbox. It allows enforcement even within an infrastructure that is not in the actual possession of the data subject or the data controller. PrimeLife may provide the necessary semantics describing where data may be disclosed or copied, while a TCG infrastructure can ensure that an enterprise actually follows those constraints.

There are also caveats in using TCG. When using the TCG platform with its TPM, the entity controlling the TPM effectively takes over control of large parts of the data controller's infrastructure. While technically possible, this is not a compelling argument for enterprises wanting to implement and support end-user identity management. Furthermore, the strong cryptographic link is mostly used for strong authentication. Such identification has merits for data quality and security, which are also part of the data protection world, but it is only usable in cases where the identification of a user is part of the scenario that PrimeLife is trying to solve.

8.6 ISO/IEC JTC 1

8.6.1 ISO/IEC JTC 1/SC 27/WG 5

Considering the promising new ways in which we use technologies in our daily lives, and the important challenge of handling an individual's identity and personal information appropriately in the process, SC 27 established its new WG 5 on Identity Management and Privacy Technologies in May 2006. Currently WG 5 is active in seven projects and two Study Periods, with more expected. The seven projects are as follows:

• Working Draft 24745, Biometric template protection, describes security techniques for biometric template protection, focusing on privacy-enhancing techniques for biometric template generation.
• A Framework for Identity Management (Working Draft 24760) addresses the secure, reliable, and privacy-respecting management of identity information, considering that identity management is important for individuals as well as organisations, in any environment and regardless of the nature of the activities they are involved in.
• The project on Authentication Context of Biometrics (Final Draft International Standard 24761) defines the structure and the data elements of authentication context for biometrics, by which service providers (verifiers) can judge whether a biometric verification result is acceptable or not.
• A Privacy Framework (Working Draft 29100) is to provide a framework for defining privacy safeguarding requirements as they relate to personally identifiable information processed by any information and communication system in any jurisdiction.
• A Privacy Reference Architecture (Working Draft 29101) is to provide a reference architecture model describing best practices for a consistent technical implementation of privacy requirements in information and communication systems.
• Entity Authentication Assurance (Working Draft 29115) aims at describing the guidelines or principles that must be considered in entity authentication assurance and the rationale for why it is important to an authentication decision.
• A Framework for Access Management (New Project 29146) aims to provide a framework for the definition of access management and the secure management of the process to access information. This framework is to be applicable to any kind of user, individuals as well as organisations of all types and sizes, and should be useful to organisations at any location and regardless of the nature of the activities they are involved in.

The two study periods deal with Privacy Capability Maturity Models and Access Control Mechanisms.

It should be mentioned that other Working Groups in SC 27 also maintain a number of projects relevant to privacy, most notably ISO/IEC JTC 1/SC 27/WG 3 Security evaluation criteria, which is responsible for IS 15408 Evaluation Criteria for IT Security and its Privacy Class covering anonymity, pseudonymity, unlinkability, and unobservability.

With 51 member countries, ISO/IEC JTC 1/SC 27 has an immense global outreach. At the same time, WG 5 has a significant topical overlap with PrimeLife and combines openness to Privacy and Identity Management aspects with a solid foundation in IT Security. Therefore PrimeLife is establishing a liaison with ISO/IEC JTC 1/SC 27/WG 5.

8.6.2 ISO/IEC JTC 1/SC 17/WG 4

A core activity in WG 4 is the maintenance and regular technology upgrade of the ISO/IEC 7816 series of standards for smart cards. Key parts of this series are:

• ISO/IEC 7816-4 (2005) [ISO/IEC 7816-4:2005] This standard defines the basic model of operation of a smart card as a server responding to standard-formatted requests. It defines a set of standardised requests for:
◦ opening one or more sessions with different applications that can co-reside on the card;
◦ access to and management of stored data;
◦ identification of the card holder by PIN or biometrics;
◦ performing algorithm-neutral cryptographic entity authentication;
◦ establishing algorithm-neutral end-to-end transport-layer security to protect the integrity and/or confidentiality of requests.

(See also http://www.cardwerk.com/smartcards/smartcard_standard_ISO7816-4.aspx)

• ISO/IEC 7816-8 (2004) [ISO/IEC 7816-8:2004] This standard specifies a set of requests to perform cryptographic operations, both for symmetric and asymmetric cryptographic algorithms, with secret keys stored on the card.
• ISO/IEC 7816-11 (2004) [ISO/IEC 7816-11:2004] This standard specifies a set of requests and associated data structures for the use of biometric data on a smart card.
• ISO/IEC 7816-13 (2007) [ISO/IEC 7816-13:2007] This standard specifies a set of requests to manage the content on the card, specifically to load and remove executable code for different applications. The requests specified by this standard are based on the Global Platform specifications [Global Platform Specs].

The parts of the ISO/IEC 24727 series of international standards are still at various stages of preparation. They cover:

• An architecture for applications on a card terminal to access card-based services;
• An API for access to card-based services, primarily for user identification and card data access;
• Management and security of access to card-based services.

Relevant results of PrimeLife for card-based PET could be incorporated in the ISO/IEC 7816 series, parts 4 or 8.

8.6.3 ISO/IEC JTC 1/SC 17/WG 11

A biometric system always comprises two phases: biometric reference data is collected from the user group in a registration phase, and current biometric data is compared with this reference data in the verification phase. Usually, the reference data is stored in a database or on a portable data carrier such as a smart card. Match-on-card technology performs the comparison of the biometric data within the smart card, which means that the critical reference data never has to leave the smart card chip. This enhances the security and privacy of any application requiring the security status of the comparison in the card.

SC 17/WG 11 was founded in 2005 to address applied biometrics on cards. Its main focus is the standardisation of on-card matching. The document ISO/IEC 24787 addresses the architectures, parameters and usage of on-card matching. It supplements the existing standardisation landscape and gives guidance to application programmers to understand, implement and make use of match-on-card technology. The group should be supported because its outcome is a set of documents that help raise privacy in daily life, particularly for digital identities used online.
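The match-on-card principle can be sketched as follows: the reference template lives inside the card and only a match/no-match decision ever crosses the card's interface. The similarity measure and threshold below are invented for illustration and bear no relation to real biometric comparison algorithms.

```python
class MatchOnCard:
    """Toy sketch of match-on-card: the reference template is stored inside
    the card and only a match/no-match decision ever leaves it."""

    def __init__(self, reference: list[float], threshold: float = 0.9):
        self._reference = reference      # never exposed through the card's interface
        self._threshold = threshold

    def verify(self, sample: list[float]) -> bool:
        # Placeholder similarity measure; real cards run vendor-specific
        # comparison algorithms on minutiae or similar feature data.
        dist = sum(abs(a - b) for a, b in zip(self._reference, sample))
        score = 1.0 / (1.0 + dist)
        return score >= self._threshold

card = MatchOnCard([0.2, 0.5, 0.9])
assert card.verify([0.2, 0.5, 0.9])      # matching sample: only "yes" leaves the card
assert not card.verify([0.9, 0.1, 0.3])  # non-matching sample
```

The privacy benefit is structural: because `verify` returns only a boolean, a compromised terminal learns nothing about the stored template itself.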

8.6.4 ISO/IEC JTC 1/SC 37

SC 37 was founded when the industry acknowledged that a significant level of standardisation is required to achieve acceptance of biometric technology in open mass markets and applications such as e-passports. The group addresses various aspects such as vocabulary, interfaces, data formats, testing and application profiles as well as jurisdictional aspects of biometrics.

Raising the knowledge of biometrics amongst users and customers is one of the most important subjects for leading biometric vendors. Biometrics are not as secret as cryptographic keys. Biometric data, however, must be considered confidential personal information and has to be protected against misuse. From the viewpoint of privacy, the standardisation activities should be observed and influenced where necessary to protect users' rights over their individual data.

A variety of standards addressing biometrics are available today. Most standards can be purchased through the national standardisation committees. Documents currently under revision can only be accessed by the ISO committee.

Chapter 9 Selected Open Source Projects

There is a large number of open source projects that deal with privacy in one way or another. Some of these projects provide implementations of standards; these we describe in the respective section about the standard. Refer to SAML (see chapter 7), Liberty Federation (see chapter 7), XACML (see chapter 3) or CARML / Oracle IGF (see chapter 4) for details. In this section we summarise other projects.

There are a number of projects which aim at driving interoperability between the different implementations or the different protocols. Some of these are described in detail in the sections on OpenID (see chapter 6), Liberty (see chapter 7), CardSpace (see chapter 7) and Higgins (see chapter 7). Several other projects worth mentioning are briefly described below.

9.1 MozPETs: Mozilla Privacy Enhancement Technologies

MozPETs [MozPETs] is a project that was started by the IT Transfer Office's [ITO] Prima Project [Prima] at the Technical University of Darmstadt. It was partly supported by research funding from the European Commission. TU Darmstadt closed the IT Transfer Office in 2006, so there is no longer institutional support for MozPETs. An inactive Sourceforge project [MozPETSSourceforge] remains.

The goal of MozPETs was to integrate all existing Privacy Enhancing Technologies into one browser. The project used the Mozilla open source browser and added all kinds of filters. When they subsequently tried to use the Web with the heavily privacy-enhanced browser, they failed. They documented their experiences in a paper published in 2005 called MozPETs - a Privacy enhanced Web Browser [MozPETsPST05]. Of most interest is their analysis:

Privacy research has been done on a lot of different technologies. Most research focused on anonymity for network access, publishing, authorisation, and payment. E-mail fraud and phishing have led to increased research on technical countermeasures. Data licenses were proposed to guarantee access rights to personal information. An identity management system combines privacy enhancement and security technologies to minimise the disclosure of personal information, and provides the user with means to make informed decisions whether certain data is disclosed or not. Current identity management tools differ a lot in features and scope.

However, the Web looks different. Privacy-invasive technologies, such as tracking users with Web bugs and third-party cookies across multiple sessions and different sites, are common practice among site operators. Data mining is a key element of many commercial sites' business plans. In general, the privacy and security features of today's Web browsers are the same as those of Netscape Navigator 6, released in late 2000. The user can modify the cookie policy, and wallet components store logins, passwords and other personal information. Many security tutorials advise users to disable cookies and active content, including JavaScript. Applying these settings makes the Web nearly unusable, as most sites will not work correctly. After this experience the disappointed user will restore the old settings.

MozPETs also tried to use P3P to give better information to the user through the iJournal. The iJournal tries to detect certain types of personal information within the user's input and looks for matching parts of the server's P3P policy. With this information the user can evaluate whether he really wants to submit the data. Instead of a user policy, the iJournal gives the user context information for this specific transaction. It also stores a copy of the relevant policy to have it available for later disputes, as was also done in the PRIME [PRIME] prototype.

As the Mozilla project is now focused on the Firefox browser, an investment in further development of MozPETs makes little sense. Instead, PrimeLife will rather look into the powerful extension mechanism of Firefox, though some MozPETs code may be recovered. The important conclusion from this very practical exercise is that Privacy Enhancing Technologies have to take Web functionality into account to keep the browsing experience at a decent level. Usability considerations have to take the Web's reality into account and weigh the tradeoff between functionality and privacy preservation.

9.2 Firefox Plugins

Firefox plugins provide a variety of functionality for the browser. In this section the focus lies on the category of Privacy & Security [Firefox Addons]. The most interesting plugins can be grouped into three subcategories: Identity Management, Privacy and Trust. The Identity Management category describes addons allowing for easier handling of different digital identities. The Privacy category summarises efforts allowing for better protection of personal data, and the Trust category covers work on visualising trust relationships between users and Web sites.

9.2.1 Formfiller and Identity Management Enhancement

As mentioned in the section on Firefox Plugins in PRIME (see chapter 10), there are Firefox plugins which provide privacy enhancing technologies or allow for easier management of personal information. One example is the form-filling extension Autofill Forms [Autofill Forms], published under the Mozilla Public License. This plugin provides automatic form filling based on pattern matching between its internal field names and the names of the fields in the form. The plugin's profile can be completed for several personas, and on each use the user can decide which persona is to be applied.
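The pattern-matching approach can be sketched as follows. The persona fields and regular expressions below are invented for illustration; they are not Autofill Forms' actual rules.

```python
import re

# Hypothetical persona and matching rules (invented for illustration).
persona = {"name": "Alice Example", "email": "alice@example.org", "zip": "12345"}
patterns = {
    "name": re.compile(r"(full.?)?name", re.I),
    "email": re.compile(r"e.?mail", re.I),
    "zip": re.compile(r"zip|post(al)?.?code", re.I),
}

def fill(form_fields: list[str]) -> dict[str, str]:
    """Map each form field name to a persona value by pattern matching."""
    filled = {}
    for field in form_fields:
        for key, pattern in patterns.items():
            if pattern.search(field):
                filled[field] = persona[key]
                break
    return filled

print(fill(["fullName", "E-Mail", "postal_code", "comment"]))
```

Fields that match no pattern (like "comment" above) are simply left empty, which is also why such heuristics occasionally misfire on unusually named forms.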

Another plugin for the Firefox browser is called Sxipper [Sxipper]. Similar to Autofill Forms, the form-filling functionality is based on predefined personas. Sxipper, however, does not try to detect which field of the online form matches which field of the persona. To enable automatic form filling on a particular site, one user (a so-called 'trainer') has to enter his personal information into the form manually. Thereafter, Sxipper matches the fields of the form against the most appropriate persona of this user and generates an assumed mapping. This mapping can subsequently be used by other users throughout the Internet to fill in the form automatically. In addition to the administration of personas, Sxipper offers password management functionality, using the Firefox password manager as a secure credential store. Sxipper can also handle OpenIDs, which means that on an OpenID-enabled site the user is presented with her OpenIDs, allowing login with a single click. The OpenID functionality is accompanied by phishing protection that detects unusual redirects and warns the user. This plugin is free but not open source. It will be complemented by a commercial version offering features, such as a disposable e-mail address, that will not be available in the free version. The version as of April 2008 nonetheless contains the functionality of the commercial version, as those functions are still in beta state.

Verisign's SeatBelt [SeatBelt] provides OpenID support, which essentially means that it detects whether a visited Web site is OpenID-enabled. If so, the user is logged in automatically. A menu button makes it possible to change the OpenID with which the login is executed. SeatBelt provides phishing protection similar to Sxipper's.

There are more plugins for simple form filling. InFormEnter [InFormEnter] adds clickable icons next to form fields. Those icons allow choosing from the different contents the user has actively stored before. For example, the user can store his personal information using the plugin; at any later point in time the same information is available by clicking the icon next to the field. This plugin is hardly an improvement over the autocomplete function provided by Firefox.

AutoFormer [AutoFormer] is a simple tool for saving form information entered into one page and making it reusable. The user chooses to save the data entered into an online form; this data is then saved in a cookie and is thus available at a later point in time. A severe drawback of this approach is that restrictive handling of cookies also restricts the plugin.

9.2.2 Privacy Enhancement

While the extensions described above deal mostly with identity management and usability issues, other extensions focus more on privacy enhancements. One such extension is CookieSafe [CookieSafe], which aims at powerful yet simple management of cookies. The extension offers detailed control over cookies, extending the capabilities Firefox offers in this domain. For example, a general cookie handling policy can be defined in addition to rules for each Web site individually. It is even possible to modify retrieved cookies or to define cookies from scratch. A newly designed, reduced set of functions is published as CS Lite [CS Lite], whose code is currently used as the basis for the next version of CookieSafe. CookieSafe and CS Lite are published under the GNU General Public License.
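A per-site cookie policy with a global fallback, as CookieSafe provides, can be sketched like this; the rule names and host names are invented and do not reflect the plugin's actual configuration format.

```python
# Per-site rules override a global default; the most specific match wins.
DEFAULT_POLICY = "block"
site_rules = {
    "example.org": "allow",
    "tracker.example.net": "block",
    "shop.example.com": "session-only",
}

def cookie_policy(host: str) -> str:
    """Walk from the full host name up to its parent domains and return the
    first rule found, falling back to the global default."""
    parts = host.split(".")
    for i in range(len(parts) - 1):
        candidate = ".".join(parts[i:])
        if candidate in site_rules:
            return site_rules[candidate]
    return DEFAULT_POLICY

assert cookie_policy("www.example.org") == "allow"    # inherited from example.org
assert cookie_policy("unknown.example.io") == "block" # global default applies
```

The suffix walk is what lets one rule for "example.org" cover all of its subdomains without the user listing each of them.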

Stealther [Stealther] is provided as freeware and suppresses the traces that browsing leaves on a computer. Essentially, all functions which allow retrieval of browsing behaviour are temporarily suspended. Examples of suspended functionality are the browsing history, cookies, form-filling information, the sending of referrer headers, and cached data. In contrast to other tools, this plugin does not affect any information stored before the activation of Stealther. After deactivation via a menu button, data retention is reset to the default.

The functionality spectrum of DisTrust [DisTrust] is almost the same as Stealther's. Upon activation, DisTrust starts a session and records which actions the user takes. At deactivation, the plugin deletes the cookies set during the session, cleans up the history, cleans up the list of downloaded items and the respective items, and deletes stored form information. In addition, the cache is disabled during such a DisTrust session. The actions of the plugin can be adjusted to personal preference in the options menu.

There are also plugins allowing for better usability of the functionality provided by TOR. One such plugin is FoxTor [FoxTor], which enables the user to activate and deactivate TOR using a toolbar; its latest version was published in November 2006. The TOR project recommends the Torbutton plugin [Torbutton], which also provides a button for enabling and disabling the use of TOR for browsing. Unfortunately, the Torbutton plugin does not provide feedback when initiating TOR usage fails. Nonetheless, the plugin supports the anonymisation initiative of TOR by, e.g., stopping other plugins which might add identifying information, clearing cookies and the cache, spoofing the timezone and other local values, and preventing identification via JavaScript or CSS techniques.

A simpler approach which provides a certain anonymity when browsing the Web is the use of a proxy. All connections going through the proxy appear to come from one origin, which effectively hides each user among all users of the proxy. PhProxy - InBasic [PhProxy] makes use of such an approach: if a user wants to contact a server via the proxies defined in PhProxy, she simply prepends the address with PH::, which indicates use of the proxy. The benefit of such an approach is that all users of one proxy are anonymous, the anonymity set being all users of the respective proxy. Problematic is that the proxy itself can deanonymise the users, which implies that the users must trust their proxy.
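The PH:: convention amounts to a simple URL rewrite. The sketch below illustrates the idea; the proxy URL and its parameter name are invented, not PhProxy's actual endpoint.

```python
from urllib.parse import quote

# Hypothetical Web-proxy endpoint (invented for illustration).
PROXY = "https://proxy.example.net/browse.php?u="

def rewrite(address: str) -> str:
    """Rewrite PH::-prefixed addresses to go through the proxy; pass others through."""
    if address.startswith("PH::"):
        return PROXY + quote(address[len("PH::"):], safe="")
    return address

assert rewrite("PH::http://example.org/") == \
    "https://proxy.example.net/browse.php?u=http%3A%2F%2Fexample.org%2F"
assert rewrite("http://example.org/") == "http://example.org/"
```

From the target server's point of view, every rewritten request originates at the proxy, which is exactly what creates the anonymity set described above.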

A possible way of interfering with the privacy of a user is by analysing behaviour. In particular, this is a promising approach for search engines, whose analysis of their users' search queries can lead to an identifiable person; one such case is described in a New York Times article [AOL4417749]. TrackMeNot [TrackMeNot] tries to hide the queries of a user by sending randomised queries to the big search engines, namely AOL, Google, MSN and Yahoo!. While a first version of TrackMeNot used a static list to generate the queries, the most recent version issues queries derived from actual queries issued by the user, trying to extrapolate possible future queries from those searches. The ideal behaviour would probably be different: the plugin should arguably search for terms that are orthogonal to the user's interests in order to best protect the user's privacy.

Yet another threat to the privacy of a user is the use of HTTP referrer headers. Those headers can signal from where a user reached a Web site. RefControl [RefControl] allows controlling which sites may receive HTTP referrers and which sites are not allowed to do so.

9.2.3 Trust Enhancement

A plugin called Web Of Trust [WOT] takes a step towards establishing a trust relation between a new user of a page and the page itself. The assumption is that all users knowing and using a Web site should rate the page. A user stumbling upon a Web site can see the trust ratings the page has been given and, if the rating suggests it, adapt her behaviour accordingly. A principal idea behind this plugin is that 'a large enough crowd is improbable to lie'. Consequently, a user might mistrust the rating if only very few people have participated, whereas a clear vote in favour of the trustworthiness of a page is very probably a true statement. A user can make her confidence dependent on the confidence rating given by the plugin. In addition to the user ratings, the plugin uses lists of trustworthy sites. Currently four topics can be rated: trustworthiness, vendor reliability, privacy and suitability for children. This plugin gets especially interesting when searching the Web using Google, MSN or Yahoo, where small WOT icons are shown next to the search results.
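The combination of a rating with a participation-based confidence value can be illustrated with a small sketch. WOT's actual aggregation algorithm is not public; the formula and the threshold below are invented purely to show the principle that confidence should grow with the number of voters.

```python
def aggregate(votes: list[int], full_confidence_at: int = 50) -> tuple[float, float]:
    """Return (mean rating, confidence); confidence saturates at 1.0 once
    `full_confidence_at` users have voted. Illustrative only."""
    if not votes:
        return 0.0, 0.0
    rating = sum(votes) / len(votes)
    confidence = min(1.0, len(votes) / full_confidence_at)
    return rating, confidence

# Two sites with the same average rating but very different crowd sizes:
assert aggregate([80, 100])[1] < aggregate([90] * 60)[1]
```

This captures the "large enough crowd" idea: the same average of 90 is presented with near-zero confidence when only two users voted, and with full confidence once sixty have.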

9.2.4 Other Firefox Plugins

• BlockSite [BlockSite] allows blocking specific sites. The blocking affects links to the specified site as well as names entered directly into the address bar. To make it easier to block a range of sites, the plugin allows the use of wildcards.
• BugMeNot is a site offering username/password combinations for certain sites. The BugMeNot [BugMeNot] plugin allows easy access to this service: a user presented with a login screen can use the plugin to acquire a valid username/password combination. BugMeNot also offers temporary e-mail accounts, which can be given when creating an account in order not to provide the real e-mail address. Such a temporary e-mail inbox is also made available by another Firefox plugin called Temporary Inbox [Temporary Inbox]. Both services relieve the user of the hassle of providing personal information to each and every service on the Internet.
• SafeCache [SafeCache] separates cache segments from each other. This hinders attacks that read or write cache allocated by another Web site, which could expose information to an unauthorised Web site. The latest update of this plugin dates back to November 23, 2006.

Some additional plugins are quite interesting, e.g., Roboform [Roboform], but they are neither open source nor free, so we do not describe them further here.

9.2.5 Opportunities for PrimeLife

The Firefox extension mechanism should definitely be considered by the PrimeLife project, as it allows the implementation of powerful functionality. Additionally, very fast deployment can be achieved, as there is already a quite large community using this browser. The rather large amount of available open source code is yet another argument in favour of producing Firefox plugins. Still, such plugins are inherently limited to Web-based services.

9.3 TOR

The TOR project [TOR] enables anonymous use of TCP-based services. The TOR software tunnels the user's data through a chain of encrypted tunnels that connect multiple intermediate relays. The protocol's design ensures that each node in the chain only knows its immediate neighbours, and (with the exception of the entry and exit nodes, respectively) can discover neither the origin, destination, nor content of the user's data stream.

The project's focus is on usability and easy installation. In terms of interfaces, the TOR client acts as a SOCKS proxy that can be used by most common TCP-based applications without additional modification.

While TOR does lead to anonymous communication that is protected against traffic analysis for a number of common attack scenarios, it does not by itself assist users in keeping personal information private that is passed through the anonymous data channel: Solving that issue is left to additional application-level software, such as Privoxy (see chapter 9).

Torbutton [Torbutton] is a Firefox plugin that enables users to control easily whether or not they wish to use TOR.

The TOR project runs under the auspices of a non-governmental organisation. Software development is performed by a number of project employees, contractors, and contributing volunteers. The software itself is available as open source under a BSD-style license.

9.4 Privoxy

Privoxy [Privoxy Homepage] is a filtering tool implemented as a proxy. The HTTP stream passes through its filters, and protocol or other information is stripped or altered based on ordinary pattern matching. Privoxy is based on Junkbuster, one of the first well-known filtering proxies, developed in 1998 by Jason Catlett, who was one of the most prominent critics of P3P around 2001 and 2002. The Junkbuster software has not been updated since 1998, according to Wikipedia [Wikipedia] and the Privoxy copyright notice [Privoxy]. Catlett moved on to develop Guidescope [Guidescope], an advertisement blocker. As of 8 April 2008, Guidescope recommends the Adblock Plus Firefox plugin [Adblock Firefox plugin].

Privoxy is still under development and available on all major platforms. It has a new architecture and is currently maintained by Fabian Keil and David Schmidt. All of these tools mainly aim to block cookies and advertisements and to defend against the abuse of pop-up windows.

The TOR project suggests Privoxy as an additional content filter. While TOR ensures that the routers and the server do not see the origin of a request, Privoxy can be configured to ensure that the transferred content does not reveal the user's identity. This, however, requires a rather high level of knowledge about the techniques Web sites use to extract personal information from their users. Privoxy can stop a leak, but only if the user knows how the personal information is supposed to leak: it has no semantic ability to distinguish sensitive from non-sensitive information, as it operates on protocol-level string matching.
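The kind of protocol-level string matching Privoxy performs can be illustrated with a minimal sketch. The rules below are invented for illustration; Privoxy's real actions and filter files are far richer:

```python
import re

# Hypothetical ruleset in the spirit of Privoxy's actions/filters:
# header names matching a pattern are stripped from the HTTP stream
# on a purely syntactic basis, with no understanding of the content.
STRIP_HEADERS = [re.compile(p, re.I) for p in (r"^Referer$", r"^Cookie$", r"^From$")]

def filter_request_headers(headers):
    """Drop headers matching any strip rule; no semantic analysis is done."""
    return {name: value for name, value in headers.items()
            if not any(rx.match(name) for rx in STRIP_HEADERS)}

headers = {"Host": "example.org", "Referer": "http://private.example/profile",
           "Cookie": "session=123", "User-Agent": "Mozilla/5.0"}
print(filter_request_headers(headers))
```

Note that the filter happily forwards any personal information carried in the request body or URL; this is exactly the limitation discussed above.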

PrimeLife may use Privoxy against specific, identified invasive techniques. This would also require the elaboration of complex rulesets to cope with the complexity of today's social network systems. As only carefully preset rules will have the desired effect, feedback to the user is important to avoid the experience of a magic black box that increases the user's feeling of powerlessness. Privoxy's nature as a proxy imposes intrinsic limitations on the way feedback can be given to the user. These limitations may hinder the establishment of the controls necessary to implement the goals of a data protection philosophy.

9.5 Invisible Internet Project (I2P)

The Invisible Internet Project [I2P] is an underlying network layer (actually several encryption layers) designed to work with Internet applications for which anonymity and privacy are critical. It uses encrypted tunnels built between peers known as garlic routers. The network of routers is an overlay architecture with cryptographic keys as identifiers. In consequence, the identities of both ends of a communication can be protected and the content of the communication itself is secured, while a regular proxy-based solution would protect only one end. Most TCP-based exchanges can be streamed into I2P tunnels.

9.6 Bandit

The goal of the Bandit project [Bandit] is to provide an open source framework for authentication, authorisation and auditing. To achieve these goals, the project aims to provide simple access to identity stores, support diverse authentication systems, provide an interface for role-based system access, and ease common compliance tasks for applications.

The Bandit project is composed of components taken from open source projects. The components are:

• OpenXDAS (Auditing)
• CASA (Credential Store)
• Bandit Common Identity/Higgins IdAS
• Identity Selector Service (DigitalMe)

OpenXDAS is the auditing system used by Bandit. Compliance with corporate governance implies past and future compliance: past compliance is ensured by searching the database for compliance deviations, while future compliance can be analysed by looking at requests that are issued but not actually carried out. OpenXDAS is built upon the Open Group's Distributed Auditing System (XDAS).

Common Authentication Services Adapter (CASA) is the infrastructure used to store identity information. It also allows administering and retrieving data from credentials or other secure data stored in KWallet, the Firefox Password Manager, GNOME Keyring and SecretStore.

Identity information can be stored in many different ways. To be able to compare, query or authenticate against several, possibly heterogeneous systems, a component extracts the identity information. The Bandit project used to call this component Identity Abstraction and renamed it to Common Identity Provider. The Bandit project contributes to the Higgins Identity Attribute Service (IdAS) and subsequently uses this service for handling disparate identity stores. The Common Identity component is maintained for backward compatibility.

The DigitalMe set of components allows the user to interact with an InformationCard-compatible service. Currently, this interaction occurs by accessing an InformationCard-enabled service using the Web browser.

Opportunities for PrimeLife

The boundaries between the Bandit project and Higgins are becoming increasingly blurred. Bandit approaches the field from a more enterprise-centred perspective, whereas Higgins centres on the user. Certain parts of Bandit which are not (yet) integrated into the Higgins source provide extra functionality in the domain of auditing and role-based access control. Thus a combination of Bandit and Higgins could be considered useful for the project.

9.6.1 Development Framework

The Bandit project's development [Bandit Development] is mainly driven by Novell. Other contributors are ActivIdentity, Higgins, Liberty Alliance, Microsoft, Novacoast, Red Hat, Sun, Sxip, Symantec, and Trusted Network Technologies.

The Bandit project components are licensed under different licenses, as they originate from different sources, which determined their respective licenses. Concretely, this means that OpenXDAS is licensed under the MIT license, CASA under the GNU LGPL, and the Identity Selector Service as well as the Higgins IdAS under the EPL.

9.7 Concordia Project

The Concordia Project [Concordia] aims at driving interoperability between the different projects in the identity management domain. It publishes definitions of real-world scenarios and requirements for identity protocols, which should encourage technology drivers to solve existing problems. Solving the given use cases requires the use of several identity protocols.

Development is ideally initiated by the description of a new problem. Thereafter the community discusses possible solutions using existing technology, possibly extending standards or their implementations. The most recent interoperability demonstration was shown at the RSA Conference 2008, where InformationCards were used in a federated scenario and the combination of SAML and WS-Federation was demonstrated. Additionally, much effort goes into documenting, and thereby solving, the problem of discovering an identity provider.

The most recent meeting of the Concordia Project members took place at the Burton Group's Catalyst Conference, where AOL, Boeing, General Motors and others presented issues which were subsequently discussed by the Concordia Project members.

The project was initiated by the Liberty Alliance (see chapter 8) in June 2007 and is now run as an independent community which does not generate code. The documentation of use cases and proposed solutions is released under the Creative Commons License.

Opportunities for PrimeLife

Concordia is a very recent initiative which has nonetheless received remarkable interest at the RSA 2008 workshop. Interoperability will be a key factor when it comes to market adoption of the tools and frameworks resulting from PrimeLife. As such, the results from the Concordia Project interop events should be followed closely. Still, PrimeLife might rather join forces with one of the projects already committed to the Concordia interop.

9.8 Open-Source Identity System (OSIS)

The Open Source Identity System (OSIS) [OSIS at Identity Commons] working group was created to intensify work on interoperability between the different identity management projects. The declared goal is to align the arising protocols, projects and companies such that overlaps are avoided and the resulting infrastructure is interoperable. The main focus currently lies on interoperability between InformationCards and OpenIDs, which shows that proprietary protocols and projects are considered as well as open source initiatives. The open source projects taken into account are Bandit, Heraldry (integrated into OpenID (see chapter 6)), Higgins (see chapter 7), OpenSSO, OpenXRI, Shibboleth and xmldap.

A list of OSIS Participants [OSIS Participants] is publicly available.

Opportunities for PrimeLife

If PrimeLife decides not to join forces with one of the current projects, participation in one of the interoperability initiatives would be necessary to allow for improved market acceptance. It would even be desirable to develop interoperable prototypes.

9.8.1 Specification development

OSIS is an opportunity for developers to gather data on the interoperability of their code with other standards or implementations. Consequently, there is no specification development apart from the specification of the features to test [OSIS Feature Test].

9.8.2 Open Source Interoperability Workshops

The different projects developed various features which were tested for interoperability. The results of those interoperability tests are used to increase the quality of the respective projects. The most recent identity interoperability conferences, where the implementations were tested to discover shortcomings of either the implementations or the respective standards, were:

• I3a User-Centric Identity Interop at the 2nd European Identity Conference 2008 in Munich, April 22-25, 2008
• I3 User-Centric Identity Interop through RSA 2008, RSA Conference, April 7-11, 2008
• Catalyst Barcelona: October 2007
• Catalyst San Francisco: June 27, 2007
• IIW 2007a, Mountain View, USA: May 2007

9.9 Pamela Project

The Pamela Project [Pamela Project] is a community driving implementations of relying parties for InformationCards. Currently, InformationCard-supporting relying party plugins for WordPress, Joomla and MediaWiki are developed and released under the BSD license. The project founders believe that an important reason for the slow market adoption of user-centric identity technology (i.e. InformationCards) is the lack of available relying party implementations. The project tries to help overcome this problem.

Opportunities for PrimeLife

While providing open source relying parties, which lowers the initial costs for developers of social networking technology, is an important contribution towards market adoption, this project does not offer many interfaces which allow for a connection to the PrimeLife project at this time.

9.10 OpenSSO

The OpenSSO [OpenSSO] project aims at the development of a simple yet effective single sign-on infrastructure for Web sites, which should allow for more secure electronic transactions, as users are no longer tempted to choose the same username/password pair for different sites. Identity federation is supported in addition to the single sign-on solution.

The OpenSSO project and the OpenFederation project are based on Sun's commercial products System Access Manager and System Federation Manager, respectively. Currently the following standards are implemented by the OpenSSO project:

• ID-FF v.1.1 and v.1.2 (including identity provider and service provider extended profiles)
• ID-WSF v.1.0 and v.1.1
• SAML v.1.0 and v.1.1
• SAML v.2.0 (operational modes: IdP and SP Complete)

There are plans for the implementation of ID-WSF v.2.0, WS-Federation Passive Requestor Profile and the Web Services Interoperability (WS-I) Basic Security Profile (BSP). The source is released under the Common Development and Distribution License (CDDL).

9.10.1 Architectural Overview

OpenSSO is an HTTP-based environment which provides an authentication service, a session service and a logging service to cover authentication, maintenance of session information and building of an audit trail. These services are shared among different instances, which leads to a single sign-on solution for the respective applications. A naming service allows addressing across different domains, so that an OpenSSO instance running on one Web server can discover another OpenSSO instance. This enables not only single sign-on scenarios but also identity federation, by using an implemented standard such as SAML to exchange identity information.

In an example scenario, a user wants to use a resource at a service provider (SP). The OpenSSO-aware SP presents an authentication interface to the user, who thereafter provides the credentials. The authentication interface communicates with the authentication service. Upon successful authentication, the session service is requested to issue a session cookie to the user. After this procedure, the user can access the requested resource using the session cookie.
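The scenario can be sketched roughly as follows. The function names, cookie format and in-memory stores are illustrative assumptions, not the actual OpenSSO API:

```python
# Sketch of the OpenSSO-style flow described above, with the
# authentication and session services reduced to two functions.
SESSIONS = {}                    # session service: token -> user id
ACCOUNTS = {"alice": "secret"}   # authentication service back-end

def authenticate(user, password):
    """Authentication service: verify credentials, ask the session
    service for a token, and hand it back as a session cookie value."""
    if ACCOUNTS.get(user) != password:
        return None
    token = f"sso-{len(SESSIONS) + 1}"
    SESSIONS[token] = user
    return token

def access_resource(cookie):
    """Any OpenSSO-aware service provider: accept the shared session
    cookie instead of re-authenticating the user."""
    user = SESSIONS.get(cookie)
    return f"resource for {user}" if user else "401: authenticate first"

cookie = authenticate("alice", "secret")
print(access_resource(cookie))  # the same cookie would be honoured at other SPs
```

Because the session service is shared, a second service provider can validate the same cookie without seeing the user's password, which is the essence of the single sign-on property.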

9.10.2 Opportunities for PrimeLife

OpenSSO is rather a single sign-on mechanism than an identity management scheme. While this functionality is very relevant for the user experience and therefore has to be considered by PrimeLife, it is not sufficient. Consequently, a collaboration with this project might not be preferable for PrimeLife.

There are other approaches aiming in the same direction as OpenSSO, such as the OpenIAM [OpenIAM] project funded by Diamelle Technologies and the SourceID [SourceID] project sponsored by Ping Identity. OpenIAM simplifies user management, single sign-on, password management and the maintenance of an audit trail, and it allows for role-based access control, strong authentication and federation. The data consumed by OpenIAM is provided by an identity administration tool (e.g. the Diamelle IDM system); to enable federation it supports SAML (1.1 and 2). The SourceID project releases open source relying parties which can be used together with the commercial server product sold by Ping Identity.

9.10.3 Open Source Implementations

• OpenSSO Source Code [OpenSSO Code]
• OpenIAM Source Code [OpenIAM Code]
• SourceID Toolkits [SourceID]

9.11 OAuth

The OAuth protocol [OAuth Core 1.0] is built on top of HTTP. It enables authority delegation on the Web: a user can authorise a third party (the Consumer) to access a Protected Resource (controlled by the Service Provider) in some useful way. Authorisation is revocable. The protocol design assumes a prior out-of-band agreement between Consumer (C) and Service Provider (SP) about data usage and certain protocol aspects.

An example scenario is authorising a location information hub to access a social travel site's information about the user's location: The location hub learns (from the user, or through a referral) about the URL of the user's travel profile. The user is redirected to the travel profile site, where he can determine what information (if any) is given to the location hub, and how long that authorisation should last. The authorisation information is passed back to the location hub, which can now process location information further. The user can revoke the authorisation without collaboration from the location hub's side.

Another example scenario is authorising an application (on the Web or locally) to fetch photos from a social photo sharing site.

9.11.1 Protocol flow

OAuth assumes that a consumer and a service provider have an advance agreement, which manifests itself in an (opaque) consumer identifier known to both parties, and in the establishment of a signing mechanism and key material for that mechanism; requests from the consumer to the service provider are signed, and must be verified by the service provider.

The basic authentication flow consists of three steps: First, the consumer obtains an unauthorised request token from the service provider. In requesting this token through HTTP, the consumer can provide additional parameters to the service provider, which might influence the scope of the authorisation. The protocol provides a framework for passing these parameters, but does not define a vocabulary for them, as this aspect is deemed application-specific. The service provider will return the token and a corresponding shared secret (plus, possibly, service-specific additional parameters) to the consumer. The initial request message is signed with the prearranged key; it includes the prearranged consumer identity, a possibly opaque string.

In the second step, the user authorises the request in an interaction with the service provider. This authorisation request is linked to the protocol's remaining steps through the request token, which may even be entered manually, enabling the user-facing parts of the protocol to be executed through referral methods other than HTTP redirects. (E.g., the authorisation step could be performed through text message on a mobile phone.)

During the authorisation step, the service provider will authenticate the user and be informed about the consumer identity and other pertinent information. When the user decision is communicated to the service provider through HTTP (and a callback URI was passed to the service provider beforehand), then the service provider will redirect the user browser back to the consumer; again, the request token is used to link the various requests. Otherwise, some other referral method might be used -- from contextual knowledge on the user side (the consumer displaying an appropriate message) to URIs in text messages.

The consumer now performs the third step of the protocol: exchanging the request token for an access token. To this end, the consumer sends a specific signed request to the service provider (through a direct HTTP request), which will (if the user had indeed authorised the consumer) respond with an access token and corresponding secret. The request includes the request token that had previously been used in interactions between the service provider, the consumer and the user; and the consumer opaque identifier. The service provider answers with the access token, a shared secret corresponding to that token, and any additional parameters.

The access token and corresponding secret are then used to generate authorisation tokens on subsequent requests for the protected resource from the consumer to the service provider (e.g., accessing the user location information from the travel site, or accessing a private photo for purposes of printing). The authorisation information can be passed through the HTTP authorisation header, as a POST parameter or as a query parameter on a GET request. The protocol does not impose any limitations on reuse of the access token; the life cycle of that token is determined by the service provider.
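The signing of consumer requests, which underpins all three steps above, can be sketched as follows. This is an illustrative sketch rather than a full OAuth Core 1.0 implementation: the endpoint URL, consumer key and secrets are invented, and parameter sorting is simplified (the specification sorts by percent-encoded name and value):

```python
import hmac, hashlib, base64, time, secrets
from urllib.parse import quote

def sign_request(method, url, params, consumer_secret, token_secret=""):
    """HMAC-SHA1 over an OAuth-style signature base string:
    METHOD & encoded-URL & encoded-sorted-parameters, keyed by
    consumer secret and (possibly empty) token secret."""
    enc = lambda s: quote(str(s), safe="")
    base_items = "&".join(f"{enc(k)}={enc(v)}" for k, v in sorted(params.items()))
    base_string = "&".join([method.upper(), enc(url), enc(base_items)])
    key = f"{enc(consumer_secret)}&{enc(token_secret)}".encode()
    digest = hmac.new(key, base_string.encode(), hashlib.sha1).digest()
    return base64.b64encode(digest).decode()

# Step one of the flow: a signed request for an (unauthorised) request token.
params = {
    "oauth_consumer_key": "hub-consumer-id",   # the prearranged opaque identifier
    "oauth_nonce": secrets.token_hex(8),
    "oauth_timestamp": int(time.time()),
    "oauth_signature_method": "HMAC-SHA1",
    "oauth_version": "1.0",
}
params["oauth_signature"] = sign_request(
    "POST", "https://sp.example/oauth/request_token", params, "consumer-secret")
print(params["oauth_signature"])
```

The same signing routine is reused in step three (exchanging the request token for an access token, with the request token's secret as `token_secret`) and on every subsequent access to the protected resource.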

The protocol is implemented through simple HTTP-based transactions (including referrals around the user authorisation decision); to achieve confidentiality of transactions, TLS is used. Confidentiality of network transactions is particularly significant for interactions between consumers and service providers that establish tokens and corresponding secrets.

The authorisation tokens sent when the consumer accesses the protected resource are one-time tokens. They are, however, subject to an offline brute-force attack to recover the corresponding secrets.

9.11.2 Trust and privacy properties

The semantics of authorisations are out of scope for the protocol's definition: they are a matter of (out-of-band) negotiation between consumer and service provider, which are expected to share API definitions, and of human-readable interactions between the service provider and the user. Extension points in the protocol can be used to pass these semantics between the parties.

The user trust decision is made in a transaction with the service provider. The service provider is the source of the human-readable information about the scope of the authorisation. Information about the consumer identity is derived from a prior out-of-band interaction between the service provider and the consumer, so that the protocol only bears an opaque identifier (though implementations might choose a human-readable identifier) bound to a signature verification key known to the service provider.

Authorisation tokens can be revoked through an interaction between the user and the service provider. The consumer does not require knowledge of the user credentials with the service provider, and does not need to know the user identity with the service provider. In addition, the protected resource URI might be anonymous.

9.11.3 Specification development

OAuth [OAuth.NET] was developed by an ad-hoc group of contributors.

9.11.4 Open Source Implementations

A list of implementations [OAuth Code] is maintained by the OAuth project.

9.12 Noserub

Noserub [Noserub] aims at realising a decentralised social network. It is built on three elements: the Identity-URL, the Profile and the Contacts. The Identity-URL is currently a Noserub-URL, which can be hosted on any server running Noserub. At the location the URL points to, there needs to be some meta-information identifying the URL as a Noserub-URL. In the future it is envisioned that any URL could be used as an Identity-URL.

The metadata at the Identity-URL is composed of the profile and the contacts, both of which can either be embedded or referenced. It needs to be given in XFN/Microformat or FOAF, where the Web accounts of this identity can be specified using a specific field in FOAF or a field used in a specific way in XFN. The contacts use a similar mechanism, with XFN and FOAF indicating that a URI refers to a contact. Noserub allows for a certain interoperability, which means that a contact with no Noserub Identity-URL can still be added and their accounts (e.g. at flickr, del.icio.us, ...) can still be monitored.

Opportunities for PrimeLife

The documentation of Noserub is not extensive so far, which makes it challenging to judge the project. The project obviously does not implement any privacy features yet; however, it is claimed that these will be added at some point in the future. In this area, it is a challenging task to enforce formulated privacy requirements across the different services that are integrated into Noserub. The other difficulty is clearly to formulate the privacy requirements in the first place. Although the initiative of bringing different social networks together is valuable, from a privacy point of view it adds much complexity.

Chapter 10 Conclusion

The PRIME project [PRIME] has implemented a prototype of a privacy-enhancing identity management system (called the integrated prototype) and, based on it, a number of application prototypes. Some of these components were made available as open source during the lifetime of the PRIME project, and some will be made available in the near future. This section reviews these components so that the PrimeLife project can evaluate which of them are mature enough, or should be extended and matured, and then open sourced by the PrimeLife project.

This section also briefly discusses the different Activities 1, 2, 4, 5, and 6 of PrimeLife, the results these aim to produce, and their potential for open source or standardisation.

10.1 Results Available From PRIME

10.1.1 Privacy Ontologies

Within most PRIME components, the ontology is used as a unified framework to access the data storage of the different PRIME subcomponents. For example, session information about a contact partner is stored and passed through the ontology. Some subcomponents (e.g. the policy database and SPCC) use a legacy database or data storage back-end wired to the ontology for internal usage.

The design of the ontology system supports linking a complete legacy database containing personal information (e.g. a customers table in an SQL database) to the ontology framework. Static relationship and data type definitions specify the translation of the legacy data to a pre-agreed PII data schema, which is currently a slight extension of the P3P data schema. However, this has not been implemented to the full extent within PRIME. The BluES'n application prototype implements such a legacy database, but the translation description is incomplete.

The design of the ontology also supports attaching data handling policies to exposed personal information, but the current implementation lags behind, and this feature is not yet fully working in the PRIME code.

The PRIME privacy ontology provides a flexible and powerful framework to model personal information and information about privacy-relevant properties of that information using Semantic Web technologies.

Goals of the ontology developed in PRIME include:

• the ability to model arbitrary relational data schemata in a unified framework, based on RDF
• the ability to model the relationship of such a data schema to an agreed ontology for personal information, such as the top levels of the P3P data schema
• the ability to thereby phrase privacy and access control policies in broadly agreed terms, while preserving their automatic applicability to data processed and stored in business processes
• the ability to model the relationships between information that anonymous credentials can provide, the information needed in authorization decisions, and personal information otherwise processed
• the ability to model privacy requirements, attach them to the data that are being processed in a specific process, and to automatically verify whether privacy requirements have been fulfilled.
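As a rough illustration of the first goals, a legacy data item can be typed against an agreed schema and carry its own data handling policy. The vocabulary below is invented for illustration; the actual PRIME ontology uses RDF and Semantic Web tooling:

```python
# Minimal triple-based sketch of the idea: a legacy database cell is
# described as an instance of an agreed PII concept, and a policy is
# attached directly to the data item.
graph = set()

def add(subject, predicate, obj):
    graph.add((subject, predicate, obj))

# Model a legacy column value as an instance of an agreed PII concept ...
add("customers.email#42", "rdf:type", "p3p:user.business-info.online.email")
add("customers.email#42", "pii:value", "alice@example.org")
# ... and attach a data handling policy to the data item itself.
add("customers.email#42", "policy:handledBy", "policy:deleteAfter30Days")

def policies_for(item):
    """Query in the agreed terms, independently of the storage back-end."""
    return [o for s, p, o in graph if s == item and p == "policy:handledBy"]

print(policies_for("customers.email#42"))
```

In the real system, reasoning over such a graph is what allows policies phrased against the P3P-style schema to be applied automatically to data stored in arbitrary legacy tables.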

All of these abilities involve the dynamic processing of a relatively simple ontology framework, and of ontology components that model a specific application's needs and data schemata.

The PRIME ontology and reasoner constitute a prototype implementation of these ideas, scoped by the requirements and use cases within the project. They are, in particular, used in the processing of relevant policies, and in the retrieval and processing of personal information.

We expect that PrimeLife work on policy languages will take up lessons learned in the PRIME project work on ontologies, Semantic Web technologies and data modeling ideas.

The current way of accessing the ontology in PRIME heavily limits the flexibility of its design. For example, static structures modelled as Java classes are used to read out parts of the data, which prevents transferring dynamic relation graphs and thus prohibits later reasoning on this data. It is planned to fix this within the PrimeLife project.

However, additional resources to enhance the reasoning capabilities of PRIME are not explicitly assigned in PrimeLife. In PrimeLife, work on ontologies is not an explicit work item in the work plan, but ontology work might greatly benefit work to be done in various areas of PrimeLife, e.g., policies, identity federation, and trusted content. Details on the further elaboration of ontologies have not been decided on yet.

10.1.2 The JRC Policy Workbench

The JRC Policy Workbench [JRC Policy API] is an API for building policy editing and testing environments, which includes an implementation of an editor for P3P 1.1 and P3P 1.0 policies.

During the P3P 1.0 development, IBM developed a policy editor. People in the W3C P3P Working Group had realised that writing the XML source by hand was an option only for privacy professionals; there were just too many options. IBM developed the editor until it was compliant with P3P 1.0, but the further work on P3P 1.1, which especially addressed the challenges of the user interface, was never implemented.

In the PRIME project, the policies and mechanisms used were even more complicated than the basic P3P expressions. It quickly became clear that a policy editor would be a good proof of concept, as it would force thinking through the main possibilities of the language developed by PRIME.

The JRC Workbench is not only a fully compliant P3P 1.1 editor; it also exposes its APIs in a way that allows the easy integration of other policy languages. This will allow for the easy creation of editors for newer languages with richer semantics.

The user interface remains a challenge, as displaying all options at all times would confuse users and bloat the interface into an unusable option soup. Guiding the user in the right direction allows defining a certain path the user has to go through.

The JRC Policy Workbench is a tool for service providers. If PrimeLife wants to have real-world impact, it will have to make the implementation of service-side components easy. The JRC Workbench is the only tool around capable of serving as a good basis for further development.

10.1.3 Policy Language

PRIME results on policy languages are discussed in depth in PRIME Policy Languages (see chapter 4).

10.1.4 SendPersonalData dialog

The SendPersonalData dialog, also known as the AskSendData dialog, is the component asking the user to select and confirm data sent between the local PRIME instance on the client and the PRIME instance on the Web server side.

It is a Java application opened by the local PRIME instance when the interceptor plugin (see chapter 10) calls the Web service accessRemoteResource.

The purpose of the SendPersonalData dialog is to obtain the user's informed consent about the privacy circumstances, to ask which PII data to expose to the server, and to allow on-the-fly management of personal information.

The dialog communicates with the local PRIME instance through two Web service interfaces, the so-called common and restricted interfaces. The documentation of these interfaces is contained in the source code documentation of PRIME (classes CommonMediatorService and RestrictedMediatorService).

The current architecture of PRIME allows plugging in any implementation of a SendPersonalData dialog and is not restricted to the current default implementation. However, only the default implementation supports all features of the PRIME toolbox (e.g. PrivPrefs, Credentials, SPCC, Assurance Control Blacklists).

The minimum interface that a SendPersonalData dialog has to implement to be supported by PRIME is to:

• call the Web service listSession to obtain the suggested Claim information
• choose one of the suggested Claims, or choose not to send data at all.
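The minimal contract can be sketched as follows. The listSession stub and the Claim structure are invented for illustration and do not reflect the actual PRIME Web service schema:

```python
# Sketch of the minimum SendPersonalData dialog contract: fetch the
# suggested Claims, then pick one or decline. The local PRIME instance
# and its listSession Web service are stubbed here.
def list_session(session_id):
    """Stub for the 'common' Web service call returning suggested Claims."""
    return [{"claim": "over-18", "pii": ["birthdate"]},
            {"claim": "full-profile", "pii": ["name", "email", "birthdate"]}]

def minimal_dialog(session_id, choose):
    """A dialog implementation only has to fetch the suggested Claims
    and either pick one or refuse to send any data (returning None)."""
    claims = list_session(session_id)
    return choose(claims)

# A data-minimising user picks the claim exposing the fewest attributes:
chosen = minimal_dialog("sess-1", lambda cs: min(cs, key=lambda c: len(c["pii"])))
print(chosen["claim"])
```

Anything beyond this, such as assembling a new Claim from locally stored PII, requires the restricted interface described below.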

Additional functions are provided in the restricted Web service, e.g. to build a new Claim out of PII information from the local PRIME instance's database. In the current implementation, to use this extended feature of creating an own Claim from PII information, the dialog has to be written in Java and has to be called within the process space of the local PRIME instance. It is planned to extend this in the PRIME extension to enable non-Java SendPersonalData dialogs which can create their own Claims.

It has to be decided which of the prototypes developed in Activity 1 will use and extend the current SendPersonalData dialog and which will replace it with another dialog. User interface extensions and HCI research results (Activity 4) will be built into this SendPersonalData dialog.

There exist other implementations of the SendPersonalData dialog in some application prototypes, such as PRIME BluES'n (see chapter 10). The effort to replace the current dialog by other simple PII selection dialogs like Higgins [Higgins] is moderate. However, the more advanced features like Assurance Control or PrivPref management are very specific to PRIME, so these features could only be reused in other open source software with much effort.

Using the SendPersonalData dialog as an open source project separate from PRIME is only possible if the local PRIME instance's Web services are implemented (much as for the Console plugin (see chapter 10)). Unlike the console, the HCI design of the dialog is based on research results from within the PRIME project and represents a final HCI prototype. So, decoupling the SendPersonalData dialog from PRIME could be advisable if usage in a separate open source project is targeted.

10.1.5 Firefox plugins

The PRIME project's results include three different Firefox plugins. All plugins are briefly introduced, and their possible usage in PrimeLife and their relation to existing open source programs are discussed.

Interceptor

The interceptor is the smallest and simplest Firefox extension provided by PRIME. It listens to incoming and outgoing HTTP connections of the Web browser and inserts or interprets special PRIME HTTP headers to communicate with the local PRIME instance. The headers are handled as follows:

• All incoming Web pages are scanned for HTTP headers named X-PRIME-Handle and X-PRIME. If both are found, the Web service accessRemoteResource on the local PRIME instance is called, passing both header fields as parameters. The return value of the Web service call is stored temporarily.
• All outgoing requests are inspected; if they point to a site to which a session id was previously sent, the stored return value of accessRemoteResource is inserted as the X-PRIME-Handle header.

The overall code base of the interceptor is about 250 lines of JavaScript code (including comments).
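The header exchange described above can be sketched in a few lines. Since PRIME's core components are written in Java, the sketch below uses Java rather than the plugin's actual JavaScript; the class and method names (other than the header names and the accessRemoteResource Web service) are hypothetical, and the real plugin of course hooks into Firefox's HTTP observer machinery instead of receiving plain maps:

```java
import java.util.HashMap;
import java.util.Map;

// Sketch of the interceptor's header-exchange logic (hypothetical names).
// The real extension is ~250 lines of JavaScript inside Firefox; this
// fragment only illustrates the protocol described above.
public class InterceptorLogic {

    // Stand-in for the local PRIME instance's accessRemoteResource Web service.
    interface PrimeClient {
        String accessRemoteResource(String primeHandle, String prime);
    }

    private final PrimeClient prime;
    // site -> stored return value of accessRemoteResource
    private final Map<String, String> handlesBySite = new HashMap<>();

    public InterceptorLogic(PrimeClient prime) {
        this.prime = prime;
    }

    // Incoming response: look for the X-PRIME-Handle / X-PRIME header pair.
    public void onResponse(String site, Map<String, String> headers) {
        String handle = headers.get("X-PRIME-Handle");
        String primeHeader = headers.get("X-PRIME");
        if (handle != null && primeHeader != null) {
            // call the local PRIME instance and remember the result for this site
            handlesBySite.put(site, prime.accessRemoteResource(handle, primeHeader));
        }
    }

    // Outgoing request to a known site: re-insert the stored value.
    public void onRequest(String site, Map<String, String> headers) {
        String stored = handlesBySite.get(site);
        if (stored != null) {
            headers.put("X-PRIME-Handle", stored);
        }
    }
}
```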

There may be some minor modifications necessary to make it possible to call other PRIME client functionality than accessRemoteResource, for example to trigger the fetching of credentials or to open configuration dialogs.

The interceptor could be replaced by direct P2P communication frameworks based on the Web browser's JavaScript, enabling the PRIME server to communicate directly with the PRIME client as long as the browser has the Web server's site open. One example is JXTA [JXTA]. This would require changes in the PRIME core as well.

However, as the interceptor is currently very low in complexity, the additional advantages of such a framework should be evaluated before introducing this complexity.

Regarding the possibility of turning the interceptor into a separate open source project: the interceptor is very tightly bound to the needs of PRIME, and it makes little sense to make an open source project out of this component alone. It could be evolved into a generic framework for callback communication from Web server sites, but this would require almost the same effort as writing a new one.

Console

The current user interface for managing personal information (PII) in PRIME is implemented as a Firefox extension called the PRIME Console. The main component is the information window for showing and editing PII. Another component shows the DataTrack (see chapter 10).

Most of the implementation is still in beta state and some parts are non-functional. The JavaScript implementation of the console is considered only a prototype of the human-computer interface and was never meant to be the final version.

Plans exist to enhance the user interface, and layouts and prototypes will be provided by WP4.4. In this process, it is planned to reimplement the user interface in Java (there was a former Java prototype, but this rewrite does not make use of it), because more advanced graphical interactions are easier to implement there. The SendPersonalData dialog (see chapter 10) has already been ported back to Java within the scope of PRIME, and the DataTrack (see chapter 10), which is currently part of the JavaScript console plugin, will probably also be ported to Java within the PRIME extension.

There exist new requirements for the user interface, coming from Activity 1 and designed in WP4.4, which are not completely available yet. These will be integrated into the Java version of the user interface and will extend the user's possibilities for interacting with the data in the local PRIME instance.

Many single sign-on solutions exist with simple user interfaces for managing personal information (OpenID, SXIP). Also, some Web browsers like Opera already include personal data dialogs for sending information through Web forms. Other applications provide address book functionality, where the user can edit several personal identities and later on (e.g. in the context of sending an email) choose one of them (Thunderbird, KAddressbook, Evolution). These tools usually provide only one identity (or a simple enumeration of a few identities) with a static set of categories. It is conceivable to display PII from PRIME in those interfaces. However, some prototypes of Activity 1 require more sophisticated PII management than these tools provide.

More complex approaches to personal information management user interfaces include Microsoft CardSpace (see chapter 7), which is not open source, and Higgins (see chapter 7). Higgins, being an open source environment, could probably be enhanced to carry out PII management as desired by PrimeLife. However, a close inspection is necessary to decide whether such enhancements are possible within the time frame of PrimeLife.

There are plans in PRIME to access the MS CardSpace (see chapter 7) API with the current PRIME user interface, as well as to use the CardSpace interface to access PRIME-enabled Web sites (these are only plans; as far as we know, no deliverable covers this). However, CardSpace does not support some features of PRIME, such as purpose binding, has only a limited data track, and lacks a "transfer to third parties" policy (the latter is not implemented in PRIME yet either). Thus, if PrimeLife uses PRIME as a tool library, fully replacing the console with an existing open source alternative is not considered easily possible; existing products could only be extended to include parts of the console functionality.

Furthermore, the console functionality is specialised to the needs of the data model and PII model of PRIME, so using it outside of PRIME as a separate open source product would limit its applicability. From a technical point of view, extraction is as easy as implementing all Web services that the console requires. Currently, there are about 15 Web services providing mostly CRUD functionality on several data types, but within PrimeLife this will extend to more sophisticated interfaces (including search functions or the intelligent suggestion functions of PRIME). Although the console could be used outside of the PRIME toolbox, the development of the new Java console would probably always be forked from the main development in order to support the very customised requirements of the PRIME PII model.

Formfiller

The PRIME Firefox extension named "Formfiller" is an attempt to provide typical form filler functionality with PRIME as the PII data source. This way, other PRIME features like the DataTrack (see chapter 10) and the SendPersonalData dialog (see chapter 10) can be used to fill in Web forms. As of IPv3, the plugin no longer works.

As far as we know, form filler functionality is not explicitly requested by PrimeLife's prototypes, but providing it could be important for real user acceptance of other prototype functionality. Several plugins and form filler systems for Web browsers exist, e.g. RoboForm [Roboform], AutoForm or SignupShield. Even more single sign-on or keystore solutions are available: PasswordSafe, Kesto [KESTO] and many others. Should the functionality be needed in PrimeLife, it is advisable to use an existing open source form filler instead of fixing the existing one.

As the form filler is only an interface for calling the SendPersonalData dialog (see chapter 10), it cannot be used separately from PRIME. The user interface is restricted to a single combo box, so reusing the interface design is not possible either.

10.1.6 DataTrack

The DataTrack is a tool integrated into the PRIME Console that displays information about released personal data. It is currently written in JavaScript using XUL as the GUI language. It uses a Web service provided by the PRIME client core to obtain information about the user's transactions and past service partners, and shows them in different ways.

The main view is a graphical slider that zooms through the history of data sent to other parties, showing the revealed information in a card-like fashion. The data is obtained via a Web service from the PRIME client core. Other views show a table of all information sent through PRIME to any contact.

In its current state the DataTrack uses XUL RDF queries to obtain the data presented by the mediator Web service. However, as long as the data is presented in a similar manner, there is presumably no reason why the RDF source could not be replaced by any other source with minimal changes to the code base. The implementation relies heavily on Gecko DOM calls and most content is constructed dynamically. Because of this, some parts of the code may appear quite complex to developers unfamiliar with DOM coding paradigms.

Currently the DataTrack implements all views as well as a free text search. It also displays a full contact information window when a receiver is double-clicked in any view. However, the date search and the other search categories are not fully implemented in all views. The correct, delete and info functionalities of the current version of the DataTrack only produce a placeholder e-mail with the subject and the contact e-mail address of the receiver. These values are currently constants, but if contact information is available in the database it is not difficult to fill them in based on this information.

The XUL/JavaScript incarnation of the DataTrack is an indivisible unit, but we believe that the different components of the Java incarnation (see below) could be used as pluggable classes to display and handle personal data in other Java applications.

The DataTrack is currently being ported to Java, and a port with at least the functionality of the current IPv3 version will be available at the end of IPv3x. After this, the transparency functionality (i.e. correct, remove and inform on personal data) will be implemented service by service as the first step in the 2.2.1 Transparency part of WP2.2. Together with WP4.4 of PrimeLife, the user interface will be evaluated. Feedback will be integrated into the DataTrack and used for the prototypes of WP1.2 and WP1.3.

The functionality of the DataTrack would fit well into the Higgins project (see chapter 7). Working towards common interfaces supported by Higgins, such as WS-*, would be advisable. It could be evaluated further whether existing products like iJournal from the MozPETs suite, the Higgins user interface, or the "Enhanced History Manager" [EnhancedHistory] plugin for Firefox could be enhanced to support the full feature spectrum planned for the Data Track within PrimeLife.

10.1.7 Privacy Policy Decision Engine

PRIME Access Control Decision Function (ACDF)

The PRIME Access Control Decision Function (ACDF) component is responsible for taking an access decision for all access requests directed to data/services. ACDF produces the final response, possibly combining the access decisions coming from the evaluation of different policies (both access control/release and data handling). The elements relevant to the decision are the applicable policies, the context, which contains the information associated with the requester during a session, and the subject, action, object, and purpose of the access request. The decision component can return three different decisions:

• Yes: the request can be granted;
• No: the request must be denied;
• Undefined: the current information is not sufficient to determine whether the request can be granted or denied. In this case, additional information is needed and the counterpart will be asked to provide it.

ACDF mainly interacts with two other components: the Request Context module and the Policy Management (PM). The Request Context module keeps track of all contextual information, combines information from various context sources and deduces new contextual information from this aggregation. The main tasks of the Request Context module are to provide contextual information from various sources in a standardised way, and to provide reasoning functionality for boosting the evaluation process (i.e., integration with an ontology). The Policy Management component manages, stores, and distributes the policies to be used in the access control evaluation process. The ACDF communicates directly with the Policy Management component to retrieve all policies applicable to an access request. One of the most important functionalities of ACDF is to provide an infrastructure that combines the evaluation of access control, release, and data handling policies. Given an access request of the form (subject, action, object, purpose), where the object can be a service or some PII associated with a user, the access request is first received by an enforcement point, called ACEF (the PEP in the XACML architecture), and is then evaluated by the ACDF in two steps as follows.

• In the first step, the access request is evaluated against the applicable access control (or release) policies collected from the Policy Management component. Note that if no policy is selected, the access is denied (i.e., the default access decision is deny-all). The actual implementation is based on policies specified in Disjunctive Normal Form (DNF), meaning that rules inside a policy are ORed and conditions inside the rules are ANDed. After collecting all applicable policies, each policy is evaluated by inserting the policy conditions into a Reverse Polish Notation (RPN) stack that is used to perform the evaluation. At the end of the policy evaluation, the system combines the individual policy results to reach a yes, no, or undefined access decision. In case of a negative (no) access decision, the access request is denied and the process terminates. In case of an undefined access decision, the information submitted by the requester is insufficient to determine whether the request can be granted or denied; additional information is requested by communicating filtered queries to the requester. Such requests are called claim requests. It is important to highlight that a claim request could contain sensitive information. In this case, a sanitisation process can be performed before the claim request is sent to the counterpart, to avoid the release of sensitive information related to the policy itself. In case of a positive (yes) access decision, the access request evaluation continues and the ACDF has to verify whether there are restrictions on the secondary use of the requested target.
• In the second step, the ACDF retrieves all data handling policies attached to the target of the request. If no data handling policy is applicable to the request, this step is skipped and the access is granted. Otherwise, the applicable data handling rules are retrieved and evaluated.
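The two-step evaluation and the three-valued outcome can be sketched as follows. This is only an illustration under stated assumptions: the source text does not specify how individual policy results are combined, so the sketch assumes a deny-overrides rule (no dominates, then undefined); the class and method names are hypothetical.

```java
// Sketch of the ACDF's three-valued decision and two-step evaluation:
// access control/release policies first, data handling policies only on
// a positive outcome. The deny-overrides combination is an assumption.
public class AcdfSketch {

    public enum Decision { YES, NO, UNDEFINED }

    // Deny-all default: no applicable policy means NO.
    // Otherwise NO dominates, then UNDEFINED, then YES (assumed combinator).
    public static Decision combine(java.util.List<Decision> policyResults) {
        if (policyResults.isEmpty()) return Decision.NO;      // default deny-all
        if (policyResults.contains(Decision.NO)) return Decision.NO;
        if (policyResults.contains(Decision.UNDEFINED)) return Decision.UNDEFINED;
        return Decision.YES;
    }

    // Step 1: access control (or release) policies; step 2: data handling
    // policies, skipped (access granted) when none are applicable.
    public static Decision evaluate(java.util.List<Decision> accessControlResults,
                                    java.util.List<Decision> dataHandlingResults) {
        Decision first = combine(accessControlResults);
        if (first != Decision.YES) return first;              // denied or undefined
        if (dataHandlingResults.isEmpty()) return Decision.YES; // step 2 skipped
        return combine(dataHandlingResults);
    }
}
```

An undefined first-step result would, in the real component, trigger a (possibly sanitised) claim request to the counterpart rather than a final answer.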

A privacy-aware access control system prototype

A Java-based prototype of the ACDF, part of the privacy-aware access control system, has been developed in the context of the European PRIME project. The ACDF component takes an access decision for all access requests directed to data/services by evaluating all policies applicable to the requests. The ACDF has been designed to be thread-safe, and its implementation therefore supports the execution of multiple ACDF instances running at the same time, without any interaction among them. After receiving the access request, the ACDF:

• retrieves the access control (release, respectively) policies by querying the Policy Management component;
• evaluates the access control (release, respectively) policies using a Reverse Polish Notation (RPN) stack, and takes an access decision;
• collects the Data Handling Policies (DHPs) attached to the target of the request;
• evaluates the DHPs using an RPN stack, and takes a decision;
• composes the different evaluations to generate a single access decision.

The mechanism used to evaluate the different types of policies (i.e., access/release and data handling policies) is the same and relies on a Reverse Polish Notation (RPN) stack. The RPN stack-based evaluation has the main advantages of being very fast and of making the evaluation process independent of policy syntax and semantics. The translation from each policy language (both access/release and data handling languages) into the RPN format is performed by a specific implementation of a policy loader. In particular, two different classes interpret the syntax of DHPs and the syntax of access/release policies. In this way, it is possible to add new policy languages by adding a specific loading class, with minimal impact on the current implementation. To conclude, it is important to remark that ACDF supports conditions evaluated both on certified data, issued and signed by authorities trusted to make the statement, and on uncertified data, signed by the data owner itself. The declared and certified information relevant to an evaluation process is retrieved from the Request Context module.
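The RPN-based evaluation can be illustrated with a minimal sketch. Assumptions: the policy loader has already evaluated the individual predicates, so the operand tokens here are simply "T"/"F"; the token format and class name are hypothetical, not PRIME's actual representation.

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Minimal sketch of RPN-stack policy evaluation: a loader translates a
// policy into postfix tokens; the engine then evaluates them independently
// of the original policy syntax. Token format is an illustrative assumption.
public class RpnEvaluator {

    public static boolean evaluate(String[] rpnTokens) {
        Deque<Boolean> stack = new ArrayDeque<>();
        for (String token : rpnTokens) {
            switch (token) {
                // operators combine the two topmost operands
                case "AND": stack.push(stack.pop() & stack.pop()); break;
                case "OR":  stack.push(stack.pop() | stack.pop()); break;
                // operands: already-evaluated predicate results
                case "T":   stack.push(true);  break;
                case "F":   stack.push(false); break;
                default: throw new IllegalArgumentException("unknown token " + token);
            }
        }
        return stack.pop();
    }
}
```

For a DNF policy, the loader would emit the ANDed conditions of each rule followed by ORs joining the rules, e.g. `(c1 AND c2) OR c3` becomes the postfix sequence `c1 c2 AND c3 OR`.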

The Access Control Architecture is based on the traditional PEP-PDP-PAP infrastructure described in the XACML proposal. The ACDF has the same role and responsibilities as XACML's PDP.

The ACDF evaluation infrastructure has been developed to be flexible, extensible, and reusable, and to support different access control models and languages that can be integrated with small coding effort. While the ACDF and Higgins address different problems (controlling access and managing identities, respectively), they are nevertheless based on a similar paradigm: the use of an abstraction layer to unify different solutions. The ACDF has the following three main extension points.

• Policy Loader. Multiple policy loaders can be added to allow the evaluation of languages whose policy syntax differs from the ones used in the PRIME project. New policy languages can be evaluated by adding the specific loading class, with minimal impact on the current implementation.
• RPN Stack Element Loader. RPN Stack Elements are predicates that model policy components and mechanisms to combine policy conditions. RPN Stack Elements can be extended, changed (if their semantics change), and deleted.
• Predicate Manager. ACDF manages several pre-defined predicates, which are used to define conditions in the policies. The ACDF infrastructure is designed to support non-standard predicates defined a posteriori, with minimal impact on the evaluation engine.
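The shape of these extension points can be sketched as a set of interfaces plus a registry. All names and signatures below are illustrative assumptions, not the actual PRIME API; they merely show how registering one loader class per policy language keeps the evaluation engine untouched.

```java
// Hypothetical shape of the three ACDF extension points described above:
// a new policy language only needs its own loader; predicates and RPN
// stack elements are equally pluggable. All names are assumptions.
public class AcdfExtensionPoints {

    // An element on the RPN evaluation stack (operand or operator).
    interface RpnStackElement {
        void apply(java.util.Deque<Boolean> stack);
    }

    // Translates one policy syntax into the common RPN format.
    interface PolicyLoader {
        java.util.List<RpnStackElement> load(String policyText);
    }

    // A pluggable predicate, e.g. equal(...) or greater_than(...).
    interface Predicate {
        boolean evaluate(java.util.Map<String, String> requestContext);
    }

    private final java.util.Map<String, PolicyLoader> loaders = new java.util.HashMap<>();

    // Registering a loader is all that is needed to support a new language.
    public void registerLoader(String language, PolicyLoader loader) {
        loaders.put(language, loader);
    }

    public boolean supports(String language) {
        return loaders.containsKey(language);
    }
}
```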

To conclude, in the context of open source and standards, the main contribution the current ACDF implementation can provide is a stable and flexible evaluation infrastructure that can easily be extended to integrate different languages and models, with limited effort and few changes to the evaluation engine. In particular, focusing on open source, the heterogeneity (in terms of the diverse backgrounds and skills of the contributors) and the vitality of the community should foster the definition and development of many policy plug-ins, creating adapters that allow the ACDF to evaluate several policy specifications while keeping the complexity low.

10.1.8 BluES'n

BluES is an online collaboration system which includes many components (e.g. chat, mail forum, calendar, content creation and presentation). BluES'n is the privacy-enhanced version of BluES, extended with functionality coming from PRIME. Additionally, there is integrated pID management, reputation management, and some awareness features.

BluES'n will not be actively developed in PrimeLife, as we will focus on another prototype. However, we can learn from the concepts and design criteria established in BluES'n.

Especially while designing the collaborative workspace demonstrator in WP1.2, we should keep the BluES'n concepts in mind, namely the group and privacy awareness features (info centre), the contexts/Decision Suggestion Module, and the Pseudonym-Alias-Mapping. Furthermore, BluES'n uses the transaction concept in place of sessions. This way, it facilitates unlinkability (following the privacy principle of starting from maximum privacy).

10.1.9 PRIME Policy Manager

The PRIME Policy Management (PM) component manages the overall policy life cycle by providing functionality for administering policies. It is also the component that interacts with the Access Control (AC) and Identity Management (IDM) modules, filtering responses going from the AC to the IDM and restricting the release of sensitive information related to the policy itself or to the status against which the policy has been evaluated. The definition of sensitive information and of sanitisation operations are issues managed in cooperation with the AC module. The Policy Management component addresses the following requirements:

• it provides operations for policy administration, such as search, store, update, check, and delete;
• it provides functions for searching the policies applicable to a certain request;
• it provides filtering functions that restrict the release of sensitive information related to the policies when additional requests have to be generated by the AC;
• it provides policy storage.

The PM component is composed of the Policy Presentation (Ppres) and the Policy Processing (Pproc) components. The Policy Presentation component acts as a policy presentation interface: it receives an access request as input and returns all applicable policies as output. Ppres is used by the ACDF to take an access decision. Note that Ppres interacts directly with the Policy Repository to retrieve policies.

The Policy Processing component provides filtering functionality on the response that the AC module returns to the counterparts. This avoids the release of sensitive information related to the policy itself. For instance, suppose that the response returned by the access control is equal(citizenship,'EU'). The PM could decide to return the response to the user as it is, or it could modify (sanitise) the response by asking the user to declare their nationality (e.g., 'give me your citizenship'). This way, the fact that access is restricted to EU citizens is not disclosed. The filtered information is called sanitised information. The definition of sensitive information and of sanitisation operations are issues managed in cooperation with the AC module.

Finally, the Policy Repository (PR) module provides policy storage and search functionality. The PR is based on the relational database concept and is designed to be independent of the actual storage infrastructure. The PR provides a fine-grained query infrastructure based on policy constraints (i.e., object, multiple actions, subject, and so on). An Abstraction Layer hides low-level details and isolates the PR from the outside. By default,

the PR works in an asynchronous way, allowing concurrent access to the data. Each request to the PR is filtered by the Abstraction Layer, which implies that the Abstraction Layer unifies the interfaces to the PR module.

The PM component provides functionality for policy administration. However, in a privacy-aware access control system, the PM component accomplishes two main tasks: i) it provides a search engine to retrieve the policies applicable to a given request, and ii) it provides sanitisation functionality. The policy search engine is based on an Abstraction Layer that provides generic interfaces for accessing the PR module. The Abstraction Layer is designed to support multi-threaded access to the physical policy storage. The multi-threading support in the PR is based on the Simple Concurrent Object Oriented Programming (SCOOP) model, where the concept of a thread is extended to the concept of an active object. To support the SCOOP model, the PR module is designed as a singleton Consumer. The Singleton pattern ensures that a class has only one instance and provides a global point of access to that instance. The PR then runs in a separate thread and waits asynchronously for the requests that a set of Producers insert into the PR's request queue. Each Producer registers a callback method with the request. The PR processes the requests one by one, and the results are returned to the original requesters. The PR has been designed to support different kinds of repositories. In the current implementation, two different storages are supported:

• MySQL: the world's most popular open source database, owing to its consistently fast performance, high reliability and ease of use.
• HSQLDB: a leading SQL relational database engine written in Java. It has a JDBC driver and supports a rich subset of ANSI-92 SQL plus SQL:1999 and SQL:2003 enhancements. It offers a small, fast database engine with both in-memory and disk-based tables, and supports embedded and server modes.

New storages can be added, if necessary, by implementing a single interface. This interface acts as a Facade over the physical storage, assuring the following basic functionalities: policy addition, policy update, policy deletion, and policy search. Focusing on sanitisation, the PM provides a thread-safe process that filters the response to be returned to the counterpart, to avoid releasing sensitive information related to the policy itself. In particular, it cleans the condition to be returned to the counterpart as the access response, according to three possible levels of sanitisation: i) strong sanitisation means that a fully sanitised condition is generated (e.g., rather than returning the original condition equal(user.Name.Given,'Alice') as it is, a request for declaring user.Name.Given is returned); ii) weak sanitisation means that a partially sanitised condition is returned (e.g., rather than returning the original condition greater_than(user.Age,18) as it is, a request for declaring user.Age together with the applied operator is returned); iii) no sanitisation means that the full, un-sanitised condition is returned (e.g., equal(user.Citizenship,'Italy') is sent to the counterpart without changes).
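The three sanitisation levels can be sketched directly from the examples in the text. The (operator, attribute, value) triple representation of a condition and the output strings are simplifying assumptions; PRIME's actual condition representation is richer.

```java
// Sketch of the three sanitisation levels described above, using the
// examples from the text. The condition representation (operator,
// attribute, value triple) is a simplifying assumption.
public class Sanitiser {

    public enum Level { STRONG, WEAK, NONE }

    // Turns e.g. greater_than(user.Age, 18) into the request actually
    // sent to the counterpart, depending on the sanitisation level.
    public static String sanitise(Level level, String operator,
                                  String attribute, String value) {
        switch (level) {
            case STRONG: // only ask for the attribute, hide operator and value
                return "declare(" + attribute + ")";
            case WEAK:   // reveal the operator, hide the value
                return "declare(" + attribute + "), operator=" + operator;
            default:     // NONE: return the full condition unchanged
                return operator + "(" + attribute + ",'" + value + "')";
        }
    }
}
```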

The PM has the same role and responsibilities as XACML's PAP.
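Returning to the Policy Repository's singleton-Consumer design described above, its essence can be sketched as a single repository thread draining a request queue and invoking each producer's callback. The class names, the String-based request/result types, and the placeholder search are all assumptions for illustration only.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.function.Consumer;

// Sketch of the PR's singleton-consumer design: producers enqueue
// requests with a callback; a single repository thread processes them
// one by one and returns the result to the original requester.
public class PolicyRepository {

    // A queued request: a query plus the producer's callback.
    static final class Request {
        final String query;
        final Consumer<String> callback;
        Request(String query, Consumer<String> callback) {
            this.query = query;
            this.callback = callback;
        }
    }

    private static final PolicyRepository INSTANCE = new PolicyRepository();
    private final BlockingQueue<Request> queue = new LinkedBlockingQueue<>();

    private PolicyRepository() {
        Thread worker = new Thread(() -> {
            try {
                while (true) {
                    Request r = queue.take();           // wait asynchronously
                    r.callback.accept(search(r.query)); // result to requester
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        worker.setDaemon(true);
        worker.start();
    }

    public static PolicyRepository getInstance() { return INSTANCE; }

    // Producers register their callback together with the request.
    public void submit(String query, Consumer<String> callback) {
        queue.add(new Request(query, callback));
    }

    // Placeholder for the actual storage lookup behind the Facade.
    private String search(String query) {
        return "policies-for:" + query;
    }
}
```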

10.1.10 Obligation Manager

The Obligation Management System (OMS) aims at providing privacy-aware identity life cycle management capabilities to organisations that collect and process PII. By automating the management of obligation policies, the OMS ensures that expectations, duties and personal preferences on how to handle personal data (including PII retention, deletion, data transformation, notifications, etc.) are fulfilled by organisations. This is

achieved by explicitly representing privacy obligations in an extensible format, automatically scheduling and enforcing them, and monitoring their compliance.

The PRIME Obligation Management System was implemented by PRIME partner HP Labs, who are currently planning to release the code.

10.1.11 Identity Mixer

The IBM Identity Mixer [Idemix] system, developed at IBM's Zurich Research Laboratory, is an implementation of the Camenisch-Lysyanskaya anonymous credential system. A credential system consists of users and organisations. Organisations know the users only by pseudonyms, and different pseudonyms of the same user cannot be linked. Yet, an organisation can issue a credential to a pseudonym, and the corresponding user can prove possession of this credential to another organisation (which knows this user by a different pseudonym) without revealing anything more than the fact that such a credential is in the user's possession. Credentials can be set for unlimited use (multiple-show credentials) or for one-time use (one-show credentials). Possession of a multiple-show credential can be demonstrated an arbitrary number of times; these demonstrations cannot be linked to each other.

Apart from the pure cryptographic operations of the Camenisch-Lysyanskaya credential system, Identity Mixer also implements the high-level protocols and policies to deal with credentials (issuing and presentation), as well as user interfaces to select credentials upon a request for information. The first version of the Identity Mixer software was developed by IBM Research about six years ago and has been subject to continuous improvement ever since, in particular in the context of the PRIME project. All parts of Identity Mixer except for the cryptographic operations have been made available as open source within the framework of the Higgins project. The cryptographic parts are expected to become available shortly.

Anonymous credentials are a core element for enabling privacy-enhancing identity management. The Identity Mixer system will be maintained and extended through the PrimeLife project. While contributing Identity Mixer to the open source community is an ongoing effort, few aspects of it (or of anonymous credentials in general) have been standardised. Relevant aspects include token formats, wire formats, cryptographic algorithms, claim languages, the API for a credential system, etc.

10.2 Results Expected from PrimeLife

PrimeLife has just started and therefore we have only just started to define the details of the results that we aim to achieve by the end of the project. Thus, we can at this point only give a rough idea of the planned results and their suitability for standardisation and for contributions to open source bodies, which we do below.

10.2.1 Activity 1 - Privacy Life

This activity's goal is to supply privacy-enabled identity management for people's whole lives. To this end, the activity will build research prototypes for the following real-life scenarios:

• Trusted Content: supporting at least some user-centric control over a user's personal data in situations where she wants to share personal information with several other people.
• Selective Access Control in Social Networks: adapting the user-centric privacy-enhancing identity management of bi- or at most trilateral settings to new technological as well as business settings.
• Managing identity, trust and privacy throughout life: sustainable privacy and identity management from birth to death.

A focus will be to formulate appropriate use cases and scenarios, condense them into requirements, build an appropriate prototype concentrating on the relevant aspects (otherwise both expense and complexity explode), and learn the relevant lessons from building, using, and evaluating the prototype, without getting stuck in irrelevant details and shortcomings of the prototype.

While we will make these prototypes open source when possible and feasible, we do not expect that the outcomes of this activity will reach a suitable maturity level for standardisation.

10.2.2 Activity 2 - Mechanisms

The objective of Activity 2 is to perform basic research addressing the complex tool-related problems of guaranteeing privacy and trust in the electronic society. The results of the activity will be research findings advancing the state of the art of current technologies and solutions. Proof-of-concept prototypes implementing novel techniques will also be developed, producing tools that can be used by other activities or even be made publicly available.

The development of privacy-enhancing mechanisms and techniques will in particular focus on the following key problems:

• development of cryptographic solutions for protecting users' credentials and identity management transactions; this includes various extensions of anonymous credentials that will potentially be implemented as part of Identity Mixer, as well as mechanisms to store credentials securely;
• development of metrics for expressing privacy compliance, trust and utility of released data;
• identification of privacy threats due to re-identification and correlation in data collection, and of means to assess protection/exposure against them;
• definition of novel approaches for empowering users, enabling them to acquire control over accesses to and uses of their own data.

The results of this activity will influence the work of Activity 5 in terms of requirements and/or new policy mechanisms. We expect that a number of the mechanisms produced by this activity can be made available as open source, and that some of them will be relevant to standardisation bodies (either directly or in conjunction with the results' use in other activities).

10.2.3 Activity 4 - User Interfaces

Privacy-enhancing identity management will only be successful if its technologies are accepted and employed by the end users. For this reason, the research and development of user interfaces for PrimeLife technologies that are user-friendly and compliant with regulations will play a key role. The activity will work on UI representation of privacy-enhancing identity management concepts, for trust and assurance as well as for policy display and administration.

In particular, the protocols between the user-side online functions for exercising user rights and the service-side transparency support tools that respond to them need to be standardised. PrimeLife partner KAU has already raised this at the ISO/IEC JTC 1/SC 27/WG 5 meetings. The transparency tools that will be developed can also be provided as open source.

In addition, in PRIME we have derived a set of predefined privacy preferences, which a user can choose from and fill in with concrete data values, or customise "on the fly" and store under a new name. The predefined privacy preferences describe what types of data should be released for specific purposes under specific conditions, and the type of pseudonymity and level of linkability to be used. This set also includes the most privacy-friendly options for acting anonymously or for releasing as little information as needed for a certain service. Such a set of predefined privacy preferences, as well as mechanisms for customising them on the fly (i.e. when they are used in interactions with service sides), could be standardised and provided with our open source products.
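A predefined preference of this kind can be pictured as a small record matched against a service's data request and customised under a new name. The sketch below is our own illustration of that idea; the field names and the `permits`/`customised` operations are hypothetical and do not reflect the PRIME data format.

```python
# Hypothetical sketch of a predefined privacy preference as described
# above: which data types may be released, for which purpose, and under
# which pseudonymity setting. Field names are illustrative only.

from dataclasses import dataclass, replace

@dataclass(frozen=True)
class PrivacyPreference:
    name: str
    allowed_data: frozenset        # data types the user agrees to release
    purpose: str                   # purpose the release is restricted to
    pseudonym: str = "transaction" # e.g. a fresh pseudonym per transaction

    def customised(self, new_name, **changes):
        """Customise 'on the fly' and store the result under a new name."""
        return replace(self, name=new_name, **changes)

    def permits(self, requested_data, purpose):
        """Check a service's request against this preference."""
        return purpose == self.purpose and set(requested_data) <= self.allowed_data

anonymous = PrivacyPreference("anonymous shopping",
                              frozenset({"delivery_region"}), "delivery")
assert anonymous.permits(["delivery_region"], "delivery")
assert not anonymous.permits(["email"], "delivery")   # data type not allowed

custom = anonymous.customised("shopping with e-mail",
                              allowed_data=frozenset({"delivery_region", "email"}))
assert custom.permits(["email"], "delivery")
```

The most privacy-friendly predefined entries would simply carry a minimal `allowed_data` set, matching the "release as little as needed" option described above.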

Furthermore, the multi-layered structure of privacy policies as suggested by the Art. 29 Working Party, in combination with mechanisms for obtaining informed consent for the disclosure of personal data or proofs with credentials, could be standardised. In Task 4.3.2, we also mention in particular that the multi-layered structure for presenting policies could be extended by another top level consisting of standardised policy icons.

10.2.4 Activity 5 - Policies

The goal of Activity 5 is to design security and privacy policy systems for PrimeLife. This includes the analysis and formalisation of legal requirements, research into new policy mechanisms and machine-readable languages for privacy, as well as the design and analysis of actual policy decision engines for the demonstrators that we build. Activity 5 is structured in three Work Packages focusing on requirements, research aspects, and the development of policy systems.

Today, privacy policies have largely focused on formalising the permitted usage of personal data. They were often based on a central model where an administrator defined a single policy covering given data items. Beyond reproducing this state of the art, the policy activity faces multiple research challenges:

• Broader coverage: Privacy requirements cover more than mere access to information. Examples include the retention of data or the combination of data items. The goal is to cover the complete life-cycle of identities.
• Policy composition: In the emerging Web, data and policies will be composed. We need to find ways to enable distributed management and authoring of policies.
• Stronger enforcement: Today, privacy policies are mainly enforced by means of access control monitors. One challenge we face is to invent additional enforcement mechanisms that allow protection despite the dispersion of data and the limited trust in the processor of personal information.
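As a minimal illustration of why pure access control falls short of the broader coverage named above, the sketch below attaches a purpose restriction and a retention limit to a data item and enforces both whenever the item is read. The class and field names are our own illustration, not a PrimeLife design; real "sticky policy" mechanisms would in addition have to survive the data leaving the collector's systems.

```python
# Minimal sketch of a policy travelling with a data item ("sticky
# policy") that covers more of the life-cycle than an access-control
# check: a purpose binding plus a retention period enforced at read
# time. Illustrative only, not a PrimeLife design.

import time

class PolicedItem:
    def __init__(self, value, purpose, retention_seconds):
        self.value = value
        self.purpose = purpose
        self.expires_at = time.time() + retention_seconds

    def read(self, purpose):
        if purpose != self.purpose:
            raise PermissionError("purpose does not match the one consented to")
        if time.time() > self.expires_at:
            raise PermissionError("retention period expired; item must be deleted")
        return self.value

item = PolicedItem("alice@example.org", purpose="shipping",
                   retention_seconds=3600)
assert item.read("shipping") == "alice@example.org"
try:
    item.read("marketing")       # secondary use is refused
except PermissionError:
    pass
```

Even this toy version shows the gap: an access control monitor alone would grant or deny the read once, whereas retention and purpose binding must be checked on every use, including uses by downstream processors.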

We expect that a number of the results of this activity will be suitable for release as open source and will also influence standardisation bodies: on the one hand, we can build upon work done in the PRIME project; on the other hand, we aim to base our work on, and extend, existing standards as much as possible to achieve the best impact.

10.2.5 Activity 6 - Infrastructures

The goal of Activity 6 is to investigate the impact of security and privacy requirements on infrastructures. It studies technical and non-technical (e.g. legal, economic) requirements for successfully implementing solutions on top of existing and newly developed infrastructural elements. In particular, its work packages are concerned with:

• Privacy-preserving identity management for service architectures and general issues of identity management infrastructures, with a special focus on web architectures, web service architectures, and related architectures.
• Trusted infrastructure elements using trusted devices as an infrastructure foundation. This includes smart cards, TPMs, and similar solutions.
• Privacy-enabled service composition.

We expect this activity to produce results that are relevant for standards regarding infrastructures and interoperability, in particular for Work Packages 6.1 and 6.3. As the activity will also do some implementation, contributions to open source are feasible.

10.3 Perspectives

Our analysis shows a number of initiatives world-wide. Not every existing initiative is covered, but only those with a certain relevance to the undertaking of PrimeLife. The broad variety of genres, architectures, and infrastructures complicates our task. Some technologies are already standardised; others are just open source projects without an open specification attached to them. Some technologies are old, well-established standards for a given usage and context, and PrimeLife has to find out whether they are usable in the context of the project. Some of the technologies mentioned did not fully succeed in the market; PrimeLife has to analyse which factors contributed to this, in order to avoid such obstacles for the dissemination of the project's results. By design, this report gives only an initial overview; it draws a landscape of opportunities for PrimeLife.

This report also confirms the conclusions of the W3C Workshop on Languages for Privacy Policy Negotiation and Semantics-Driven Enforcement [W3C Privacy Workshop], which was initiated and prominently supported by the PRIME project [PRIME]: while there is a large number of standards and open source projects producing technology relevant for PrimeLife, the coordination between them is insufficient. This issue falls in the scope of PrimeLife, which is taking over the coordination task from the PRIME project; that task led to the creation of the W3C Policy Languages Interest Group (PLING) [PLING]. A well-established community can also provide PrimeLife with a much smoother path for disseminating its results into the initiatives covered in this report. PLING will allow PrimeLife to benefit from the multiple liaisons already established and from the coordination power of a well-recognised W3C Interest Group, thus sparing the cost of creating such a platform.

Obviously there are privacy standardisation issues beyond policies and languages, so the PrimeLife decision to establish a liaison with ISO/IEC JTC 1/SC 27/WG 5 [JCT1SC27] on "Identity Management and Privacy Technologies" was a natural step: the liaison will be used for influencing, for example, the architecture documents that WG 5 works on.

In conclusion, while there is some policy language or open source initiative for every aspect of our undertaking, it is as yet unclear how to make them work together. It is already clear that selected initiatives in standardisation as well as in open source will have to be improved to meet PrimeLife's quality expectations and to reflect the advancement of science intended by our project. This document provides a sound basis for resolving issues that come up while integrating privacy-enhancing technologies into social networks. It provides the necessary information within the project and will serve as a warehouse of off-the-shelf technologies that can be used by researchers and implementers exploring the path towards increasing privacy control in our daily lives in the information society. Wherever PrimeLife discovers inconsistencies, interoperability issues, or gaps, it will try to influence the identified projects in well-defined aspects.

Taking into account the experience from the PRIME project, however, notably the policy languages created by PRIME, PrimeLife may also have to engage in the further improvement and development of the PRIME languages, including their standardisation.

10.4 Recommended next steps for standardisation & open source

The present document has defined the landscape PrimeLife is operating in. The various requirement documents to be produced by the project will determine which of these initiatives are targeted with priority. Where approaches are incompatible, PrimeLife will have to choose which platform to integrate with. The evaluation will depend on a matrix of factors, ranging from the facility and ability to influence, create, or extend an initiative, down to small improvements, the contribution of patches, or the simple use of existing technologies. Where there are opportunities to further the European paradigm of privacy and data protection, PrimeLife may simply present its way of thinking to the initiatives.

References

[AAPML Spec] AAPML: Attribute Authority Policy Markup Language, 28 November 2006. http://www.oracle.com/technology/tech/standards/idm/igf/pdf/IGF-AAPML-spec-08.pdf

[AOL4417749] A Face Is Exposed for AOL Searcher No. 4417749, New York Times Article, 09 August 2006. http://www.nytimes.com/2006/08/09/technology/09aol.html

[APPEL] A P3P Preference Exchange Language 1.0 (APPEL1.0), 15 April 2002. http://www.w3.org/TR/P3P-preferences/

[Adblock Firefox plugin] Adblock Plus 0.7.5.4, Wladimir Palant, 09 April 2008. https://addons.mozilla.org/en-US/firefox/addon/1865

[Authentic] Authentic - Home, accessed 20 May 2008. http://authentic.labs.libre-entreprise.org/

[AutoFormer] AutoFormer 0.4.1.4, M. Onyshchuk, 05 April 2008. https://addons.mozilla.org/en-US/firefox/addon/1958

[Autofill Forms] Autofill Forms 0.9.3, Sebastian Tschan, 08 May 2008. https://addons.mozilla.org/en-US/firefox/addon/4775

[Bandit] Bandit Project Home, accessed 15 May 2008. http://www.bandit-project.org/

[Bandit Development] Bandit Project's Code pages, accessed 15 May 2008. https://code.bandit-project.org/trac

[Becker et al.] Moritz Y. Becker, Andrew D. Gordon, Cédric Fournet: SecPAL: Design and Semantics of a Decentralized Authorization Language. Technical Report MSR-TR-2006-120. Microsoft Research, 2006. http://research.microsoft.com/research/pubs/view.aspx?tr_id=1166

[BlockSite] BlockSite 0.7, Erik van Kempen and Noel Briggs, 01 March 2008. https://addons.mozilla.org/en-US/firefox/addon/3145

[BugMeNot] BugMeNot 1.8, Eric Hamiter, 05 April 2008. https://addons.mozilla.org/en-US/firefox/addon/6349

[CARML Spec] Client Attribute Requirements Markup Language (CARML) Specification, 24 November 2006. http://www.oracle.com/technology/tech/standards/idm/igf/pdf/IGF-CARML-spec-03.pdf

[CC] Common Criteria - Home, accessed 20 May 2008. http://www.commoncriteriaportal.org/

[CS Lite] CS Lite 1.3.8, Ron Beckman, 30 March 2008. https://addons.mozilla.org/en-US/firefox/addon/5207

[CSS 2] Cascading Style Sheets Level 2 Revision 1 (CSS 2.1) Specification, W3C Candidate Recommendation, 19 July 2007. http://www.w3.org/TR/CSS/

[Caja] Caja Code Site, accessed 19 May 2008. http://code.google.com/p/google-caja/

[CardSpace] CardSpace. http://www.microsoft.com/net/cardspace.aspx

[Compact Policy] Section "Compact Policies" in "The Platform for Privacy Preferences 1.0 (P3P1.0) Specification", 16 April 2002. http://www.w3.org/TR/P3P/#compact_policies

[Concordia] Concordia Project, accessed 15 May 2008. http://projectconcordia.org/index.php/Concordia

[CookieSafe] CookieSafe 2.0.6, Ron Beckman, 12 January 2007. https://addons.mozilla.org/de/firefox/addon/2497

[DOM Level 3 Core] Document Object Model (DOM) Level 3 Core Specification, W3C Recommendation, 07 April 2004. http://www.w3.org/TR/DOM-Level-3-Core

[DisTrust] Distrust 0.8, Itamar Kerbel, 23 April 2008. https://addons.mozilla.org/en-US/firefox/addon/1559

[ECMA TC39] TC39 - ECMAScript, accessed 19 May 2008. http://www.ecma-international.org/memento/TC39.htm

[ECMAScript] ECMAScript Language Specification 3rd edition, December 1999. http://www.ecma-international.org/publications/standards/Ecma-262.htm

[EPAL] Enterprise Privacy Authorization Language (EPAL 1.2), 10 November 2003. http://www.w3.org/Submission/2003/SUBM-EPAL-20031110/

[ESOE] Enterprise Sign On Engine - Home, accessed 20 May 2008. http://www.esoeproject.org/

[ETSI/3GPP] ETSI - e-Standardisation Portal, accessed 20 May 2008. http://portal.etsi.org/portal_common/home.asp?tbkey1=SCP

[EnhancedHistory] Enhanced History, AnonEMoose, 2 November 2006, accessed 30 May 2008. https://addons.mozilla.org/en-US/firefox/addon/420

[Enterprise-Java-XACML] Enterprise-Java-XACML from Google Code, (beta) version 0.0.14, February 2008. http://code.google.com/p/enterprise-java-xacml/

[Eurosmart] Eurosmart - Smart Card shipments Global Forecast 2007 (March 2007), accessed 20 May 2008. http://www.eurosmart.com/4-Documents/ForecastShipGlobalForecast.htm

[FederID] FederID - Home, accessed 20 May 2008. http://federid.objectweb.org/xwiki/bin/view/Main/WebHome

[Firefox Addons] Firefox Privacy & Security Addons, accessed 15 May 2008. https://addons.mozilla.org/en-US/firefox/browse/type:1/cat:12

[FoxTor] FoxTor 0.7.3, Sasha Romanosky, 05 November 2006. https://addons.mozilla.org/en-US/firefox/addon/3606

[Future of P3P Workshop 2002] W3C Workshop on the Future of P3P, 12-13 November 2002. http://www.w3.org/2002/p3p-ws/

[Future of P3P Workshop 2003] W3C Workshop on the long term Future of P3P and Enterprise Privacy Languages, 19-20 June 2003. http://www.w3.org/2003/p3p-ws/

[Geuer-Pollmann/Claessens] Web services and web service security standards. Information Security Technical Report, 2005, 10, 15-24. http://dx.doi.org/10.1016/j.istr.2004.11.001

[Global Platform] Global Platform - Home, accessed 20 May 2008. http://www.globalplatform.org/

[Global Platform Specs] Global Platform Card specifications, accessed 29 May 2008. http://www.globalplatform.org/specificationview.asp?id=card

[Guidescope] Guidescope - Take control of the Web, accessed 15 May 2008. http://www.guidescope.com/

[HTML 4.01] HTML 4.01 specification, W3C Recommendation, 24 December 1999. http://www.w3.org/TR/html401/

[HTML 5] HTML 5, A vocabulary and associated APIs for HTML and XHTML, W3C Working Draft, 22 January 2008. http://www.w3.org/TR/html5/

[HTML WG] W3C HTML Working Group, accessed 19 May 2008. http://www.w3.org/html/wg/

[HTML5] Hyper Text Markup Language 5 - News and Opinions, accessed 19 May 2008. http://www.w3.org/html

[HTTP bis] Hypertext Transfer Protocol Bis charter, 26 December 2007. http://www.ietf.org/html.charters/httpbis-charter.html

[HTTPbis-security] Security Requirements for HTTP, 22 February 2008. http://www.ietf.org/internet-drafts/draft-ietf-httpbis-security-properties-01.txt

[Heraldry] Heraldry Project Incubation Status, accessed 20 May 2008. http://incubator.apache.org/projects/heraldry.html

[Higgins] Higgins open source identity management project, accessed 20 May 2008. http://www.eclipse.org/higgins

[I2P] Invisible Internet Project Anonymous Network, accessed 15 May 2008. http://www.i2p2.de/

[IAB] Internet Architecture Board, accessed 20 May 2008. http://www.iab.org/

[IBM WebSphere] IBM WebSphere, accessed 19 May 2008. http://www.ibm.com/websphere

[ID-FF Profiles] Liberty ID-FF Bindings and Profiles Specification, 2004. https://www.projectliberty.org/liberty/content/download/319/2369/file/draft-liberty-idff-bindings-profiles-1.2-errata-v2.0.pdf

[ID-SIS] Liberty Alliance ID-SIS 1.0 Specifications, 17 December 2007. https://www.projectliberty.org/resource_center/specifications/liberty_alliance_id_sis_1_0_specifications

[ID-WSF] Liberty Alliance ID-WSF 1.1 Specifications, 20 May 2005. https://www.projectliberty.org/resource_center/specifications/liberty_alliance_id_wsf_1_1_specifications

[ID-WSF tools] Conor Cahill - Liberty Open Source Toolkit, accessed 20 May 2008. http://www.cahillfamily.com/OpenSource/

[IESG] The Internet Engineering Steering Group, accessed 20 May 2008. http://www.ietf.org/iesg.html

[IETF] The Internet Engineering Task Force, accessed 20 May 2008. http://www.ietf.org/

[IETF SEC] IETF - Security Area, accessed 20 May 2008. http://www.tools.ietf.org/area/sec/

[IGF] Identity Governance Framework, 26 Jul 2007. http://www.oracle.com/technology/tech/standards/idm/igf/index.html

[ISO/IEC 7816-11:2004] Identification cards -- Integrated circuit cards -- Part 11: Personal verification through biometric methods. http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=31419

[ISO/IEC 7816-13:2007] Identification cards -- Integrated circuit cards -- Part 13: Commands for application management in a multi-application environment. http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=40605

[ISO/IEC 7816-4:2005] Identification cards -- Integrated circuit cards -- Part 4: Organization, security and commands for interchange. http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=36134

[ISO/IEC 7816-8:2004] Identification cards -- Integrated circuit cards -- Part 8: Commands for security operations. http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=37989

[ISO/IEC WG4] ISO/IEC - WG4 - Integrated Circuit Cards with Contacts, accessed 20 May 2008. http://www.sc17.com/index.cfm?pageTitle=Structure&DepartmentID=6&GroupID=4&HAT=7830215901208265222974-02130042-502097&ts=02150047-478941

[ITO] IT Transfer Office (ITO), closed in 2006. http://www.ito.tu-darmstadt.de/

[Idemix] Idemix (identity mixer): Pseudonymity for e-transactions, accessed 29 May 2008. http://www.zurich.ibm.com/security/idemix/

[InFormEnter] InFormEnter 0.5.5.2, M. Onyshchuk, 05 April 2008. https://addons.mozilla.org/en-US/firefox/addon/673

[Infogrid] NetMesh InfoGrid LID, accessed 20 May 2008. http://netmesh.org/downloads/

[InterLDAP] InterLDAP - Home, accessed 20 May 2008. http://wiki.interldap.objectweb.org/xwiki/bin/view/Main/WebHome

[JCT1SC27] Homepage ISO/IEC JTC 1/SC 27 - IT SECURITY TECHNIQUES, accessed 30 May 2008. http://www.jtc1sc27.din.de/

[JRC] Joint Research Centre, accessed 19 May 2008. http://ec.europa.eu/dgs/jrc/index.cfm

[JRC P3P Resource Centre] JRC P3P Resource Centre, accessed 19 May 2008. http://p3p.jrc.it/

[JRC Policy API] JRC Policy WorkBench, accessed 29 May 2008. http://sourceforge.net/projects/jrc-policy-api

[JTC1 SC27] ISO/IEC JTC 1/SC 27 IT Security Techniques Homepage, accessed 29 May 2008. http://www.jtc1sc27.din.de/cmd?level=tpl-home&contextid=jtc1sc27

[JXTA] JXTA Community Projects, accessed 30 May 2008. https://jxta.dev.java.net/

[KESTO] kesto.org alpha, accessed 30 May 2008. http://kesto.org

[Kerberos TP 1.1] Web Services Security Kerberos Token Profile 1.1, OASIS Standard Specification, 1 February 2006. http://docs.oasis-open.org/wss/v1.1/wss-v1.1-spec-os-KerberosTokenProfile.pdf

[LASSO] Liberty Alliance Single Sign-On (LASSO) - Home, accessed 20 May 2008. http://lasso.entrouvert.org/

[LemonLDAP] LemonLDAP - Home, accessed 20 May 2008. http://wiki.lemonldap.objectweb.org/xwiki/bin/view/NG/Presentation

[Liberty] The Liberty Alliance, accessed 20 May 2008. https://www.projectliberty.org/

[Liberty Adoption] Liberty Standards Adoption, accessed 20 May 2008. http://projectliberty.org/liberty/adoption

[Lightbulb] Lightbulb - Federated Identity Integration for LAMP and MARS, accessed 20 May 2008. https://lightbulb.dev.java.net/

[MS .NET] Microsoft .NET Framework, accessed 19 May 2008. http://www.microsoft.com/net/

[Microformats] Microformats, accessed 19 May 2008. http://www.microformats.org

[MozPETSSourceforge] MozPETS Sourceforge Project Page, accessed 30 May 2008. http://sourceforge.net/projects/mozpets/

[MozPETs] MozPETs: Mozilla Privacy Enhancement Technologies, accessed 15 May 2008. http://mozpets.sourceforge.net/

[MozPETsPST05] MozPETs - a Privacy enhanced Web Browser, In Procedings of the Third Annual Conference on Privacy, Security and Trust (PST05). http://www.ito.tu-darmstadt.de/publs/pdf/BruecknerVoss_Mozpets.pdf

[NFC] The Near Field Communication (NFC) Forum, accessed 20 May 2008. http://www.nfc-forum.org/home

[Netmesh license] Netmesh Sleepycat License, accessed 20 May 2008. http://netmesh.org/downloads/netmesh-infogrid-lid/license.txt

[Noserub] Noserub - Decentral Social Network, accessed 21 May 2008. http://noserub.com/

[Noserub Code] Noserub Source Code, accessed 21 May 2008. http://noserub.com/download/

[OASIS] Organisation for the Advancement of Structured Information Standards (OASIS), accessed 19 May 2008. http://www.oasis-open.org/

[OASIS IPR] OASIS Intellectual Property Rights (IPR) Policy, 2 May 2008. http://www.oasis-open.org/who/intellectualproperty.php

[OASIS XACML] OASIS XACML Technical Committee page, accessed 19 May 2008. http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=xacml

[OAuth Code] OAuth - Source Code, accessed 15 May 2008. http://oauth.net/code/

[OAuth Core 1.0] OAuth Core 1.0, accessed 15 May 2008. http://oauth.net/core/1.0/

[OAuth.NET] OAuth - Home, accessed 15 May 2008. http://oauth.net

[OSIS Feature Test] OSIS - Create New Feature Tests, accessed 15 May 2008. http://osis.idcommons.net/wiki/How_to_Create_New_FeatureTests

[OSIS Participants] OSIS Participants, accessed 15 May 2008. http://osis.idcommons.net/wiki/Category:Participant

[OSIS Results] I3:Overall Results, accessed 15 May 2008. http://osis.idcommons.net/wiki/I3:Overall_Results

[OSIS at Identity Commons] OSIS: Open Source Identity Systems, accessed 15 May 2008. http://osis.idcommons.net/

[OWL] OWL Web Ontology Language Overview, Deborah L. McGuinness, Frank van Harmelen, W3C Recommendation 10 February 2004, accessed 30 May 2008. http://www.w3.org/TR/owl-features/

[OpenIAM] OpenIAM - Project Home, accessed 15 May 2008. https://openiam.dev.java.net/

[OpenIAM Code] OpenIAM - Source Code Browser, accessed 15 May 2008. https://openiam.dev.java.net/source/browse/openiam/

[OpenID 1.0] OpenID Attribute Exchange 1.0 - Final, 05 December 2007. http://openid.net/specs/openid-attribute-exchange-1_0.html

[OpenID 2.0] OpenID Authentication 2.0 - Final, 05 December 2007. http://openid.net/specs/openid-authentication-2_0.html

[OpenID Foundation] OpenID Foundation, accessed 20 May 2008. http://openid.net/foundation/

[OpenID libraries] OpenID Libraries, accessed 20 May 2008. http://wiki.openid.net/Libraries

[OpenID4Java] OpenID4Java Source Code, accessed 20 May 2008. http://code.sxip.com/openid4java/

[OpenLiberty-J] OpenLiberty-J - Home, accessed 20 May 2008. http://www.openliberty.org/wiki/index.php/Main_Page

[OpenSAML] OpenSAML - Home, accessed 20 May 2008. https://spaces.internet2.edu/display/OpenSAML/Home

[OpenSSO] OpenSSO - OpenFederation: The Open Web SSO project, accessed 15 May 2008. https://opensso.dev.java.net/

[OpenSSO Code] OpenSSO - Source Code FAQ, accessed 15 May 2008. https://opensso.dev.java.net/public/about/faqcenter/faqsourcecode.html

[P3P] Platform for Privacy Preferences (P3P) Project, 20 November 2007. http://www.w3.org/P3P/

[P3P 1.0 Spec] The Platform for Privacy Preferences 1.0 (P3P1.0) Specification, 16 April 2002. http://www.w3.org/TR/P3P

[P3P 1.1 Spec] The Platform for Privacy Preferences 1.1 (P3P1.1) Specification, 13 November 2006. http://www.w3.org/TR/P3P11/

[PLING] PLING - W3C Policy Languages Interest Group, accessed 20 May 2008. http://www.w3.org/Policy/pling/

[PRIME] PRIME - Privacy and Identity Management for Europe, EU FP6 IST-507591, 2004-2008. https://www.prime-project.eu/

[Pamela Project] The Pamela Project, accessed 15 May 2008. http://pamelaproject.com/

[PamelaWare] PamelaWare, accessed 15 May 2008. http://code.pamelaproject.com/

[PhProxy] PhProxy - InBasic 3.0.2C, InBasic, 18 April 2008. https://addons.mozilla.org/en-US/firefox/addon/3239

[Prima] The PRIMA project, accessed 15 May 2008. http://www.ito.tu-darmstadt.de/projects/prima/

[Privacy Bird] Privacy Bird - Find web sites that respect your privacy, accessed 19 May 2008. http://www.privacybird.org/

[Privoxy] Privoxy Copyright, License and History, accessed 15 May 2008. http://www.privoxy.org/user-manual/copyright.html

[Privoxy Homepage] Privoxy - Home Page, accessed 15 May 2008. http://www.privoxy.org/

[RDF-PRIMER] RDF Primer, Frank Manola, Eric Miller, W3C Recommendation 10 February 2004, accessed 30 May 2008. http://www.w3.org/TR/rdf-primer/

[REL TP 1.1] Web Services Security Rights Expression Language (REL) Token Profile 1.1, OASIS Standard, 1 February 2006. http://www.oasis-open.org/committees/download.php/16687/oasis-wss-rel-token-profile-1.1.pdf

[RFC] Request for Comments, accessed 20 May 2008. http://www.ietf.org/rfc.html

[RFC 2616] Hypertext Transfer Protocol -- HTTP/1.1 specification, June 1999. http://www.ietf.org/rfc/rfc2616.txt

[RFC 2818] HTTP Over TLS, May 2000. http://www.ietf.org/rfc/rfc2818.txt

[RIF] RIF Working Group, accessed 19 May 2008. http://www.w3.org/2005/rules/wiki/RIF_Working_Group

[RIF Use Cases] RIF Use Cases and Requirements, 20 April 2008. http://www.w3.org/2005/rules/wiki/UCR

[RefControl] RefControl 0.8.10, James Abbatiello, 02 February 2008. https://addons.mozilla.org/en-US/firefox/addon/953

[Removing Data Transfer from P3P] Removing Data Transfer from P3P, 21 September 1999. http://www.w3.org/P3P/data-transfer.html

[Roboform] AI Roboform Toolbar for Firefox 6.9.84, Siber Systems, 14 November 2007. https://addons.mozilla.org/en-US/firefox/addon/750

[SAML 2.0 Bindings] Bindings for the OASIS Security Assertion Markup Language (SAML) V2.0, 15 March 2005. http://docs.oasis-open.org/security/saml/v2.0/saml-bindings-2.0-os.pdf

[SAML TP 1.1] Web Services Security SAML Token Profile 1.1, OASIS Standard, 1 February 2006. http://docs.oasis-open.org/wss/v1.1/wss-v1.1-spec-os-SAMLTokenProfile.pdf

[SICSACML] SICSACML: XACML 3.0 Patch for Sun's XACML 2.0 Implementation, accessed 19 May 2008. http://www.sics.se/node/2465

[SOAP12] SOAP Version 1.2, Second Edition, W3C Recommendation, 27 April 2007. http://www.w3.org/TR/soap12

[SPARQL] SPARQL Query Language for RDF, Eric Prud'hommeaux, Andy Seaborne, W3C Recommendation 15 January 2008, accessed 30 May 2008. http://www.w3.org/TR/rdf-sparql-query/

[SafeCache] SafeCache 0.9, Collin Jackson, 23 November 2006. https://addons.mozilla.org/en-US/firefox/addon/1474

[SeatBelt] VeriSign's OpenID SeatBelt 1.0.0.2846, VeriSign, Inc., 23 July 2007. https://addons.mozilla.org/de/firefox/addon/5133

[Semantic Web] W3C Semantic Web Activity, accessed 20 May 2008. http://www.w3.org/2001/sw/

[Shibboleth] Shibboleth - Home, accessed 20 May 2008. http://shibboleth.internet2.edu/

[SourceID] SourceID - SAML, Liberty, WS-Federation and CardSpace Toolkits, accessed 15 May 2008. http://www.sourceid.org/download/index.cfm

[Stealther] Stealther 1.0.2, Filip Bozic, 29 April 2008. https://addons.mozilla.org/en-US/firefox/addon/1306

[Sun XACML] Sun Implementation of XACML, version 1.2, 14 February 2006. http://sourceforge.net/projects/sunxacml/

[Sxipper] Sxipper 2.1.0, Sxip Identity, 06 May 2008. https://addons.mozilla.org/en-US/firefox/addon/4865

[TCG] The Trusted Computing Group, accessed 16 May 2008. https://www.trustedcomputinggroup.org/home

[TOR] Tor: anonymity online, accessed 15 May 2008. http://www.torproject.org/

[TREPALXACML] A Comparison of Two Privacy Policy Languages: EPAL and XACML, Anne Anderson, accessed 30 May 2008. http://research.sun.com/techrep/2005/smli_tr-2005-147/TRCompareEPALandXACML

[Temporary Inbox] Temporary Inbox 2.1.1, Pascal Beyeler, 05 April 2008. https://addons.mozilla.org/en-US/firefox/addon/2650

[Torbutton] Torbutton 1.0.4.01, Scott Squires and Mike Perry, 13 August 2007. https://addons.mozilla.org/en-US/firefox/addon/2275

[TrackMeNot] TrackMeNot 0.5.17, Daniel C. Howe and Helen Nissenbaum, 02 July 2007. https://addons.mozilla.org/en-US/firefox/addon/3173

[URI] Uniform Resource Identifier (URI): Generic Syntax, January 2005. http://www.ietf.org/rfc/rfc3986.txt

[W3C] The World Wide Web Consortium, accessed 20 May 2008. http://www.w3.org

[W3C Privacy Workshop] W3C Workshop on Languages for Privacy Policy Negotiation and Semantics-Driven Enforcement, October 2006. http://www.w3.org/2006/07/privacy-ws/

[WAF WG] Web Application Formats Working Group, accessed 19 May 2008. http://www.w3.org/2006/appformats

[WEBARCH] Architecture of the World Wide Web, Volume 1, W3C Recommendation 15 December 2004. http://www.w3.org/TR/webarch/

[WOT] Web of Trust - WOT 20080421, Against Intuition, 23 April 2008. https://addons.mozilla.org/en-US/firefox/addon/3456

[WS-MetadataExchange 1.1] Web Services Metadata Exchange, Version 1.1, August 2006. http://download.boulder.ibm.com/ibmdl/pub/software/dw/specs/ws-mex/metadataexchange.pdf

[WS-Policy 1.5] Web Services Policy 1.5 - Framework, W3C Recommendation, 04 September 2007. http://www.w3.org/TR/ws-policy/

[WS-Policy Primer] Web Services Policy 1.5 - Primer, 12 November 2007. http://www.w3.org/TR/2007/NOTE-ws-policy-primer-20071112/

[WS-Policy WG] Web Services Policy Working Group, accessed 20 May 2008. http://www.w3.org/2002/ws/policy/

[WS-PolicyAttachment 1.2] Web Services Policy 1.2 - Attachment, W3C Member Submission, 25 April 2006. http://www.w3.org/Submission/WS-PolicyAttachment/

[WS-PolicyAttachment 1.5] Web Services Policy 1.5 - Attachment, 04 September 2007. http://www.w3.org/TR/ws-policy-attach/

[WS-SecureConversation] WS-SecureConversation, OASIS Standard, 1 March 2007. http://docs.oasis-open.org/ws-sx/ws-secureconversation/v1.3/ws-secureconversation.pdf

[WS-Security] Web Services Security Core Specification 1.1, OASIS Standard, 01 February 2007. http://www.oasis-open.org/committees/download.php/16790/wss-v1.1-spec-os-SOAPMessageSecurity.pdf

[WS-SecurityPolicy 1.3] WS-SecurityPolicy 1.3, OASIS Editor Draft, 1 February 2008. http://docs.oasis-open.org/ws-sx/ws-securitypolicy/v1.3/ws-securitypolicy-1.3-spec-ed-01.pdf

[WS-Trust 1.3] WS-Trust 1.3, OASIS Standard, 19 March 2007. http://docs.oasis-open.org/ws-sx/ws-trust/v1.3/ws-trust.pdf

[WS-XACML Spec] Web Services Profile of XACML (WS-XACML) Version 1.0, 12 December 2006. http://www.oasis-open.org/committees/download.php/21490/xacml-3.0-profile-webservices-spec-v1.0-wd-8-en.pdf

[WSC] Web Security Context Working Group, accessed 20 May 2008. http://www.w3.org/2006/WSC/

[WSFed TC] OASIS Web Services Federation (WSFED) Technical Committee, accessed 20 May 2008. http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=wsfed

[Web API] Web API Working Group, accessed 19 May 2008. http://www.w3.org/2006/webapi

[Wikipedia] Wikipedia - Internet Junkbuster, accessed 15 May 2008. http://en.wikipedia.org/wiki/Internet_Junkbuster

[X.509 Certificate TP 1.1] Web Services Security X.509 Certificate Token Profile 1.1, OASIS Standard Specification, 1 February 2006. http://docs.oasis-open.org/wss/v1.1/wss-v1.1-spec-os-x509TokenProfile.pdf

[XADES] XML Advanced Electronic Signatures (XAdES) - Details of 'RTS/ESI-000034' Work Item, accessed 20 May 2008. http://webapp.etsi.org/workprogram/Report_WorkItem.asp?WKI_ID=21353

[XHTML 1.0] XHTML 1.0 The Extensible HyperText Markup Language (Second Edition), W3C Recommendation, 1 August 2002. http://www.w3.org/TR/xhtml1/

[XHTML 2] XHTML2 Working Group Home Page, accessed 19 May 2008. http://www.w3.org/MarkUp

[XML Enc] XML Encryption Syntax and Processing, W3C Recommendation, 10 December 2002. http://www.w3.org/TR/xmlenc-core/

[XML Security] XML Security Working Group Charter, accessed 20 May 2008. http://www.w3.org/2008/02/xmlsec-charter

[XML Sig] XML-Signature Syntax and Processing, W3C Recommendation, 12 February 2002. http://www.w3.org/TR/xmldsig-core/

[XML Sig Transform] Decryption Transform for XML Signature, 10 December 2002. http://www.w3.org/TR/xmlenc-decrypt

[XML Sig/Enc Workshop] W3C Workshop on Next Steps for XML Signature and XML Encryption, 25-26 September 2007. http://www.w3.org/2007/xmlsec/ws/report.html

[XMLHttpRequest] The XMLHttpRequest Object specification, W3C Working Draft, 15 April 2008. http://www.w3.org/TR/XMLHttpRequest/

[XMLSec] XML Security Specifications Maintenance Working Group, accessed 20 May 2008. http://www.w3.org/2007/xmlsec/

[Yadis Implementations] Yadis Implementations Overview, accessed 20 May 2008. http://yadis.org/wiki/Yadis_Implementations

[Yadis Spec] Yadis Specification Version 1.0, 18 March 2006. http://yadis.org/wiki/Yadis_1.0_(HTML)

[Yadis.ORG] Yadis 1.0 - The Identity and Accountability Foundation for Web 2.0, accessed 20 May 2008. http://yadis.org/

[ZXID] ZXID Home - Open Source IdM for the Masses, accessed 20 May 2008. http://www.zxid.org/
