CanDID: Can-Do Decentralized Identity with Legacy Compatibility, Sybil-Resistance, and Accountability

Deepak Maram∗¶, Harjasleen Malvai†¶, Fan Zhang∗¶, Nerla Jean-Louis‡¶, Alexander Frolov†¶, Tyler Kell∗¶, Tyrone Lobban§, Christine Moy§, Ari Juels∗¶, Andrew Miller‡¶
∗Cornell Tech, †Cornell University, ‡UIUC, §J. P. Morgan, ¶IC3, The Initiative for CryptoCurrencies & Contracts

Abstract: We present CanDID, a platform for practical, user-friendly realization of decentralized identity, the idea of empowering end users with management of their own credentials.

While decentralized identity promises to give users greater control over their private data, it burdens users with management of private keys, creating a significant risk of key loss. Existing and proposed approaches also presume the spontaneous availability of a credential-issuance ecosystem, creating a bootstrapping problem. They also omit essential functionality, like resistance to Sybil attacks and the ability to detect misbehaving or sanctioned users while preserving user privacy.

CanDID addresses these challenges by issuing credentials in a user-friendly way that draws securely and privately on data from existing, unmodified web service providers. Such legacy compatibility similarly enables CanDID users to leverage their existing online accounts for recovery of lost keys. Using a decentralized committee of nodes, CanDID provides strong confidentiality for users' keys, real-world identities, and data, yet prevents users from spawning multiple identities and allows identification (and blacklisting) of sanctioned users.

We present the CanDID architecture and report on experiments demonstrating its practical performance.

I. INTRODUCTION

Identity management lies at the heart of any user-facing system, be it a social media platform, online game, or collaborative tool. Backlash against the handling of personal information by large tech firms [2], [3] has recently spawned a new approach to identity management called decentralized identity, a.k.a. self-sovereign identity [7], [10], [32], [74].

Decentralized identity systems allow users to gather and manage their own credentials under the banner of self-created decentralized identifiers (DIDs). By controlling private keys associated with DIDs, users are empowered to disclose or withhold their credentials as desired in online interactions. Enterprises also benefit by limiting the liability associated with storage of sensitive user data [49].

The most commonly cited use cases for DIDs involve users authorizing release of personal credentials from user devices to websites [61]. For example, an online job applicant might release a digitally signed credential from her university showing that she has received a bachelor's degree and a proof of residency in the country in which she is applying. Initiatives such as the Decentralized Identity Foundation [32] and the Decentralized Identifiers (DID) working group of W3C [74] are developing standards and use cases to support such transactions. They largely fail, however, to address four main technical and usability goals that we target in this work. Specifically, these goals are especially challenging to achieve, as we seek to do, in a privacy-preserving way:

1) Legacy compatibility: Most proposed decentralized identity systems presume the existence of a community of issuers of digitally signed credentials. But such issuers may not arise (and existing credential issuers may not begin to issue digitally signed variants of existing credentials) until DID infrastructure sees use. The result is a bootstrapping problem. A big impediment to DID adoption is the inability of proposed systems to leverage the data on users available in existing web services that don't issue credentials.
2) Sybil-resistance: Proposed decentralized identity systems do not deduplicate user identities. Unique per-user identities are critical, though, in many systems: anonymous voting, fair currency distribution ("airdrops") [14], etc.
3) Accountability: It is challenging both to provide user privacy, i.e., conceal users' real-world identities, and achieve compliance with regulations such as Know-Your-Customer (KYC) / Anti-Money-Laundering (AML). Particularly important is an ability to screen users of the system, i.e., identify and bar identified criminal users on sanctions lists, as is done in the sanctions screening process performed by financial institutions.
4) Key recovery: In DID systems, users bear the burden of managing their own private keys. Key recovery is potentially the Achilles' heel of such systems, as it is well known that users regularly lose valuable keys. Billions of dollars of cryptocurrency have vanished because of lost keys [60].

Proposed solutions to these problems fall short in various ways. For example, for key management / recovery, users can delegate or escrow their private keys with an online service (like Coinbase for cryptocurrency [4]), but that then effectively re-centralizes identity management. W3C proposes a quorum of trusted parties to enable key recovery, but omits details [74]; Microsoft plans to unveil a new approach, but details remain forthcoming at the time of writing [53].

The other three enumerated challenges, legacy compatibility, Sybil-resistance, and (privacy-preserving) sanctions screening, have seen little or no treatment in proposed decentralized identity systems, and treatment relevant to such systems appears in only a few works in the research literature, e.g., [22], [23], [26], [28], [38], [81].

A. CanDID

In this paper, we present CanDID¹, a decentralized-identity system that aims to address the four major challenges highlighted above, while providing strong privacy properties. CanDID can act as a freestanding service or can be coupled with other decentralized identity systems. It is decentralized in the sense that it relies on a committee of nodes, which may represent distinct entities.

¹ The name means "honestly presenting information"; we also use it to signify that users "can do decentralized identities (DIDs)".

CanDID consists of two subsystems: an identity system for issuing and managing credentials, and a key recovery system.

Fig. 1: CanDID architecture overview and workflow.
(a) High-level credential-issuance workflow. Parties: legacy identity providers, CanDID committee. Steps: ❶ Port identities, ❷ Deduplication, ❸ DID issuance, ❹ Use DID.
(b) Key backup and recovery workflow. Parties: legacy authenticator P, CanDID committee. Setup: back up DID key and set authenticator to P. Steps: ❶ Obtain an authentication proof, ❷ Verify proofs, ❸ Release DID key, ❹ Use DID key.

1) Identity system: CanDID leverages an oracle (specifically, either DECO [86] or Town Crier [85]) to port identities and credentials securely from existing web services. These services can be social media platforms, online bank accounts, e-mail accounts, etc. The oracles used by CanDID allow it to scrape websites in order to construct trustworthy credentials without providers needing to explicitly create DID-compatible credentials or even be aware of CanDID, easing the way for bootstrapping a credential ecosystem.

Credential privacy: In support of its strong privacy objectives, CanDID allows users to construct credentials that reveal information selectively via zero-knowledge arguments. For instance, a user can construct a credential proving that she is at least 18 years of age. In doing so, she need not reveal her actual birthdate either to committee nodes or to entities to which she presents the credential. Second, CanDID provides strong membership privacy. Committee nodes and web-identity providers not only cannot learn users' real-world identities, but cannot learn user membership, i.e., whether a given real-world user is active in CanDID.

As in other DID schemes, e.g., [10], [32], [74], CanDID supports the use of pairwise credentials. That is, it permits users to generate credentials unique to each user-service relationship and unlinkable to those used in other relationships. CanDID can in principle alternatively support fully anonymous credentials. Conversely, CanDID is compatible with models in which users register pseudonymous decentralized identifiers (DIDs) on a blockchain or other distributed ledger.

Novel capabilities: Beyond credential issuance, CanDID's identity system includes two new and distinctive capabilities: Sybil-resistance and sanctions screening.

a) Sybil-resistance: CanDID supports deduplication of identities with respect to unique numerical identifiers like So-

2) Key-recovery system: Our approach allows users to leverage existing web authentication schemes and engage in a familiar, user-friendly workflow to recover their keys. Users may store their private keys on whatever devices (e.g., mobile phones) they use regularly. Users can back up their keys with the CanDID committee (privately, via secret-sharing) and prespecify recovery accounts on web services of their choice, along with a recovery policy (e.g., authentication of 2-out-of-3 accounts). To recover her key, a user proves successful logins under her chosen policy. Fig. 1b shows the recovery workflow.

Key-recovery privacy: CanDID's use of oracles allows a user to prove she was successful in logging into a preselected account, without revealing account information to committee nodes or her use of CanDID to web service providers. Naïve approaches, e.g., use of OAuth, would leak such information.

B. Contributions and Paper Organization

To summarize, CanDID offers a practical approach to decentralized identity
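
As an illustration of the key backup and recovery workflow described for the key-recovery system above, the following minimal sketch shows one way a key could be secret-shared among committee nodes and released only when a recovery policy such as 2-out-of-3 accounts is met. It is not CanDID's implementation. The text says only that backup uses secret-sharing; the choice of Shamir's scheme, the field modulus, the committee size and threshold, and the names `split_secret`, `recover_secret`, and `policy_satisfied` are all assumptions made for illustration. The policy check here is a plain set comparison and does not model the oracle-based, privacy-preserving login proofs.

```python
import secrets

# Field modulus (assumption): 2**521 - 1 is a Mersenne prime, large enough
# to hold a 256-bit key as a field element.
PRIME = 2**521 - 1


def split_secret(secret: int, threshold: int, num_shares: int) -> list[tuple[int, int]]:
    """Split `secret` into Shamir shares; any `threshold` of them recover it."""
    if not 0 <= secret < PRIME:
        raise ValueError("secret out of range for the field")
    if not 1 <= threshold <= num_shares:
        raise ValueError("need 1 <= threshold <= num_shares")
    # Random polynomial of degree threshold-1 whose constant term is the secret.
    coeffs = [secret] + [secrets.randbelow(PRIME) for _ in range(threshold - 1)]
    shares = []
    for x in range(1, num_shares + 1):
        y = 0
        for c in reversed(coeffs):  # Horner evaluation of the polynomial at x
            y = (y * x + c) % PRIME
        shares.append((x, y))
    return shares


def recover_secret(shares: list[tuple[int, int]]) -> int:
    """Lagrange-interpolate the shared polynomial at x = 0."""
    secret = 0
    for i, (xi, yi) in enumerate(shares):
        num, den = 1, 1
        for j, (xj, _) in enumerate(shares):
            if i != j:
                num = (num * -xj) % PRIME
                den = (den * (xi - xj)) % PRIME
        # pow(den, -1, PRIME) is the modular inverse (Python 3.8+).
        secret = (secret + yi * num * pow(den, -1, PRIME)) % PRIME
    return secret


def policy_satisfied(verified: set[str], recovery_accounts: set[str], required: int) -> bool:
    """True if at least `required` prespecified accounts have verified logins."""
    return len(verified & recovery_accounts) >= required


if __name__ == "__main__":
    key = secrets.randbits(256)                            # hypothetical DID private key
    shares = split_secret(key, threshold=3, num_shares=4)  # hypothetical 4-node committee
    accounts = {"email", "bank", "social"}                 # hypothetical recovery accounts
    if policy_satisfied({"email", "bank"}, accounts, required=2):
        assert recover_secret(shares[:3]) == key
```

In this sketch each committee node would hold one `(x, y)` share, so fewer than `threshold` colluding nodes learn nothing about the key, while the policy check decides whether nodes should release their shares at all.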
