A Decentralized and Secure Computing Paradigm Gihan J

Total Page:16

File Type:pdf, Size:1020Kb

A Decentralized and Secure Computing Paradigm Gihan J 1 Blockchain as a Service: A Decentralized and Secure Computing Paradigm Gihan J. Mendis∗, Yifu Wu∗, Jin Wei∗, Moein Sabounchi∗, and Rigoberto Roche’y ∗Department of Computer and Information Technology Purdue University, West Lafayette, Indiana yNASA Glenn Research Center, Cleveland, Ohio Abstract—Thanks to the advances in machine learning, data- able to contribute to machine learning tasks in a privacy- driven analysis tools have become valuable solutions for various preserving manner with reasonable financial incentivation. To applications. However there still remain essential challenges to achieve this goal, in this paper we develop a decentralized and develop effective data-driven methods because of the need to acquire a large amount of data and to have sufficient computing secure computing infrastructure that enables an effective and power to handle the data. In many instances these challenges privacy-preserving collaboration between the available end- are addressed by relying on a dominant cloud computing ven- users and data creators that are called computing nodes in this dor, but, although commercial cloud vendors provide valuable paper. These computing nodes can have restricted computing platforms for data analytics, they can suffer from a lack of power and limited computing intelligence. Additionally, they transparency, security, and privacy-perservation. Furthermore, reliance on cloud servers prevents applying big data analytics can be scattered and untrustworthy to each other. in environments where the computing power is scattered. To In recent years, several techniques have been proposed address these challenges, a decentralize, secure, and privacy- to achieve decentralized and privacy-preserving computing. preserving computing paradigm is proposed to enable an asyn- In [6], Shokri et al. proposed a privacy-preserving deep- chronized cooperative computing process amongst scattered and learning mechanism with secure multi-party computations to untrustworthy computing nodes that may have limited computing power and computing intelligence. This paradigm is designed update the single initial deep learning model. However, the by exploring blockchain, decentralized learning, homomorphic deep learning model is centralized while multiple parties encryption, and software defined networking(SDN) techniques. contribute to the model training in a manner that guaranties The performance of the proposed paradigm is evaluated via the privacy-preservation. Federated learning introduced in [7]– different scenarios in the simulation section. [10] is a distributed machine learning method that enables Index Terms—Blockchain, Decentralized and Secure Learning, model training on decentralized data. In this method, multiple Machine Learning, Privacy, Security copies of the central model can be available in distributed computing devices for training, which eliminates the sin- gle point of failure. However, both of these two methods I. INTRODUCTION must be managed by one centralized authoritative controlling Due to the advances in sensing and computing, data-driven agent, which may raise security and privacy concerns. To methods, such as machine learning techniques [1], have be- address these issues, in our work we develop a decentral- come very promising solutions for different applications [2]– ized and secure computing paradigm, which does not have [4]. However, there are two essential challenges in imple- any centralized authoritative controlling agents, by exploiting menting these techniques: (1) acquisition of large amount of blockchain, machine learning, and homomorphic encryption arXiv:1807.02515v3 [cs.CR] 4 Sep 2019 data, and (2) requirement of enough computing power, which technologies. Blockchain is an emerging technology, which enforces the reliance on a dominant cloud computing vendor. can be considered as an immutable and decentralized digital Although the cloud servers provide valuable platforms for big ledger [11], [12]. A blockchain is a growing list of records, data analytics, there remain several essential challenges in called blocks, which are linked via cryptography. Each block adopting the commercial cloud vendors: (1) transparency, (2) contains the hash value of the previous block, the transaction security, and (3) privacy [5]. With the rise of awareness for data, and the timestamp. Due to the inherent cryptographic data privacy, end-users have become reluctant to share their chaining, if a malicious party tries to manipulate certain data. Also, in another perspective, user data are becoming a transaction data, it will causes the changes of the hash values valuable asset. In some domains, such as the healthcare sector, of the block containing the transaction and those of all the federal and civil service offices, there is an abundance of subsequent blocks, which can be easily detected. Therefore, valuable data, however, due to privacy laws and regulations, generally speaking, blockchain technology provides a very these data cannot be shared with the third party. Furthermore, promising solution for integrity security. Amongst various the reliance on cloud servers also limit the potentials of existing blockchain platforms, Ethereum and Bitcoin are two applying big data analytics for the environment where the of the most widely adopted ones [11]–[13]. Compared with computing power is scattered. Therefore, it is meaningful to Bitcoin, Ethereum platform provides a trustful automation of develop a reliable infrastructure in which end-users or data programs via smart contracts that run on virtual machines. creators are able to secure the ownership of data while being In our work, we exploit Ethereum blockchain to execute 2 the secure and privacy-preserving decentralized computing Because of this, most profitable parties in the field of machine functionalities automatically. learning are large organizations, which hold both valuable Furthermore, our computing paradigm enables the effec- big data and sufficient computational power to process it. tive decentralized and cooperative learning via an effective As illustrated in Fig. 1(a), these large organizations collect learning-model fusion mechanism. Fusing multiple learning data from data contributors to advance the capabilities of their models is an active area of research and fusion strategies in machine learning techniques. One of the essential challenges literature can be divided into two main categories: (1) Late for creating large-scale datasets via data acquisition, from mul- fusion that comprise predicting the labels based on the labels tiple parties, is the issue of privacy and the related concerns. given by each learning model to be fused and (2) Early fusion Potential data providers may not get motivated to share the that takes the feature vectors given by the individual learning data because of the high potential for data privacy violations. models as the inputs and learns a classifier on top of them. Additionally, collecting tremendous raw data from multiple Although late fusion requires lower computational cost com- parties results in a huge demand on communication bandwidth pared with early fusion in many practical applications as stated and a dramatically increased attack surface. Furthermore, a in [14]–[16], early fusion can achieve a more optimal way to large amount of computational power is required by the central combine learned models compared with late fusion [17]. In server to process the collected big data. this work, our learning-model fusion mechanism belongs to the type of early fusion, in which the feature vectors to be fused present features with the highest level of abstraction. One solution is the implementation of distributed learning Additionally, we consider two strategies for designing the architectures [6]–[10]. As shown in Fig. 1(b), in distributed fusion mechanism: (1) using a fully connected structure with learning, rather than collecting and processing data in a single a single hidden-layer to map concatenated features to labels central server, data processing is distributed partially to the and (2) implementing gradual fusion to explore the uniqueness individual data providers. By doing so, the distributed learning of the individual learning models and the correlation amongst is implemented in such a way, that the computing contribu- the learning models. tors process their local data by training the given machine To further enhance the security and achieve privacy- learning models or their own machine learning models and preservation, we design a encryption interface with a zero- then share the trained model with a central controlling agent. knowledge proof protocol by exploiting homomorphic encryp- Since the data are not shared, we can say that the data tion (HE), which enables evaluating the performance of the privacy is preserved in this architecture. Additionally, the contributed learning models without revealing the sensitive machine learning models are trained in distributed locations details of the learning models. HE technology is one form with smaller sets of data, and thus the computational power of encryption that allows the computation operations to be required by the individual computing contributors is much directly implemented in cipherspace and achieves an encrypted lower, compared with that of a central server. However, in this results that, when decrypted, match the results of the opera- solution, the machine learning architecture is fully controlled tions as if they had been performed on the plainspace [18]. The by
Recommended publications
  • P2P Business Applications: Future and Directions
    Communications and Network, 2012, 4, 248-260 http://dx.doi.org/10.4236/cn.2012.43029 Published Online August 2012 (http://www.SciRP.org/journal/cn) P2P Business Applications: Future and Directions Pankaj Pankaj, Micki Hyde, James A. Rodger MIS and Decision Sciences, Eberly College of Business & Information Technology, Indiana University of Pennsylvania, Indiana, USA Email: [email protected] Received May 20, 2012; revised June 18, 2012; accepted July 17, 2012 ABSTRACT Since the launch of Napster in June 1999, peer-to-peer technology (P2P) has become synonymous with file sharing ap- plications that are the bane of the recording industry due to copyright infringements and consequent revenue losses. P2P promised a revolution in business computing which has not arrived. It has become synonymous with illegal file sharing and copyright violations. Meanwhile the information systems industry has undergone a paradigm change, and we are supposedly living in a world of cloud computing and mobile devices. It is pertinent to examine if P2P, as a revolution- ary technology, is still relevant and important today and will be in future. One has to examine this question in the con- text of the fact that P2P technologies have matured but have however had limited adoption outside file-sharing in the consumer space. This paper provides a detailed analysis of P2P computing and offers some propositions to answer the question of the relevancy of P2P. It is proposed that P2P is still a relevant technology but may be reshaped in the com- ing years in a different form as compared to what exists today.
    [Show full text]
  • From Technology to Society: an Overview of Blockchain-Based DAO
    > REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER < 1 From Technology to Society: An Overview of Blockchain-based DAO Lu Liu, Sicong Zhou, Huawei Huang1, Zibin Zheng1 Decentralized Autonomous Organization (DAO) is believed to play a significant role in our future society governed in a decentralized way. In this article, we first explain the definitions and preliminaries of DAO. Then, we conduct a literature review of the existing studies of DAO published in the recent few years. Through the literature review, we find out that a comprehensive survey towards the state-of-the-art studies of DAO is still missing. To fill this gap, we perform such an overview by identifying and classifying the most valuable proposals and perspectives closely related to the combination of DAO and blockchain technologies. We anticipate that this survey can help researchers, engineers, and educators acknowledge the cutting-edge development of blockchain-related DAO technologies. Index Terms—Blockchain, Contracts, DAO, Fault Tolerant System, Governance I. INTRODUCTION Blockchain-based technologies have been deeply adopted by User multiple applications that are closely related to every corner of User Contract our daily life, such as cryptocurrencies, tokenomics, business User applications, Internet-of-Things (IoT) applications, and etc.. Decentralized Autonomous Organization (DAO), as one of the Exchange blockchain applications shown in Fig. 1, is growing rapidly Miner Developer and drawing great attention from both academia and the Contract governments around the world. Although DAO has brought Developer a lot of opportunities to blockchain technologies, we are sur- prised to find out that an overview of DAO in the perspective Contract Exchange Miner of blockchains is still missing.
    [Show full text]
  • The Technology Copyright © ${Date}
    1 The Technology Copyright © ${Date}. ${Publisher}. All rights reserved. © ${Date}. ${Publisher}. Copyright Copyright © ${Date}. ${Publisher}. All rights reserved. © ${Date}. ${Publisher}. Copyright 1 Blockchains, Bitcoin, and Decentralized Computing Platforms At their core, blockchains are decentralized databases, maintained by a dis­ tributed network of computers. They blend together a variety of dif er ent technologies— including peer-­to-­peer networks, public-­private key cryptography, and consensus mechanisms—to create a novel type of database. We provide he re a short description of how blockchains work, and unpack and contextualize their key technological components. ­UNTIL THE BIRTH of the Internet, computers suffered in isolation. They were islands, lacking a way to connect to one another except by using cumber- some cables. That all changed in the late 1950s. With the Soviets successfully launching Sputnik into space, and with fears of the Cold War mounting, re- searchers at the Rand Corporation began to explore a new computing para- digm—in hopes of developing a system that would be able to withstand a nuclear catastrophe.1 In August 1964, af ter years of research, Paul Baran, one of the Rand researchers, reported a breakthrough. By relying on a technology called packet switching, Baran was able to send fragments of information from one computer to another and have these fragments reassembled, almost like magic.2 Armed with Baran’s research, the Advanced Research Proj ects Agency Copyright © ${Date}. ${Publisher}. All rights reserved. © ${Date}. ${Publisher}. Copyright (ARPA) at the U.S. Department of Defense used this new technology to 13 14 BLOCKCHAIN AND THE LAW create the first network of computers, ARPAnet, later renamed DARPAnet after “Defense” was added to the beginning of the agency’s name, helping researchers and academics to share files and exchange resources with one an- other.
    [Show full text]
  • Research Problems of Decentralized Systems with Largely Autonomous Nodes
    RESEARCH PROBLEMS OF DECENTRALIZED SYSTEMS WITH LARGELY AUTONOMOUS NODES by Jerome H. Saltzer Massachusetts Institute of Technology A currently popular systems research project is to explore the possibilities and problems for computer system organization that arise from the rapidly falling cost of computing hardware. Interconnecting fleets of mini- or micro-computers and putting intelligence in terminals and concentrators to produce so-called "distributed systems" has recently been a booming development activity. While these efforts range from ingenious to misguided, many seem to miss a most important aspect of the revolution in hardware costs: that more than any other factor, the en_~ cost of acquiring and operating a free-standing, complete computer system has dropped and continues to drop rapidly. Where a decade ago the capital outlay required to install a computer system ranged from $150,000 up into the millions, today the low end of that range is below $15,000 and dropping. The consequence of this particular observation for system structure comes from the next level of analysis. In most organizations, decisions to make capital acquisitions tend to be more centralized for larger capita] amounts, and less centralized for smaller capital amounts. On this basis we may conjecture that lower entry costs for computer systems will lead naturally to computer acquisition decisions being made at points lower in a management hierarchy. Further, because a lower-level organization usually has a smaller mission, those smaller-priced computers will tend to span a smaller range of applications, and in the limit of the argument will be dedicated to a single application.
    [Show full text]
  • Cloud Customer Architecture for Blockchain
    Cloud Customer Architecture for Blockchain Executive Overview Blockchain technology has the potential to radically alter the way enterprises conduct business as well as the way institutions process transactions. Businesses and governments often operate in isolation but with blockchain technology participants can engage in business transactions with customers, suppliers, regulators, potentially spanning across geographical boundaries. Blockchain technology, at its core, features an immutable distributed ledger, a decentralized network that is cryptographically secured. Blockchain architecture gives participants the ability to share a ledger, through peer to peer replication, which is updated every time a block of transaction(s) is agreed to be committed. The technology can reduce operational costs and friction, create transaction records that are immutable, and enable transparent ledgers where updates are nearly instantaneous. It may also dramatically change the way workflow and business procedures are designed inside an enterprise and open up new opportunities for innovation and growth. Blockchain technology can be viewed from a business, legal and technical perspective: • From a business perspective, blockchain is an exchange network that facilitates transfer of value, assets, or other entities between willing and mutually agreeing participants, ensuring privacy and control of data to stakeholders • From a legal perspective, blockchain ledger transactions are validated, indisputable transactions, which do not require intermediaries or trusted third-party legal entities. • From a technical perspective, blockchain is a replicated, distributed ledger of transactions with ledger entries referencing other data stores (for additional information related to ledger transactions). Cryptography is used to ensure that network participants see only the parts of the ledger that are relevant to them, and that transactions are secure, authenticated and verifiable, in the context of permissioned business blockchains.
    [Show full text]
  • Centralized Versus Decentralized Computing: Organizational Considerations and Management Options
    Centralized versus Decentralized Computing: Organizational Considerations and Management Options JOHN LESLIE KING Department of Information and Computer Science, University of Cahfornia, Irvine, Irvine, California 92717 The long.standing debate over whether to centralize or decentralize computing is examined in terms of the fundamental organizational and economic factors at stake. The traditional debate is evaluated, and found to focus predominantly on issues of efficiency versus effectiveness, with solutions based on a rationalistic strategy of optimizing in this trade-off. A behavioral assessment suggests that the driving issues in the debate are the politics of organization and resources, centering on the issue of control. The economics of computing depolyment decisions are presented as an important issue, but one that often serves as a field of argument in which political concerns are dealt with. The debate in this light appears to be unresolvable in the long run, although effective strategies can be developed when the larger issues are recognized. The current situation facing managers of computing, given the advent of small and comparatively inexpensive computers, is examined in detail, and a set of management options for dealing with this persistent issue is presented. Categories and Subject Descriptors: K.6.0 [Management of Computing and Information Systems]: General; K.6.1 [Management of Computing and Information Systems]: Project and People Management; K.6.4 [Management of Computing and Information Systems]: System Management
    [Show full text]
  • A Decentralized Marketplace Application on the Ethereum Blockchain
    2018 IEEE 4th International Conference on Collaboration and Internet Computing A Decentralized Marketplace Application on The Ethereum Blockchain R. Vishnu Prasad Ram Dantu Aditya Paul Paula Mears Department of Computer Science Department of Computer Science Department of Computer Science Department of Computer Science University of North Texas University of North Texas University of North Texas University of North Texas Denton, USA Denton, USA Denton, USA Denton, USA [email protected] [email protected] [email protected] [email protected] Kirill Morozov Department of Computer Science University of North Texas Denton, USA [email protected] Abstract—Modern centralized online marketplaces used as a platform to build truly decentralized such as eBay offer an alternative option for consumers to applications with no central point of failure and no both sell and purchase goods with relative ease. However, hierarchical ownership of user data. This has significant drawbacks to these marketplaces include the platform’s advantages for data security, privacy, and ownership, as ability to block merchants at their own whim, the fees paid well as the potential to dramatically reduce middle-man to the platform when listing a product and when selling a costs. The issue of user trust in decentralized product, and the lack of privacy of users’ data. In this applications that run on a blockchain platform is one that paper, we propose an application that remedies all three of has been studied [3] and developed on recently, and it is these drawbacks through use of the Ethereum blockchain now possible to build application that the user can trust platform.
    [Show full text]
  • Rethinking General-Purpose Decentralized Computing
    Rethinking General-Purpose Decentralized Computing Enis Ceyhun Alp∗ Eleftherios Kokoris-Kogias∗ [email protected] [email protected] EPFL EPFL Georgia Fragkouli Bryan Ford [email protected] [email protected] EPFL EPFL Abstract 1 Introduction While showing great promise, smart contracts are difficult Decentralized systems have been studied extensively over to program correctly, as they need a deep understanding of the past decades [14, 17, 40, 44, 50]. One of the most recent, cryptography and distributed algorithms, and offer limited and arguably the most popular, decentralized system is the functionality, as they have to be deterministic and cannot blockchain and its most promising application is smart con- operate on secret data. In this paper we present Protean, tracts. Smart contracts are user-defined programs that are a general-purpose decentralized computing platform that executed by a network of nodes that reach consensus on addresses these limitations by moving from a monolithic program state. Ethereum, the world’s second largest cryp- execution model, where all participating nodes store all the tocurrency, claims to be a general-purpose decentralized state and execute every computation, to a modular execution- computer that can support execution of arbitrary code in model. Protean employs secure specialized modules, called smart contracts. However, Ethereum and its successors [1, 2] functional units, for building decentralized applications that fall short of their claim due to several shortcomings. First, are currently insecure or impossible to implement with smart they do not support non-deterministic operations and cannot contracts. Each functional unit is a distributed system that securely operate on private data.
    [Show full text]
  • Decentralized P2P Broker for M2M and Iot Applications †
    proceedings Proceedings Decentralized P2P Broker for M2M and IoT Applications † Iván Froiz-Míguez 1,2 , Paula Fraga-Lamas 1,2,* and Tiago M. Fernández-Caramés 1,2,* 1 Department of Computer Engineering, Faculty of Computer Science, Universidade da Coruña, 15071 A Coruña, Spain; [email protected] 2 CITIC Research Center, Universidade da Coruña, 15071 A Coruña, Spain * Correspondence: [email protected] (P.F.-L.); [email protected] (T.M.F.-C.); Tel.: +34-981-167-000 (ext. 6051) (P.F.-L.) † Presented at the 3rd XoveTIC Conference, A Coruña, Spain, 8–9 October 2020 Published: 20 August 2020 Abstract: The recent increase in the number of connected IoT devices, as well as the heterogeneity of the environments where they are deployed, has derived into the growth of the complexity of Machine-to-Machine (M2M) communication protocols and technologies. In addition, the hardware used by IoT devices has become more powerful and efficient. Such enhancements have made it possible to implement novel decentralized computing architectures like the ones based on edge computing, which offload part of the central server processing by using multiple distributed low-power nodes. In order to ease the deployment and synchronization of decentralized edge computing nodes, this paper describes an M2M distributed protocol based on Peer-to-Peer (P2P) communications that can be executed on low-power ARM devices. In addition, this paper proposes to make use of brokerless communications by using a distributed publication/subscription protocol. Thanks to the fact that information is stored in a distributed way among the nodes of the swarm and since each node can implement a specific access control system, the proposed system is able to make use of write access mechanisms and encryption for the stored data so that the rest of the nodes cannot access sensitive information.
    [Show full text]
  • Rethinking General-Purpose Decentralized Computing
    Rethinking General-Purpose Decentralized Computing Enis Ceyhun Alp∗ Eleftherios Kokoris-Kogias∗ [email protected] [email protected] EPFL EPFL Georgia Fragkouli Bryan Ford [email protected] [email protected] EPFL EPFL Abstract 1 Introduction While showing great promise, smart contracts are difficult Decentralized systems have been studied extensively over to program correctly, as they need a deep understanding of the past decades [14, 17, 40, 44, 50]. One of the most recent, cryptography and distributed algorithms, and offer limited and arguably the most popular, decentralized system is the functionality, as they have to be deterministic and cannot blockchain and its most promising application is smart con- operate on secret data. In this paper we present Protean, tracts. Smart contracts are user-defined programs that are a general-purpose decentralized computing platform that executed by a network of nodes that reach consensus on addresses these limitations by moving from a monolithic program state. Ethereum, the world’s second largest cryp- execution model, where all participating nodes store all the tocurrency, claims to be a general-purpose decentralized state and execute every computation, to a modular execution- computer that can support execution of arbitrary code in model. Protean employs secure specialized modules, called smart contracts. However, Ethereum and its successors [1, 2] functional units, for building decentralized applications that fall short of their claim due to several shortcomings. First, are currently insecure or impossible to implement with smart they do not support non-deterministic operations and cannot contracts. Each functional unit is a distributed system that securely operate on private data.
    [Show full text]
  • A Conceptual Architecture in Decentralizing Computing, Storage, and Networking Aspect of Iot Infrastructure
    IoT Article A Conceptual Architecture in Decentralizing Computing, Storage, and Networking Aspect of IoT Infrastructure Yustus Eko Oktian , Elizabeth Nathania Witanto and Sang-Gon Lee * College of Software Convergence, Dongseo University, Busan 47011, Korea; [email protected] (Y.E.O.); [email protected] (E.N.W.) * Correspondence: [email protected] Abstract: Since the inception of the Internet of Things (IoT), we have adopted centralized architecture for decades. With the vastly growing number of IoT devices and gateways, this architecture struggles to cope with the high demands of state-of-the-art IoT services, which require scalable and responsive infrastructure. In response, decentralization becomes a considerable interest among IoT adopters. Following a similar trajectory, this paper introduces an IoT architecture re-work that enables three spheres of IoT workflows (i.e., computing, storage, and networking) to be run in a distributed manner. In particular, we employ the blockchain and smart contract to provide a secure computing platform. The distributed storage network maintains the saving of IoT raw data and application data. The software-defined networking (SDN) controllers and SDN switches exist in the architecture to provide connectivity across multiple IoT domains. We envision all of those services in the form of separate yet integrated peer-to-peer (P2P) overlay networks, which IoT actors such as IoT domain owners, IoT users, Internet Service Provider (ISP), and government can cultivate. We also present several IoT workflow examples showing how IoT developers can adapt to this new proposed architecture. Based on the presented workflows, the IoT computing can be performed in a trusted Citation: Oktian, Y.E.; Witanto, E.N.; and privacy-preserving manner, the IoT storage can be made robust and verifiable, and finally, we can Lee, S.-G.
    [Show full text]
  • Download Whitepaper
    WHITEPAPER WWW.IAGON.COM v4.5 1 TABLE OF CONTENTS OVERVIEW .............................................................................................................................3 INTRODUCTION .....................................................................................................................5 MARKET OUTLOOK OF CLOUD STORAGE SERVICES .............................................................6 MARKET OUTLOOK OF CLOUD COMPUTING SERVICES ..........................................................7 IAGON’S AI-BASED COMPUTATIONAL PROCESSES ...............................................................8 IAGON’S MULTIPLE BLOCKCHAIN SUPPORT .........................................................................9 IAGON’S SECURE LAKE TECHNOLOGY ..................................................................................9 IAGON’S SMART COMPUTING GRID PLATFORM AND AI-TRACKER TECHNOLOGY .................11 CASE STUDY ........................................................................................................................12 REGULATIONS .....................................................................................................................13 REINFORCEMENT LEARNING ...............................................................................................13 DATA MINING ........................................................................................................................14 BLOCKCHAIN .......................................................................................................................15
    [Show full text]