Research Directions in Blockchain Data Management and Analytics

Research Directions in Blockchain Data Management and Analytics

Short Paper Research Directions in Blockchain Data Management and Analytics Hoang Tam Vo Ashish Kundu Mukesh Mohania IBM Research – Australia IBM Research – Yorktown Heights, IBM Research – Australia USA ABSTRACT tracking objects as they traverse the export/import supply chain Blockchain technology has emerged as a primary enabler for while enforcing shipping and expediting incremental payments. verification-driven transactions between parties that do not have Similarly, blockchain technology also has the potential to dis- complete trust among themselves. Bitcoin uses this technology rupt insurance industry. It has been used in a car micro-insurance to provide a provenance-driven verifiable ledger that is based on application to enable the concept of pay-as-you-go insurance [20]. consensus. Nevertheless, the use of blockchain as a transaction This application allows drivers who rarely use cars to only pay in- service in non-cryptocurrency applications, for example, busi- surance premium for particular trips rather than the hefty yearly ness networks, is at a very nascent stage. While the blockchain premium. By transparently storing on blockchain all the data supports transactional provenance, the data management commu- pertaining to the actual trip and premium payment, every party nity and other scientific and industrial communities are assessing in the insurance contract including the driver, the insurance com- how blockchain can be used to enable certain key capabilities for pany, and the financial institution is confident that the data are business applications. tamper-proof and traceable. This guarantees that any insurance We have reviewed a number of proof of concepts and early claim request regarding to a trip can be processed quickly and adoptions of blockchain solutions that we have been involved indisputably, hence offering a better customer experience. Fur- spanning diverse use cases to draw common data life cycle, persis- ther, as the micro-insurance application also requires accessing tence as well as analytics patterns used in real-world applications multiple risk analytic databases such as past driving behaviour with the ultimate aim to identify new frontier of exciting re- statistics and past vehicle runtime statistics for computing pre- search in blockchain data management and analytics. In this miums, a system architecture that allows for maintaining and paper, we discuss several open topics that researchers could in- analyzing both on-chain and off-chain data was also proposed. 1 crease focus on: (1) leverage existing capabilities of mature data In fact, Gartner’s 2016 research report identified blockchain as and information systems, (2) enhance data security and privacy one of the key platform-enabling technologies to track. Nonethe- assurances, (3) enable analytics services on blockchain as well less, while there is currently no standard in the blockchain space, as across off-chain data, and (4) make blockchain-based systems there is a growing consensus that blockchain is entering its peak active-oriented and intelligent. of inflated expectations. The report anticipated that it would take 5 to 10 years for blockchain technology to get mainstream adop- tion. Further, most of nowadays blockchain efforts, especially 1 INTRODUCTION when applied to business environments, are still in a nascent Blockchain (a.k.a. distributed ledger [31]) is an emerging plat- state. Research perspectives and challenges related to blockchain form that is designed to support transactions services within a have been presented in [12], but they are mainly for cryptocur- multi-party business network, with the goal of enabling signifi- rencies and public blockchain environments. The time is ripe cant cost and risk reductions for all parties through the creation for database community to get more deeply involved in solving of innovative new business models. Data maintained within the open problems pertaining to data management and analytics in a distributed ledger can only be accessed through the execution permissioned blockchain network for business applications [28]. of a smart contract [29] (i.e., a stored procedure call on the dis- As the building blocks of blockchain include some combina- tributed ledger) that describes rules that govern a transaction. tion of database, transaction, encryption, consensus and other In addition, the design of blockchain technology ensures that distributed system technologies, it is natural to investigate if no one business entity can modify, delete, or even append any it is possible to utilize existing capabilities of mature data and record to the ledger without the consensus from other business information systems through robust integration into blockchain entities in the network, making the system useful for ensuring systems. There exist open research issues such as multi-storage the immutability of data and legal documents. and index support, novel transaction concurrency model, scalable Given the aforementioned important features, blockchain tech- transaction throughput, master and reference data management, nology has taken the world by storm in the recent years for smart contract management, data security and privacy assur- its promise to transform every industry. For instance, it has ances, as well as information leakage prevention. started to be used in a wider range of applications, e.g., Inter- Furthermore, even though the blockchain database is useful net of Things (IoT) [11]. A blockchain enables IoT devices to for transparent persistence of streaming business data, there is no send data for inclusion in a shared transaction repository with one-size-fits-all database solution for an application [30]. While tamper-resistant records, and enables business parties to access blockchain is originally designed to maintain transaction data, and supply IoT data without the need for central control and there is a growing interest in providing analytics capabilities in management. Blockchain for IoT can optimize supply chains by blockchain-based data systems. Particularly, in this paper we shall elaborate on specific research problems such as built-in analytics © 2018 Copyright held by the owner/author(s). Published in Proceedings of the 21st for blockchain, and data integration and analytics across on-chain International Conference on Extending Database Technology (EDBT), March 26-29, 2018, ISBN 978-3-89318-078-3 on OpenProceedings.org. and off-chain data. Distribution of this paper is permitted under the terms of the Creative Commons license CC-by-nc-nd 4.0. 1http://www.gartner.com/newsroom/id/3412017 Series ISSN: 2367-2005 445 10.5441/002/edbt.2018.43 Last but not least, with recent advances in areas such as infor- data stores such as key-value, document, SQL and spatial data mation retrieval, machine learning, and AI, there is a tremendous stores simultaneously in the same blockchain system. opportunity to bring cognitive capabilities, e.g., understanding, Additionally, blockchain is originally not designed to store dig- learning, and contextual awareness into blockchain-based data ital documents, which, however, are a popular type of data shared systems so as to make them active-oriented and intelligent. We in a business network as observed in a majority of blockchain so- shall discuss open research issues encountered while developing lutions that we have been involved. These digital records are usu- intelligent blockchain-based data systems. Our list is by no means ally large, and their aggregate size grows significantly over time. comprehensive and other research opportunities exist as well. It is infeasible to store these data directly on the blockchain due to several constraints such as storage size, bandwidth and trans- 2 BACKGROUND action throughput. One possible solution is to store these records Blockchain. Blockchain technology [31] provides a framework in a third-party offline storage, and maintain their locations and for building a distributed ledger that can provide consensus, a digital hash of the data on the blockchain for verification. Nev- provenance, immutability and finality of transaction data. The ertheless, this approach requires integration of blockchain and use of blockchain was first popularised by Bitcoin1 [ ], which is offline storages. Thus, it is critical to develop blockchain systems a cryptocurrency. In a blockchain, a group of ledger entries, i.e., with built-in offline storage strategies for handling big data. a list of transactions, are periodically accumulated into a block We also observe from the implementation of those blockchain which contains a cryptographic hash of the prior block linking solutions that rich queries (e.g., conditionals, operators etc.) of the blocks together. This way of chaining the blocks allows the data on blockchain are typically read-only and based on non- global order of the ledger entries to be established and to verify primary keys. To deal with these situations, explicit smart con- that the content of a particular block have not been modified. tracts for maintaining secondary indices have to be developed. Every node in this distributed system maintains its own copy This motivates exciting research problems related to index man- of the blockchain and participates in an appropriate consensus agement in blockchain-based data systems. mechanism to keep the replicated data in sync across nodes. Master data management. Unlike blockchains used in pub- For example, Bitcoin uses a consensus protocol called “Proof-of- lic cryptocurrency

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    4 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us