Quantum K Means Algorithm

Total Page:16

File Type:pdf, Size:1020Kb

Quantum K Means Algorithm DEGREE PROJECT IN INFORMATION AND COMMUNICATION TECHNOLOGY, SECOND CYCLE, 30 CREDITS STOCKHOLM, SWEDEN 2019 Quantum K means Algorithm SUMSAM U. KHAN KTH ROYAL INSTITUTE OF TECHNOLOGY SCHOOL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCE Quantum K-means Algorithm SUMSAM U. KHAN Master in Computer Science Date: August 27, 2019 Supervisor KTH: Baptiste Cavarec Examiner: Mats Bengtsson School of Electrical Engineering and Computer Science Host company: Ericsson AB Supervisor Ericsson: Gemma Vall Llosera and Ahsan Javed Awan iii Abstract Quantum algorithms are being extensively researched nowadays seeing the potential of providing exponential speed up when compared to classical algo- rithm execution. This speed-up can play a big role in machine learning where training a model is usually very slow. Training a machine learning model re- quires manipulating large vectors and quantum computers inherently are re- ally fast at manipulating and computing large vectors and tensor products. However, current quantum computers have certain limitations with respect to qubit’s coherence times and noise. These barriers reduce their effectiveness in solving problems with high accuracy. In pursuit of having better results on cur- rent term noisy quantum computers, implementations of quantum algorithms with simpler circuits are desired. To address this problem, three different novel methods of the quantum version of the K-means clustering algorithm are pre- sented with optimized shallow depth quantum circuit design. Experimental results on the quantum computer IBMQX2 are demonstrated which show a significant improvement in the accuracy of the quantum K-means algorithm. iv Sammanfattning Forskning inom kvantalgoritmer har ökat kraftig under de senaste åren p.g.a. en lovande exponentiell speed up för visa beräkningar som kräver en mängd stor data och tar en lång tida i en klassikt dator. Den här speed up:en kan spela en stor roll i maskininlärning där träningen av en modell är vanligtvis mycket långsam. Träning av en maskininlärningsmodell kräver att man mani- pulerar stora vektorer och kvantdatorer är i själva verket snabba att manipu- lera och beräkna stora vektorer och tensorprodukter. Nuvarande kvantdatorer har emellertid vissa begränsningar med avseende på qubit-koherens och brus. Dessa hindrar deras effektivitet vid att lösa problem med hög noggrannhet. I strävan efter att ha bättre resultat på de befintliga brus-påverkade kvantdatorer, är det önskvärt att implementera kvantalgoritmer med enklare kvantkretsar. Vi adresserar problematiken med tre olika nya metoder för kvant K-means algo- rithm som har en optimerad grunddjupkvantkretsdesign. Experimentella re- sultat på kvantmaskinen IBMQX2 visas och de visar en betydande förbättring av precision hos kvant K-means algoritmen. Contents 1 Introduction 1 1.1 Problem . .3 1.2 Objective . .3 1.3 Methodology . .3 2 Background and Related Work 5 2.1 Quantum Computation . .5 2.1.1 Qubit . .5 2.1.2 Entanglement . .7 2.1.3 Quantum Gates . .8 2.1.4 Quantum Circuit Model . 11 2.1.5 Amplitude Encoding . 11 2.1.6 Interference . 12 2.2 K-Means Clustering Algorithm . 13 2.2.1 K-Means . 13 2.2.2 Quantum K Means . 14 2.2.3 Distance Based Classifier . 16 3 Quantum K-means Implementations 19 3.1 Quantum K-means using distance based classifier . 19 3.1.1 Basic Implementation . 19 3.1.2 Optimized 4 qubit model . 21 3.1.3 Multi-cluster Quantum K-means . 28 3.2 Constant Depth circuit using Negative Rotations . 35 3.3 Distance Calculation using Destructive Interference . 38 4 Evaluation Methodology 43 4.1 Input Data . 43 4.1.1 Random Dataset . 43 4.1.2 Iris Dataset . 44 v vi CONTENTS 4.1.3 MNIST Dataset . 45 4.2 Execution Environment . 47 4.2.1 IBM Qiskit Simulator . 47 4.2.2 IBMQX2 quantum machine . 48 4.3 Evaluation Metric . 49 5 Results and discussion 50 5.1 Basic Implementation . 50 5.2 Optimized 4 qubit Model . 52 5.3 Multi Cluster Quantum K-means . 53 5.4 Constant Depth circuit using Negative Rotations . 55 5.5 Distance Calculation using Destructive Interference . 56 5.6 Analysis . 56 6 Conclusion 59 Bibliography 60 List of Figures 2.1 Qubit on a unit circle. .6 2.2 Quantum Circuit for 2 qubit entanglement . 11 2.3 SwapTest circuit . 15 2.4 Distance Based Classifier Circuit for loading two training vec- tors and one test vector(test and training vectors taken from Iris Dataset [33]). Step B: Load test vector; Step C: Load train- ing vector 1; Step D: Load training vector 2; Step E: Flip the class label for training vector 2; Step F: Interference and measurement [29]. 18 3.1 Distance Based Classifier Circuit with arbitrary training vec- tors Step B: Load test vector; Step C: Load training vector 1; Step D: Load training vector 2; Step E: Flip the class label for training vector 2; Step F: Interference and measurement . 21 3.2 Optimized Interference Circuit . 22 3.3 Original Configuration. θ = jθt − θcj ............. 23 3.4 New Configuration. θ = jθt − θcj ............... 23 3.5 Original Configuration with 2 training vectors c1 and c2 and ◦ ◦ ◦ one test vector t. θt = 45 , θc1 = 0 and θc2 = 180 ...... 25 3.6 Interfering vectors from fig. 3.5 . 26 3.7 Quantum state transition of vector ~t from equation (3.7) . 26 3.8 Quantum state transition of vector ~t0 from equation (3.8) . 26 3.9 Quantum state transition of vector ~c1 from equation (3.9) . 27 3.10 Quantum state transition of vector ~c2 from equation (3.10) . 27 3.11 Combined final state of the interfering vectors . 27 3.12 Quantum state transition flow of input vectors . 28 3.13 Arbitrary State preparation circuit. Implemented using Qiskit’s from Ref. [34]. 30 vii viii LIST OF FIGURES 3.14 Flowchart for quantum K-means. (Blue: Classical Processes, Green: Quantum Process) . 32 3.15 Multi cluster QK means strategy . 33 3.16 Sequence Diagram . 34 3.17 Possible parallelization of QK means algorithm for multi clus- tering problem. Small blue boxes represent input vectors. QC block is a quantum chip and C block is a classical computer. Small yellow boxes represent the state of the algorithm. 35 3.18 Vectors 180o apart . 36 3.19 2 qubit Negative Rotations Quantum Circuit . 37 3.20 K qubit negative rotation circuit . 37 3.21 nk qubit negative rotation circuit . 38 3.22 Generic n qubit Quantum Interference Circuit. Step A: Pre- pare the desired state; Step B: Interference and Measurement . 39 3.23 2 qubit quantum interference circuit Step A: Prepare the de- sired state Step B: Interference and Measurement . 40 3.24 2 qubit quantum interference circuit with equal relative angu- lar difference Step A: Prepare the desired state Step B: Inter- ference and Measurement . 41 3.25 Swap Gate Breakdown . 41 4.1 Randomly Generated Dataset . 43 4.2 Randomly Generated Dataset Normalized . 44 4.3 Iris Dataset left: Standardized , right: Standardized and Nor- malized . 44 4.4 MNIST Dataset sample images . 45 4.5 PCA reduced MNIST dataset with true class labels . 46 4.6 PCA reduced MNIST dataset with true class labels containing digits 0,3,4,7 . 47 4.7 IBMQX2 qubit connectivity taken from [40] . 48 5.1 Results basic implementation on IBM’s quantum simulator . 51 5.2 Results basic implementation on IBMQX2 . 51 5.3 Results of optimized circuit IBM’s quantum simulator . 52 5.4 Results of optimized circuit on IBM’s quantum chip IBMQX2 52 5.5 Result for Iris dataset left: Multi Cluster quantum K-means on simulator, right: Scikit-learn K-means . 53 5.6 Result Iris Dataset multi cluster quantum K-means IBMQX2 . 54 5.7 Result: left: multi cluster quantum K-means right: scikit- learn kmeans . 54 LIST OF FIGURES ix 5.8 Result MNIST multi cluster quantum K-means IBMQX2 . 55 5.9 Result: left: Iris dataset right: MNIST dataset . 55 5.10 Result: left: Iris dataset right: MNIST dataset . 56 Chapter 1 Introduction Quantum computers are known to solve certain difficult problems exponen- tially faster than classical computers do. Integer factorization on a classical computer is one such problem that grows exponentially in time with the in- crease on input data but it can be solved in polynomial time using Shor’s algo- rithm [1] on a Quantum Computer. Similarly, Grover’s algorithm [2] provides a quadratic speed up on searching an item from an unsorted array. The list of such algorithms that provide a Quantum Speedup has grown over the duration of last two decades ([3] cites more than 250 quantum algorithms) and scien- tists have developed and tested algorithms that theoretically provide a quantum speed-up [4]. Another quantum discipline where quantum computing promises a speed- up is Machine learning (ML). Quantum Machine Learning (QML) has gained a lot of prominence and an increasing number of published literature that cov- ers the principles and techniques of QML is being produced [5]. Significant work has been done to take advantage of quantum processing power for ma- chine learning tasks. QML techniques are being developed by researchers that can solve ML problems faster than the best known classical algorithms for the respective problems. Supervised (e.g. support vector machines), unsuper- vised (e.g. clustering) and reinforcement quantum machine learning methods have been proposed that provide exponential speed-ups as compared to their classical counterparts [6, 7, 8, 9]. The gap between mathematical models and actual implementations of the algorithms has been reduced over the past few years. There has been a lot of development in the Quantum software world and there are well over fifty 1 2 CHAPTER 1. INTRODUCTION open-source software platforms available for quantum development. A list of these development platforms is available at [10].
Recommended publications
  • Quantum Machine Learning for Data Scientists
    Quantum machine learning for data scientists Dawid Kopczyk Quantee Limited, Manchester, United Kingdom This text aims to present and explain quantum machine learning algorithms to a data scientist in an accessible and consistent way. The algorithms and equations presented are not written in rig- orous mathematical fashion, instead, the pressure is put on examples and step by step explanation of difficult topics. This contribution gives an overview of selected quantum machine learning algo- rithms, however there is also a method of scores extraction for quantum PCA algorithm proposed as well as a new cost function in feed-forward quantum neural networks is introduced. The text is divided into four parts: the first part explains the basic quantum theory, then quantum compu- tation and quantum computer architecture are explained in section two. The third part presents quantum algorithms which will be used as subroutines in quantum machine learning algorithms. Fi- nally, the fourth section describes quantum machine learning algorithms with the use of knowledge accumulated in previous parts. Contents I. Introduction 2 II. Basic quantum theory 3 A. Quantum states 3 B. Quantum observables 5 C. Measurement 7 D. Assembling quantum states 8 E. Density matrix 9 III. Quantum computation 11 A. Qubit 11 B. Quantum gates 12 C. Quantum parallelism 14 IV. Quantum algorithms 15 A. Grover's search algorithm 16 1. Preliminaries 16 2. Example 16 3. Algorithm 19 4. Summary 19 B. Quantum minimization algorithm 19 1. Preliminaries 19 2. Example 20 3. Algorithm 22 4. Summary 22 arXiv:1804.10068v1 [quant-ph] 25 Apr 2018 C. Quantum Fourier transform algorithm 23 1.
    [Show full text]
  • Arxiv:1709.02779V1 [Quant-Ph] 8 Sep 2017 I
    Machine learning & artificial intelligence in the quantum domain Vedran Dunjko Institute for Theoretical Physics, University of Innsbruck, Innsbruck 6020, Austria Max Planck Institute of Quantum Optics, Garching 85748, Germany Email: [email protected] Hans J. Briegel Institute for Theoretical Physics, University of Innsbruck Innsbruck 6020, Austria Department of Philosophy, University of Konstanz, Konstanz 78457, Germany Email: [email protected] Abstract. Quantum information technologies, on the one side, and intelligent learning systems, on the other, are both emergent technologies that will likely have a transforming impact on our society in the future. The respective underlying fields of basic research { quantum information (QI) versus machine learning and artificial intelligence (AI) { have their own specific questions and challenges, which have hitherto been investigated largely independently. However, in a growing body of recent work, researchers have been prob- ing the question to what extent these fields can indeed learn and benefit from each other. QML explores the interaction between quantum computing and machine learning, inves- tigating how results and techniques from one field can be used to solve the problems of the other. In recent time, we have witnessed significant breakthroughs in both directions of influence. For instance, quantum computing is finding a vital application in providing speed-ups for machine learning problems, critical in our \big data" world. Conversely, machine learning already permeates many cutting-edge technologies, and may become instrumental in advanced quantum technologies. Aside from quantum speed-up in data analysis, or classical machine learning optimization used in quantum experiments, quan- tum enhancements have also been (theoretically) demonstrated for interactive learning tasks, highlighting the potential of quantum-enhanced learning agents.
    [Show full text]
  • Quantum Speed-Up for Unsupervised Learning
    Mach Learn (2013) 90:261–287 DOI 10.1007/s10994-012-5316-5 Quantum speed-up for unsupervised learning Esma Aïmeur · Gilles Brassard · Sébastien Gambs Received: 10 February 2012 / Revised: 10 February 2012 / Accepted: 13 July 2012 / Published online: 31 August 2012 © The Author(s) 2012. This article is published with open access at Springerlink.com Abstract We show how the quantum paradigm can be used to speed up unsupervised learn- ing algorithms. More precisely, we explain how it is possible to accelerate learning algo- rithms by quantizing some of their subroutines. Quantization refers to the process that par- tially or totally converts a classical algorithm to its quantum counterpart in order to improve performance. In particular, we give quantized versions of clustering via minimum spanning tree, divisive clustering and k-medians that are faster than their classical analogues. We also describe a distributed version of k-medians that allows the participants to save on the global communication cost of the protocol compared to the classical version. Finally, we design quantum algorithms for the construction of a neighbourhood graph, outlier detection as well as smart initialization of the cluster centres. Keywords Unsupervised learning · Clustering · Quantum learning · Quantum information processing · Grover’s algorithm 1 Introduction Consider the following scenario, which illustrates a highly challenging clustering task. Imagine that you are an employee of the Department of Statistics of the United Nations. Your boss gives you the demographic data of all the Earth inhabitants and asks you to anal- Editor: Shai Shalev-Shwartz. E. Aïmeur · G. Brassard () Département d’informatique et de recherche opérationnelle, Université de Montréal, C.P.
    [Show full text]
  • Quantum Machine Learning This Page Intentionally Left Blank Quantum Machine Learning What Quantum Computing Means to Data Mining
    Quantum Machine Learning This page intentionally left blank Quantum Machine Learning What Quantum Computing Means to Data Mining Peter Wittek University of Borås Sweden AMSTERDAM • BOSTON • HEIDELBERG • LONDON NEW YORK • OXFORD • PARIS • SAN DIEGO SAN FRANCISCO • SINGAPORE • SYDNEY • TOKYO Academic Press is an imprint of Elsevier Academic Press is an imprint of Elsevier 525 B Street, Suite 1800, San Diego, CA 92101-4495, USA 225 Wyman Street, Waltham, MA 02451, USA The Boulevard, Langford Lane, Kidlington, Oxford OX5 1GB, UK 32 Jamestown Road, London NW1 7BY, UK First edition Copyright c 2014 by Elsevier Inc. All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher. Details on how to seek permission, further information about the Publisher’s permissions policies and our arrangement with organizations such as the Copyright Clearance Center and the Copyright Licensing Agency, can be found at our website: www.elsevier.com/permissions. This book and the individual contributions contained in it are protected under copyright by the Publisher (other than as may be noted herein). Notice Knowledge and best practice in this field are constantly changing. As new research and experience broaden our understanding, changes in research methods, professional practices, or medical treatment may become necessary. Practitioners and researchers must always rely on their own experience and knowledge in evaluating and using any information, methods, compounds, or experiments described herein. In using such information or methods they should be mindful of their own safety and the safety of others, including parties for whom they have a professional responsibility.
    [Show full text]
  • Data Clustering Based on Langevin Annealing with a Self-Consistent Potential
    QUARTERLY OF APPLIED MATHEMATICS VOLUME LXXVII, NUMBER 3 SEPTEMBER 2019, PAGES 591–613 https://doi.org/10.1090/qam/1521 Article electronically published on October 11, 2018 DATA CLUSTERING BASED ON LANGEVIN ANNEALING WITH A SELF-CONSISTENT POTENTIAL By KYLE LAFATA (Department of Radiation Oncology, Physics Division, Department of Physics, and Medical Physics Graduate Program, Duke University, Durham, North Carolina 27705 ), ZHENNAN ZHOU (Beijing International Center for Mathematical Research, Peking University, Beijing, People’s Republic of China 100871 ), JIAN-GUO LIU (Departments of Mathematics and Physics, Duke University, Durham, North Carolina 27705 ), and FANG-FANG YIN (Department of Radiation Oncology, Physics Division and Medical Physics Graduate Program, Duke University, Durham, North Carolina 27705 ) Abstract. This paper introduces a novel data clustering algorithm based on Langevin dynamics, where the associated potential is constructed directly from the data. To in- troduce a self-consistent potential, we adopt the potential model from the established Quantum Clustering method. The first step is to use a radial basis function to construct a density distribution from the data. A potential function is then constructed such that this density distribution is the ground state solution to the time-independent Schr¨odinger equation. The second step is to use this potential function with the Langevin dynamics at subcritical temperature to avoid ergodicity. The Langevin equations take a classical Gibbs distribution as the invariant measure, where the peaks of the distribution coin- cide with minima of the potential surface. The time dynamics of individual data points Received July 16, 2018. 2010 Mathematics Subject Classification. Primary 62H30, 82C31, 60H10, 37M25.
    [Show full text]
  • Algorithm for Data Clustering in Pattern Recognition Problems Based on Quantum Mechanics
    VOLUME 88, NUMBER 1 PHYSICAL REVIEW LETTERS 7JANUARY 2002 Algorithm for Data Clustering in Pattern Recognition Problems Based on Quantum Mechanics David Horn and Assaf Gottlieb School of Physics and Astronomy, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel (Received 16 July 2001; published 20 December 2001) We propose a novel clustering method that is based on physical intuition derived from quantum me- chanics. Starting with given data points, we construct a scale-space probability function. Viewing the latter as the lowest eigenstate of a Schrödinger equation, we use simple analytic operations to derive a potential function whose minima determine cluster centers. The method has one parameter, determin- ing the scale over which cluster structures are searched. We demonstrate it on data analyzed in two dimensions (chosen from the eigenvectors of the correlation matrix). The method is applicable in higher dimensions by limiting the evaluation of the Schrödinger potential to the locations of data points. DOI: 10.1103/PhysRevLett.88.018702 PACS numbers: 89.75.Kd, 02.70.–c, 03.65.Ge, 03.67.Lx µ ∂ Clustering of data is a well-known problem of pattern s2 Hc ϵ 2 =2 1 V͑x͒ c ෇ Ec . recognition, covered in textbooks such as [1–3]. The prob- 2 (2) lem we are looking at is defining clusters of data solely by H V the proximity of data points to one another. This problem is Here we rescaled and of the conventional quantum s one of unsupervised learning, and is in general ill defined. mechanical equation to leave only one free parameter, .
    [Show full text]
  • Quantum Machine Learning
    ESANN 2020 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. Online event, 2-4 October 2020, i6doc.com publ., ISBN 978-2-87587-074-2. Available from http://www.i6doc.com/en/. Quantum Machine Learning Jos´eD. Mart´ın-Guerrero1, Lucas Lamata2 1- Departament d'Enginyeria Electr`onica ETSE-UV, Universitat de Val`encia,Spain [email protected] 2- Departamento de F´ısicaAt´omica,Molecular y Nuclear Universidad de Sevilla, Spain [email protected] Abstract. Machine Learning (ML) is becoming a more and more popular field of knowledge, being a term known not only in the academic field due to its successful applications to many real-world problems. The advent of Deep Learning and Big Data in the last decade has contributed to make it even more popular. Many companies, both large ones and SMEs, have created specific departments for ML and data analysis, being in fact their main activity in many cases. This current exploitation of ML should not mislead us; while it is a mature field of knowledge, there is still room for many novel contributions, namely, a better understanding of the underlying Mathe- matics, proposal and tuning of algorithms suitable for new problems (e.g., Natural Language Processing), automation and optimization of the search of parameters, etc. Within this framework of new contributions to ML, Quantum Machine Learning (QML) has emerged strongly lately, speeding up ML calculations and providing alternative representations to existing approaches. This special session includes six high-quality papers dealing with some of the most relevant aspects of QML, including analysis of learning in quantum computing and quantum annealers, quantum versions of classical ML models {like neural networks or learning vector quantization{, and quantum learning approaches for measurement and control.
    [Show full text]
  • Quantum Clustering Drives Innovations: a Bibliometric and Patentometric Analysis
    University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Library Philosophy and Practice (e-journal) Libraries at University of Nebraska-Lincoln Winter 1-1-2021 Quantum clustering drives innovations: A bibliometric and patentometric analysis Shradha Deshmukh [email protected] Preeti Mulay Symbiosis Institute of Technology (SIT), Symbiosis International (Deemed University) (SIU), Pune, India Follow this and additional works at: https://digitalcommons.unl.edu/libphilprac Part of the Other Computer Engineering Commons Deshmukh, Shradha and Mulay, Preeti, "Quantum clustering drives innovations: A bibliometric and patentometric analysis" (2021). Library Philosophy and Practice (e-journal). 5072. https://digitalcommons.unl.edu/libphilprac/5072 Quantum clustering drives innovations: A bibliometric and patentometric analysis Shradha Deshmukh, Preeti Mulay Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India. [email protected], [email protected] Abstract The paper presents a bibliometric analysis from 2014 to 2020 of the emerging and engaging field of quantum computing called Quantum Machine Learning (QML). The study discusses the analysis results from the comprehensive high indexed databases worldwide such as Institute of Electrical and Electronics Engineers (IEEE), Scopus, Web of Science (WOS), Google Scholar and the Association for Computing Machinery (ACM). Tools like iMapbuilder, IBM and SPSS Statistics are used to provide meaningful insights and flawless representations of the extracted data. There has been little research to provide a macroscopic overview of renowned authors, subject areas, funding agencies and patent applications related to Quantum Clustering (QC). The result and analysis of this study show an interesting fact, most researchers are now aware of quantum technology from the past few years.
    [Show full text]
  • Quantum Spectral Clustering Through a Biased Phase Estimation Algorithm
    TWMS J. App. Eng. Math. V.10, N.1, 2020, pp. 24-33 QUANTUM SPECTRAL CLUSTERING THROUGH A BIASED PHASE ESTIMATION ALGORITHM AMMAR DASKIN1, x Abstract. In this paper, we go through the theoretical steps of the spectral clustering on quantum computers by employing the phase estimation and the amplitude amplification algorithms. We discuss circuit designs for each step and show how to obtain the clustering solution from the output state. In addition, we introduce a biased version of the phase estimation algorithm which significantly speeds up the amplitude amplification process. The complexity of the whole process is analyzed: It is shown that when the circuit rep- resentation of a data matrix of order N is produced through an ancilla based circuit in which the matrix is written as a sum of L number of Householder matrices; the computa- tional complexity is bounded by O(2mLN) number of quantum gates. Here, m represents the number of qubits involved in the phase register of the phase estimation algorithm. Keywords: Spectral Clustering, Quantum Algorithms.. AMS Subject Classification: 81P68, 68Q12,62H30 1. Introduction In recent years, the research in quantum algorithms for machine learning problems has gained substantial momentum. Some of these algorithms include the application of quantum random walk [1] to the community detection in quantum networks [2], quantum nearest neighbor methods [3] for clustering problems, the deep learning in the context of quantum computing [4], and an accelerated unsupervised learning algorithm with the help of quantum based subroutines [5]. Furthermore, quantum algorithms for topological and geometric analysis of data [6] and quantum principal component analysis [7] are introduced.
    [Show full text]
  • Quantum Clustering
    Quantum Clustering David Horn and Assaf Gottlieb School of Physics and Astronomy Raymond and Beverly Sackler Faculty of Exact Sciences Tel Aviv University, Tel Aviv 69978, Israel October 25, 2018 Abstract We propose a novel clustering method that is based on physical intuition derived from quantum mechanics. Starting with given data points, we construct a scale-space probability function. Viewing the latter as the lowest eigenstate of a Schr¨odinger equa- tion, we use simple analytic operations to derive a potential function whose minima determine cluster centers. The method has one parameter, determining the scale over which cluster structures are searched. We demonstrate it on data analyzed in two dimensions (chosen from the eigenvectors of the correlation matrix). The method is applicable in higher dimensions by limiting the evaluation of the Schr¨odinger potential to the locations of data points. In this case the method may be formulated in terms of distances between data points. 1 Introduction Clustering of data is a well-known problem of pattern recognition, covered in textbooks such as [1, 2, 3]. The problem we are looking at is defining clusters of data solely by the proximity of data points to one another. This problem is one of unsupervised learning, and is in general ill defined. Solutions to such problems can be based on intuition derived from Physics. A good example of the latter is the algorithm by [4] that is based on associating points with Potts-spins and formulating an appropriate model of statistical mechanics. We propose an alternative that is also based on physical intuition, this one being derived from quantum mechanics.
    [Show full text]
  • Clustering Using Quantum Persistent Homology
    Master's Thesis Clustering using Quantum Persistent Homology R.H.A.J. Meijer 4205197 Submitted in partial fulfillment of the requirements for the Master of Science degree in Computing Science. Written at Utrecht University & TNO The Hague. dr. E.J. van Leeuwen N.M.P. Neumann, MSc dr. C.P. de Campos S.R. den Breeijen, MSc October 1st, 2019 Abstract Persistent homology is a way to analyse the shape of a dataset rather than its semantics. This is done by assigning the dataset holes in different dimensions, borrowing this theory from the mathematical area of topology. The dataset is treated as being sampled from a topological space, which is in turn approximated at different resolutions. The persistent Betti numbers then count the number of holes in these spaces, registering for how long the holes persist as the resolution increases. Computing such features classically may prove costly, with some steps possibly requiring exponential time. In a 2016 paper, a polynomial time quantum algorithm for the approximation of persistent homology is promised. In this thesis it is shown these methods are generally incapable of uniquely determining the persistent Betti numbers in arbitrary dimensions. The algorithm does succeed in the zeroth dimension. After treating the prerequisites, the methods from this paper are explained, the above statements are proven, and finally the quantum algorithm is related to agglomerative hierarchical clustering and spectral clustering. Contents 1 Introduction 1 1.1 Topics . .1 1.1.1 Quantum Computing . .1 1.1.2 Cluster Analysis . .2 1.1.3 Persistent Homology . .3 1.2 Academic Context .
    [Show full text]
  • K-Means Clustering on Noisy Intermediate Scale Quantum Computers
    1 K-Means Clustering on Noisy Intermediate Scale Quantum Computers Sumsam Ullah Khan1;2, Ahsan Javed Awan1 and Gemma Vall-Llosera1 1Department of Cloud Systems and Platforms, Ericsson Research, Sweden 2School of Electrical Engineering and Computer Science, Royal Institute of Technology, Sweden fsumsam.khan, ahsan.javed.awan, [email protected] Abstract—Real-time clustering of big performance data gener- computer requires to re-design the classical algorithms so they ated by the telecommunication networks requires domain-specific are bound to the principles of quantum mechanics. high performance compute infrastructure to detect anomalies. In this paper, we evaluate noisy intermediate-scale quantum (NISQ) Current and upcoming quantum computers also termed computers characterized by low decoherence times, for K-means Noisy Intermediate Scale Quantum computers (NISQ) do not clustering and propose three strategies to generate shorter-depth provide sufficient fault tolerance [1]. Qubits on the current quantum circuits needed to overcome the limitation of NISQ quantum devices have low coherence times. This leads to computers. The strategies are based on exploiting; i) quantum interference, ii) negative rotations and iii) destructive interfer- decoherence and errors in the computation [2]. Algorithm ence. By comparing our implementations on IBMQX2 machine implementations with shallow depth quantum circuits can for representative data sets, we show that NISQ computers can provide us with better results on NISQ computers considering solve the K-means clustering problem with the same level of that the complexity of implementation is measured as the total accuracy as that of classical computers. number of elementary gates required to build the circuit [3].
    [Show full text]