All-Optical Machine Learning Using Diffractive Deep Neural Networks


Title: All-Optical Machine Learning Using Diffractive Deep Neural Networks

Authors: Xing Lin1,2,3,†, Yair Rivenson1,2,3,†, Nezih T. Yardimci1,3, Muhammed Veli1,2,3, Mona Jarrahi1,3 and Aydogan Ozcan1,2,3,4,*

Affiliations: 1Electrical and Computer Engineering Department, University of California, Los Angeles, CA, 90095, USA; 2Bioengineering Department, University of California, Los Angeles, CA, 90095, USA; 3California NanoSystems Institute (CNSI), University of California, Los Angeles, CA, 90095, USA; 4Department of Surgery, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA

*Correspondence to: [email protected]
†These authors contributed equally to this work.

Abstract: We introduce an all-optical Diffractive Deep Neural Network (D2NN) architecture that can learn to implement various functions after deep learning-based design of passive diffractive layers that work collectively. We experimentally demonstrated the success of this framework by creating 3D-printed D2NNs that learned to implement handwritten digit classification and the function of an imaging lens at terahertz wavelengths. With the existing plethora of 3D-printing and other lithographic fabrication methods, as well as spatial light modulators, this all-optical deep learning framework can perform, at the speed of light, various complex functions that computer-based neural networks can implement, and will find applications in all-optical image analysis, feature detection and object classification, while also enabling new camera designs and optical components that can learn to perform unique tasks using D2NNs.

Main Text

Deep learning is one of the fastest-growing machine learning methods of this decade (1). It uses multi-layered artificial neural networks, implemented in a computer, to digitally learn data representation and abstraction and to perform advanced tasks at a level comparable to, or even better than, that of human experts. Recent examples where deep learning has made major advances in machine learning include medical image analysis (2), speech recognition (3), language translation (4) and image classification (5), among many others (1, 6). Beyond these mainstream applications, deep learning methods are also being used to solve inverse imaging problems (7-13).

Here we introduce an all-optical deep learning framework in which the neural network is physically formed by multiple layers of diffractive surfaces that work in collaboration to optically perform an arbitrary function that the network can statistically learn. We term this framework the Diffractive Deep Neural Network (D2NN) and demonstrate its learning capabilities through both simulations and experiments. A D2NN can be physically created by using several transmissive and/or reflective layers, where each point on a given layer represents an artificial neuron that is connected to other neurons of the following layers through optical diffraction (see Fig. 1A). Following Huygens' principle, our terminology is based on the fact that each point on a given layer acts as a secondary source of a wave, the amplitude and phase of which are determined by the product of the input wave and the complex-valued transmission or reflection coefficient at that point.
Therefore, an artificial neuron in a diffractive deep neural network is connected to other neurons of the following layer through a secondary wave that is modulated in amplitude and phase by both the input interference pattern created by the earlier layers and the local transmission/reflection coefficient at that point. As an analogy to standard deep neural networks (see Fig. 1D), one can consider the transmission/reflection coefficient of each point/neuron as a multiplicative "bias" term, which is a learnable network parameter that is iteratively adjusted during the training process of the diffractive network using an error back-propagation method. After this numerical training phase, implemented in a computer, the D2NN design is fixed and the transmission/reflection coefficients of the neurons of all the layers are determined. This D2NN design, once physically fabricated using e.g., 3D-printing, lithography, etc., can then perform, at the speed of light propagation, the specific task that it is trained for, using only optical diffraction and passive optical components/layers, creating an efficient and fast way of implementing machine learning tasks.

To experimentally validate this all-optical deep learning framework, we used the terahertz (THz) part of the electromagnetic spectrum and a standard 3D-printer to fabricate different layers that collectively learn to perform unique functions using optical diffraction through D2NNs. In our analysis and experiments, we focused on transmissive D2NN architectures (Fig. 1) with phase-only modulation at each layer. For example, using five 3D-printed transmission layers containing a total of 0.2 million neurons and ~8.0 billion connections that are trained using deep learning, we experimentally demonstrated the function of a handwritten digit classifier. This D2NN design formed a fully-connected network, achieved 91.75% classification accuracy on the MNIST dataset (14), and was also validated experimentally. Furthermore, based on the same D2NN framework, but using different 3D-printed layers, we experimentally demonstrated the function of an imaging lens using five transmissive layers with 0.45 million neurons that are stacked in 3D. This second 3D-printed D2NN was much more compact in depth, with 4 mm axial spacing between successive network layers, which is why it had a much smaller number of connections (<0.1 billion) among neurons compared to the digit classifier D2NN, which had 30 mm axial spacing between its layers. While there are various other powerful methods to achieve handwritten digit classification and to design lenses, the main point of these results is the introduction of the diffractive neural network as an all-optical machine learning engine that is scalable and power-efficient, implementing various functions using passive optical components that present large degrees of freedom that can be learned through training data.

Optical implementation of learning in artificial neural networks is promising due to the parallel computing capability and power efficiency of optical systems (15-17). Being quite different in its physical operation principles from previous opto-electronics-based deep neural networks (15, 18-20), the D2NN framework provides a unique all-optical deep learning engine that efficiently operates at the speed of light using passive components and optical diffraction.
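As a quick check of the numbers quoted above, the short sketch below reproduces the neuron and connection counts under two assumptions that the text does not spell out: the 0.2 million neurons are split evenly into five 200 × 200 layers, and diffraction fully connects each of the five planes to the plane that follows it.

```python
# Back-of-the-envelope check of the digit-classifier D2NN size quoted above,
# assuming five equal layers of 200 x 200 neurons and full optical connectivity
# between each plane and the one that follows it (five such gaps in total).
layers = 5
neurons_per_layer = 200 * 200                      # 40,000 neurons per layer
total_neurons = layers * neurons_per_layer         # 200,000  (~0.2 million)
total_connections = layers * neurons_per_layer**2  # 8,000,000,000 (~8.0 billion)

print(f"neurons: {total_neurons:,}, connections: {total_connections:,}")
```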
Benefiting from various high-throughput and wide-area 3D fabrication methods used for engineering micro- and nano-scale structures, this approach can scale up the number of neurons and layers of the network to cover e.g., tens of millions of neurons and hundreds of billions of connections over large-area optical components, and can optically implement various deep learning tasks that involve e.g., image analysis (2), feature detection (21), or object classification (22). Furthermore, it can be used to design optical components with new functionalities, and might lead to new camera or microscope designs that can perform various unique imaging tasks that can be statistically learned through diffractive neural networks. As we discuss later in the manuscript, reconfigurable D2NNs can also be implemented using various transmission- or reflection-based spatial light modulators, providing a flexible optical neural network design that can learn an arbitrary function, be used for transfer learning, and further improve or adjust its performance as needed by e.g., new data or user feedback.

Results

D2NN Architecture. The architecture of a D2NN is depicted in Fig. 1. Although a D2NN can be implemented in transmission or reflection modes by using multiple layers of diffractive surfaces, without loss of generality here we focus on coherent transmissive networks with phase-only modulation at each layer, which is approximated as a thin optical element. In this case, each layer of the D2NN modulates the wavefront of the transmitted field through the phase values (i.e., biases) of its neurons. Following the Rayleigh-Sommerfeld diffraction equation, one can consider every single neuron of a given D2NN layer as a secondary source of a wave that is composed of the following optical mode (23):

$$w_i^l(x, y, z) = \frac{z - z_i}{r^2}\left(\frac{1}{2\pi r} + \frac{1}{j\lambda}\right)\exp\!\left(\frac{j 2\pi r}{\lambda}\right), \quad (1)$$

where $l$ represents the $l$-th layer of the network, $i$ represents the $i$-th neuron located at $(x_i, y_i, z_i)$ of layer $l$, $\lambda$ is the illumination wavelength, $r = \sqrt{(x - x_i)^2 + (y - y_i)^2 + (z - z_i)^2}$ and $j = \sqrt{-1}$. The amplitude and relative phase of this secondary wave are determined by the product of the input wave to the neuron and its transmission coefficient ($t$), both of which are complex-valued functions. Based on this, for the $l$-th layer of the network, one can write the output function ($n_i^l$) of the $i$-th neuron located at $(x_i, y_i, z_i)$ as:

$$n_i^l(x, y, z) = w_i^l(x, y, z) \cdot t_i^l(x_i, y_i, z_i) \cdot \sum_k n_k^{l-1}(x_i, y_i, z_i) = w_i^l(x, y, z) \cdot |A| \cdot e^{j\Delta\theta}, \quad (2)$$

where we define $m_i^l(x_i, y_i, z_i) = \sum_k n_k^{l-1}(x_i, y_i, z_i)$ as the input wave to the $i$-th neuron of layer $l$, $|A|$ refers to the relative amplitude of the secondary wave, and $\Delta\theta$ refers to the additional phase delay that the secondary wave encounters due to the input wave to the neuron and its transmission coefficient. These secondary waves diffract between the layers and interfere with each other, forming a complex wave at the surface of the next layer, feeding its neurons. The transmission coefficient of a neuron is composed of amplitude and phase terms, i.e., $t_i^l(x_i, y_i, z_i) = a_i^l(x_i, y_i, z_i)\exp\!\left(j\phi_i^l(x_i, y_i, z_i)\right)$, and since we focus on a phase-only D2NN architecture, the amplitude $a_i^l(x_i, y_i, z_i)$ is assumed to be a constant, ideally 1, ignoring the optical losses – a topic that we expand on in the Discussion section.
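Equations (1) and (2) translate directly into a short numerical forward model. The sketch below (a simplified illustration, not the authors' code) applies one phase-only layer, $t = \exp(j\phi)$ with unit amplitude, and then superposes the secondary waves of Eq. (1) to obtain the field arriving at the next layer. The grid size, neuron pitch and wavelength are assumed, illustrative values; only the 30 mm layer spacing follows the digit-classifier geometry mentioned above.

```python
import numpy as np

def rs_propagate(field, pitch, wavelength, dz):
    """Sum the secondary waves of Eq. (1) from every neuron of one layer onto
    the plane of the next layer (direct O(N^2) evaluation, no approximations)."""
    n = field.shape[0]
    coords = (np.arange(n) - n / 2) * pitch
    x, y = np.meshgrid(coords, coords, indexing="ij")
    out = np.zeros_like(field, dtype=complex)
    for ix in range(n):
        for iy in range(n):
            # distance r from neuron (x_i, y_i) to every point of the next layer, dz away
            r = np.sqrt((x - x[ix, iy]) ** 2 + (y - y[ix, iy]) ** 2 + dz ** 2)
            w = (dz / r ** 2) * (1 / (2 * np.pi * r) + 1 / (1j * wavelength)) \
                * np.exp(1j * 2 * np.pi * r / wavelength)   # secondary wave, Eq. (1)
            out += field[ix, iy] * w                        # superposition at the next layer, Eq. (2)
    return out

# One phase-only layer: t = exp(j*phi) with amplitude a = 1 (losses ignored, as above).
wavelength = 0.75e-3                               # ~0.4 THz illumination, in metres (illustrative)
pitch = 0.4e-3                                     # neuron size (illustrative)
phi = np.random.uniform(0, 2 * np.pi, (32, 32))    # learnable phase values of the layer
incoming = np.ones((32, 32), dtype=complex)        # plane-wave input m_i^l
modulated = incoming * np.exp(1j * phi)            # field right after the layer
next_layer_input = rs_propagate(modulated, pitch, wavelength, dz=30e-3)
```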
Recommended publications
  • Large-Scale Optical Neural Networks Based on Photoelectric Multiplication
    PHYSICAL REVIEW X 9, 021032 (2019)
    Large-Scale Optical Neural Networks Based on Photoelectric Multiplication
    Ryan Hamerly,* Liane Bernstein, Alexander Sludds, Marin Soljačić, and Dirk Englund
    Research Laboratory of Electronics, MIT, 50 Vassar Street, Cambridge, Massachusetts 02139, USA
    (Received 12 November 2018; revised manuscript received 21 February 2019; published 16 May 2019)
    Recent success in deep neural networks has generated strong interest in hardware accelerators to improve speed and energy consumption. This paper presents a new type of photonic accelerator based on coherent detection that is scalable to large (N ≳ 10^6) networks and can be operated at high (gigahertz) speeds and very low (subattojoule) energies per multiply and accumulate (MAC), using the massive spatial multiplexing enabled by standard free-space optical components. In contrast to previous approaches, both weights and inputs are optically encoded so that the network can be reprogrammed and trained on the fly. Simulations of the network using models for digit and image classification reveal a "standard quantum limit" for optical neural networks, set by photodetector shot noise. This bound, which can be as low as 50 zJ/MAC, suggests that performance below the thermodynamic (Landauer) limit for digital irreversible computation is theoretically possible in this device. The proposed accelerator can implement both fully connected and convolutional networks. We also present a scheme for backpropagation and training that can be performed in the same hardware. This architecture will enable a new class of ultralow-energy processors for deep learning.
    DOI: 10.1103/PhysRevX.9.021032  Subject Areas: Interdisciplinary Physics, Optoelectronics, Photonics
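The multiply-and-accumulate at the heart of this coherent-detection scheme can be pictured with the balanced-detection identity |a + b|² − |a − b|² = 4·Re(a*b): interfering a weight-encoded field with an input-encoded field and subtracting the two detector outputs gives a signal proportional to their product. The snippet below is only a numerical illustration of that identity with real-valued encodings, not a model of the paper's hardware.

```python
import numpy as np

rng = np.random.default_rng(0)
weights = rng.normal(size=100)          # weight vector, encoded on one optical field
inputs = rng.normal(size=100)           # input vector, encoded on the other field

# Balanced ("photoelectric") multiplication: interfere the two fields and
# subtract the two detector intensities; the difference is proportional to w*x.
plus = np.abs(weights + inputs) ** 2    # intensity on detector 1
minus = np.abs(weights - inputs) ** 2   # intensity on detector 2
mac = np.sum(plus - minus) / 4          # accumulating the products gives one MAC result

assert np.allclose(mac, np.dot(weights, inputs))
```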
  • Self-Learning Photonic Signal Processor with an Optical Neural Network Chip
    Self-learning photonic signal processor with an optical neural network chip
    Hailong Zhou1,†, Yuhe Zhao1,†, Xu Wang1, Dingshan Gao1, Jianji Dong1 and Xinliang Zhang1
    1Wuhan National Laboratory for Optoelectronics, School of Optical and Electronic Information, Huazhong University of Science and Technology, Wuhan 430074, China; †These authors contributed equally to this work. e-mail: [email protected]; [email protected].
    Photonic signal processing is essential in optical communication and optical computing. Numerous photonic signal processors have been proposed, but most of them exhibit limited reconfigurability and automaticity. A feature of fully automatic implementation and intelligent response is highly desirable for multipurpose photonic signal processors. Here, we report and experimentally demonstrate a fully self-learning and reconfigurable photonic signal processor based on an optical neural network chip. The proposed photonic signal processor is capable of performing various functions including multichannel optical switching, optical multiple-input-multiple-output descrambling and tunable optical filtering. All the functions are achieved by complete self-learning. Our demonstration suggests great potential for chip-scale fully programmable optical signal processing with artificial intelligence.
    Photonic signal processing is widespread in both optical communication and optical computing. Almost all aspects of signal processing functions have been developed, such as multichannel optical switching1-7, mode (de)multiplexing8,9, optical filtering10-14, polarization management15-17, waveguide crossings18,19 and logic gates20-22. Plentiful photonic signal processing devices have been reported based on either discrete components or photonic integrated circuits1,9,15,20-24. Photonic signal processing devices based on discrete components are usually bulkier and less power-efficient, whereas a photonic integrated signal processing chip has a much smaller footprint and a higher power efficiency.
  • An Optical Neural Chip for Implementing Complex-Valued
    ARTICLE https://doi.org/10.1038/s41467-020-20719-7 OPEN
    An optical neural chip for implementing complex-valued neural network
    H. Zhang1, M. Gu2,3, X. D. Jiang1, J. Thompson3, H. Cai4, S. Paesani5, R. Santagati5, A. Laing5, Y. Zhang1,6, M. H. Yung7,8, Y. Z. Shi1, F. K. Muhammad1, G. Q. Lo9, X. S. Luo9, B. Dong9, D. L. Kwong4, L. C. Kwek1,3,10 & A. Q. Liu1
    Complex-valued neural networks have many advantages over their real-valued counterparts. Conventional digital electronic computing platforms are incapable of executing truly complex-valued representations and operations. In contrast, optical computing platforms that encode information in both phase and magnitude can execute complex arithmetic by optical interference, offering significantly enhanced computational speed and energy efficiency. However, to date, most demonstrations of optical neural networks still only utilize conventional real-valued frameworks that are designed for digital computers, forfeiting many of the advantages of optical computing such as efficient complex-valued operations. In this article, we highlight an optical neural chip (ONC) that implements truly complex-valued neural networks. We benchmark the performance of our complex-valued ONC in four settings: simple Boolean tasks, species classification of an Iris dataset, classifying nonlinear datasets (Circle and Spiral), and handwriting recognition. Strong learning capabilities (i.e., high accuracy, fast convergence and the capability to construct nonlinear decision boundaries) are achieved by our complex-valued ONC compared to its real-valued counterpart.
    1 Quantum Science and Engineering Centre (QSec), Nanyang Technological University, 50 Nanyang Ave, 639798 Singapore, Singapore.
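As a minimal illustration of what "truly complex-valued" operation means here, the sketch below evaluates one complex-valued layer in which inputs, weights and activations all carry magnitude and phase, followed by an intensity readout. The sizes and the |z|² detection step are assumptions for illustration, not the chip's actual architecture.

```python
import numpy as np

rng = np.random.default_rng(1)

# Complex-valued inputs and weights: each value has a magnitude and a phase,
# the kind of arithmetic the ONC performs by optical interference.
x = rng.normal(size=4) * np.exp(1j * rng.uniform(0, 2 * np.pi, 4))
W = rng.normal(size=(3, 4)) * np.exp(1j * rng.uniform(0, 2 * np.pi, (3, 4)))

z = W @ x                      # complex-valued weighted sum (interference of fields)
readout = np.abs(z) ** 2       # photodetection measures intensity |z|^2

print(readout)
```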
  • Class-Specific Differential Detection in Diffractive Optical Neural Networks
    Class-specific Differential Detection in Diffractive Optical Neural Networks Improves Inference Accuracy
    Jingxi Li1,2,3†, Deniz Mengu1,2,3†, Yi Luo1,2,3, Yair Rivenson1,2,3, Aydogan Ozcan1,2,3*
    1Electrical and Computer Engineering Department, University of California, Los Angeles, CA, 90095, USA; 2Bioengineering Department, University of California, Los Angeles, CA, 90095, USA; 3California NanoSystems Institute, University of California, Los Angeles, CA, 90095, USA; †Equal contributing authors. *Corresponding author: [email protected]
    Abstract: Optical computing provides unique opportunities in terms of parallelization, scalability, power efficiency and computational speed, and has attracted major interest for machine learning. Diffractive deep neural networks have been introduced earlier as an optical machine learning framework that uses task-specific diffractive surfaces designed by deep learning to all-optically perform inference, achieving promising performance for object classification and imaging. Here we demonstrate systematic improvements in diffractive optical neural networks based on a differential measurement technique that mitigates the strict non-negativity constraint of light intensity. In this differential detection scheme, each class is assigned to a separate pair of detectors, behind a diffractive optical network, and the class inference is made by maximizing the normalized signal difference between the photodetector pairs. Using this differential detection scheme, involving 10 photodetector pairs behind 5 diffractive layers with a total of 0.2 million neurons, we numerically achieved blind testing accuracies of 98.54%, 90.54% and 48.51% for the MNIST, Fashion-MNIST and grayscale CIFAR-10 datasets, respectively. Moreover, by utilizing the inherent parallelization capability of optical systems, we reduced the cross-talk and optical signal coupling between the positive and negative detectors of each class by dividing the optical path into two jointly-trained diffractive neural networks that work in parallel.
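The readout rule described in this abstract fits in a couple of lines. The sketch below illustrates only that rule: each class has a positive and a negative detector, and the predicted class maximizes the normalized signal difference. The detector intensities are made-up numbers; the real system obtains them optically behind the diffractive layers.

```python
import numpy as np

# Differential class scoring: one (positive, negative) detector pair per class.
I_pos = np.array([0.9, 0.2, 0.4, 0.1, 0.3, 0.2, 0.5, 0.1, 0.2, 0.3])  # 10 classes
I_neg = np.array([0.1, 0.6, 0.3, 0.4, 0.2, 0.5, 0.4, 0.3, 0.4, 0.2])

scores = (I_pos - I_neg) / (I_pos + I_neg)   # normalized differential signal per class
predicted_class = int(np.argmax(scores))     # inference = largest differential score
print(predicted_class)                        # -> 0 for these example intensities
```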
  • Fiber Echo State Network Analogue for High-Bandwidth Dual-Quadrature Signal Processing
    Vol. 27, No. 3 | 4 Feb 2019 | OPTICS EXPRESS 2387
    Fiber echo state network analogue for high-bandwidth dual-quadrature signal processing
    MARIIA SOROKINA, SERGEY SERGEYEV, AND SERGEI TURITSYN
    Aston University, B4 7ET, United Kingdom; *[email protected]
    Abstract: All-optical platforms for recurrent neural networks can offer higher computational speed and energy efficiency. To produce a major advance in comparison with currently available digital signal processing methods, the new system would need to have high bandwidth and operate on both signal quadratures (power and phase). Here we propose a fiber echo state network analogue (FESNA) — the first optical technology that provides both high (beyond previous limits) bandwidth and dual-quadrature signal processing. We demonstrate applicability of the designed system for prediction tasks and for the mitigation of distortions in optical communication systems with multilevel dual-quadrature encoded signals. © 2019 Optical Society of America under the terms of the OSA Open Access Publishing Agreement
    1. Introduction
    Machine learning, in particular neural networks (NNs), has recently attracted renewed and fast-growing interest due to novel designs, advanced hardware implementations, and new applications. The surge of interest in recurrent neural networks (RNNs) is due to the broad spectrum of successful applications, which ranges from image analysis and classification, speech recognition, and language translation to more specific tasks like solving inverse imaging problems or signal processing in wireless and optical communications. In particular, in optical communications, RNNs have been used to compensate for nonlinear signal distortions [1–3] and for nonlinearity mitigation for QPSK and 16-QAM modulation formats [4–6]. However, many applications are limited because the typical scale of RNNs requires a significant amount of time for training and processing, which represents a challenging computational burden.
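For context, the sketch below shows the textbook echo state network update that a fiber-based analogue emulates: a fixed random reservoir provides the recurrent mixing and nonlinearity, and only a linear readout is trained afterwards. This is the generic ESN model, not the FESNA hardware; all sizes and scalings are assumed values.

```python
import numpy as np

rng = np.random.default_rng(2)
n_res, n_in = 200, 2                      # reservoir size, dual-quadrature input (assumed)
W_in = rng.normal(scale=0.5, size=(n_res, n_in))
W = rng.normal(size=(n_res, n_res))
W *= 0.9 / np.max(np.abs(np.linalg.eigvals(W)))   # keep spectral radius below 1 (echo state property)

def step(state, u):
    """One reservoir update: nonlinear mix of the new input and the previous state."""
    return np.tanh(W_in @ u + W @ state)

state = np.zeros(n_res)
for u in rng.normal(size=(50, n_in)):     # drive the reservoir with a dummy input sequence
    state = step(state, u)
# A linear readout (e.g., ridge regression on collected states) would be trained separately.
```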
  • 2. Quantum Patents
    ARTICLE: QUANTUM PATENTS
    BRIAN HANEY1
    ABSTRACT
    Quantum Patents are patents with claims relating to quantum computing. The number of Quantum Patents granted by the USPTO is rapidly increasing each year. In addition, Quantum Patents are increasing in total market value. While the literature on technology patents is visibly scaling, literature specifically focused on Quantum Patents is non-existent. This Article draws on a growing body of quantum computing, intellectual property, and technology law scholarship to provide novel Quantum Patent analysis and critique. This Article contributes the first empirical Quantum Patent review, including novel technology descriptions, market modeling, and legal analysis relating to Quantum Patent claims. First, this Article discusses the two main technical approaches to Quantum Computing. This discussion explores the relationship between Adiabatic Quantum Computers and Gate-Model Quantum Computers, as well as various quantum software frameworks. Second, this Article models an evolving Quantum Patent dataset, offering economic insights, claims analysis, and patent valuation strategies. The data models provide insight into an uncharted patent market alcove, shining light on a completely new economy.
  • Perspective on Photonic Memristive Neuromorphic Computing Elena Goi1,2, Qiming Zhang1,2, Xi Chen1,2, Haitao Luan1,2 and Min Gu1,2*
    Goi et al. PhotoniX (2020) 1:3 https://doi.org/10.1186/s43074-020-0001-6
    PhotoniX REVIEW Open Access
    Perspective on photonic memristive neuromorphic computing
    Elena Goi1,2, Qiming Zhang1,2, Xi Chen1,2, Haitao Luan1,2 and Min Gu1,2*
    *Correspondence: [email protected]; 1Laboratory of Artificial-Intelligence Nanophotonics, School of Science, RMIT University, Melbourne, Victoria 3001, Australia; 2Centre for Artificial-Intelligence Nanophotonics, School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
    Abstract: Neuromorphic computing applies concepts extracted from neuroscience to develop devices shaped like neural systems and achieve brain-like capacity and efficiency. In this way, neuromorphic machines, able to learn from the surrounding environment to deduce abstract concepts and to make decisions, promise to start a technological revolution transforming our society and our life. Current electronic implementations of neuromorphic architectures are still far from competing with their biological counterparts in terms of real-time information-processing capabilities, packing density and energy efficiency. A solution to this impasse is represented by the application of photonic principles to the neuromorphic domain, creating in this way the field of neuromorphic photonics. This new field combines the advantages of photonics and neuromorphic architectures to build systems with high efficiency, high interconnectivity and high information density, and paves the way to ultrafast, power efficient and low cost and complex signal processing. In this Perspective, we review the rapid development of the neuromorphic computing field both in the electronic and in the photonic domain, focusing on the role and the applications of memristors.
  • All-Optical Machine Learning Using
    RESEARCH | OPTICAL COMPUTING
    All-optical machine learning using diffractive deep neural networks
    Xing Lin1,2,3*, Yair Rivenson1,2,3*, Nezih T. Yardimci1,3, Muhammed Veli1,2,3, Yi Luo1,2,3, Mona Jarrahi1,3, Aydogan Ozcan1,2,3,4†
    Deep learning has been transforming our ability to execute advanced inference tasks using computers. Here we introduce a physical mechanism to perform machine learning by demonstrating an all-optical diffractive deep neural network (D2NN) architecture that can implement various functions following the deep learning–based design of passive diffractive layers that work collectively. We created 3D-printed D2NNs that implement classification of images of handwritten digits and fashion products, as well as the function of an imaging lens at a terahertz spectrum. Our all-optical deep learning framework can perform, at the speed of light, various complex functions that computer-based neural networks can execute; will find applications in all-optical image analysis, feature detection, …
    … diffraction. On the basis of the calculated error with respect to the target output, determined by the desired function, the network structure and its neuron phase values are optimized via an error back-propagation algorithm, which is based on the stochastic gradient descent approach used in conventional deep learning (14). To demonstrate the performance of the D2NN framework, we first trained it as a digit classifier to perform automated classification of handwritten digits, from 0 to 9 (Figs. 1B and 2A). For this task, phase-only transmission masks were designed by training a five-layer D2NN with 55,000 images (5000 validation images) from the MNIST (Modified National Institute of Standards and Technology) handwritten digit database (15). Input digits were encoded into the amplitude of the input field to the D2NN, and the diffractive network was trained to map input digits into 10 detector regions, one for each digit.
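A minimal sketch of the detector-region readout described above: the prediction is simply the detector region that collects the most light on the output plane. The output intensity pattern and the 2 × 5 region layout below are illustrative assumptions, not the layout used in the paper.

```python
import numpy as np

rng = np.random.default_rng(3)
# Placeholder for the intensity arriving at the output plane of a trained D2NN.
output_intensity = np.abs(rng.normal(size=(100, 100)) + 1j * rng.normal(size=(100, 100))) ** 2

# Assume 10 rectangular detector regions laid out on a 2 x 5 grid, one per digit.
regions = [(r, c) for r in range(2) for c in range(5)]
signals = [output_intensity[r * 50:(r + 1) * 50, c * 20:(c + 1) * 20].sum()
           for r, c in regions]
predicted_digit = int(np.argmax(signals))    # the region collecting the most light wins
print(predicted_digit)
```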
  • Implementation of Optical Deep Neural Networks Using the Fabry-Pérot Interferometer
    Implementation of Optical Deep Neural Networks using the Fabry-Pérot Interferometer
    BENJAMIN DAVID STEEL1,*
    1Computer Science and Electronics Student, University of Bristol, Bristol, UK; *[email protected]
    Abstract: Future developments in deep learning applications requiring large datasets will be limited by the power and speed limitations of silicon-based von Neumann computing architectures. Optical architectures provide a low-power and high-speed hardware alternative. Recent publications have suggested promising implementations of optical neural networks (ONNs), showing orders-of-magnitude efficiency and speed gains over current state-of-the-art hardware alternatives. In this work, the transmission of the Fabry-Pérot interferometer (FPI) is proposed as a low-power, low-footprint activation function unit. Numerical simulations of optical CNNs using the FPI-based activation functions show accuracies of 98% on the MNIST dataset. An investigation of possible physical implementations of the network shows that an ONN based on current tunable FPIs could be slowed by actuation delays, but rapidly developing optical hardware fabrication techniques could make an integrated approach using the proposed FPI setups a powerful solution for previously inaccessible deep learning applications.
    1. Introduction
    Deep learning techniques have resulted in significant performance gains in pattern recognition problems in the last decade, in fields from image and speech recognition to language translation [1–4]. However, deep neural networks modelled on a traditional general-purpose von Neumann computer result in slow and power-hungry performance due to the computation bottleneck of data movement [5]. Recent development in hardware architectures tailored towards deep learning applications has allowed performance gains, using hardware such as GPUs, application-specific integrated circuits (ASICs) and field-programmable gate arrays (FPGAs) [5–11].
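For reference, the transmission of an ideal lossless Fabry-Pérot cavity follows the standard Airy function, which gives the bounded, sharply nonlinear response that makes it attractive as an activation unit. The sketch below uses that textbook expression with assumed parameter values; the paper's exact parameterization and tuning mechanism may differ.

```python
import numpy as np

def fpi_transmission(phase, reflectivity=0.6):
    """Airy transmission of an ideal lossless Fabry-Perot interferometer:
    T = 1 / (1 + F sin^2(delta/2)), with coefficient of finesse F = 4R / (1 - R)^2.
    Used here as a stand-in for the FPI-based activation unit; the mirror
    reflectivity and phase mapping are illustrative assumptions."""
    F = 4 * reflectivity / (1 - reflectivity) ** 2
    return 1.0 / (1.0 + F * np.sin(phase / 2) ** 2)

# A neuron's pre-activation is assumed to be mapped onto the cavity's round-trip phase.
pre_activation = np.linspace(-np.pi, np.pi, 9)
print(fpi_transmission(pre_activation))   # nonlinear response, bounded in (0, 1]
```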
  • A Photonic Recurrent Neuron for Time-Series Classification
    A Photonic Recurrent Neuron for Time-Series Classification
    George Mourgias-Alexandris, Nikolaos Passalis, George Dabos, Angelina Totović, Anastasios Tefas and Nikos Pleros
    Abstract— Neuromorphic photonics has turned into a key research area for enabling neuromorphic computing at much higher data-rates compared to their electronic counterparts, significantly improving the multiply-and-accumulate operations per second (MAC/sec). At the same time, time-series classification problems comprise a large class of Artificial Intelligence (AI) applications where speed and latency can have a decisive role in their hardware deployment roadmap, highlighting the need for ultra-fast hardware implementations of simplified Recurrent Neural Networks (RNN) that can be extended into more advanced Long-Short-Term-Memory (LSTM) and Gated Recurrent Unit (GRU) machines. Herein, we experimentally demonstrate a novel Photonic Recurrent Neuron (PRN) for successfully classifying a time-series vector with 100-psec optical pulses and up to 10 Gb/s data speeds, reporting on the fastest all-optical real-time classifier.
    … demonstrations typically restricted to proof-of-principle validations that are subsequently utilized in true application scenarios by means of ex-situ benchmarking procedures [9], [15], [28], where offline post-processing of the data is required. RNNs, which typically form the cornerstone of all necessary Deep Learning (DL)-based time-series analysis applications, are still attempting their very first elementary steps in migrating to their optical version. To this end, a large set of speed- and latency-critical time-series applications like pattern classification, forecasting, text processing, natural language processing, finance applications and routing address identification have only been partially addressed in the optical landscape, mainly through photonic Reservoir Computing (RC) …
  • Journey from Optical Neural Networks to Photonic Chips
    INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME 8, ISSUE 08, AUGUST 2019 ISSN 2277-8616
    Journey From Optical Neural Networks To Photonic Chips
    Neha Soni, Enakshi Khular Sharma, Amita Kapoor
    Abstract: In recent years, there has been a rapid expansion in two fields, photonics and artificial neural networks (ANNs). ANNs, based on the basic properties of a biological neuron, have become the solution for a wide variety of problems in many fields, such as prediction, modeling, control, recognition, etc., and many of them have reached the hardware implementation phase. Photonics, on the other hand, with several advantageous features like inherent parallelism, high speed of information processing (photons), high-capacity data storage, etc., has become a natural choice for researchers for the implementation of ANNs. This combination of photonics and ANNs has resulted in novel realizations of various ANN models. In this paper, we attempt to survey the optical realizations of various neural network models made in the last 30 years. We focus on self-organizing neural networks, associative memories, and perceptron neural networks. We also survey the state-of-the-art photonic chips for the realization of ANNs.
    Index Terms: Feed-Forward Networks, Hopfield Neural Networks, Neuromorphic Chips, Optical Neural Networks, Self-Learning Algorithms.
    1 INTRODUCTION
    An optical neural network (ONN) is a physical implementation of an artificial neural network (ANN) with optical components like lasers, lenses, light modulators, mirrors, liquid crystals, etc. In 1987, Mostafa and Psaltis [1] for the very first time focused on … In contrast, photonics has numerous benefits for surpassing the constraints of electronic realizations. It has inherent parallelism and uses photons to transport information instead of electric signals.
  • Microcombs for Ultrahigh Bandwidth Optical Data Transmission and Neural Networks M.Tan,1 X.Xu,2 B.Corcoran,2 J.Wu,1A
    Microcombs for ultrahigh bandwidth optical data transmission and neural networks
    M. Tan,1 X. Xu,2 B. Corcoran,2 J. Wu,1 A. Boes,3 T. Nguyen,3 S. Chu,4 B. Little,5 R. Morandotti,6,7 A. Mitchell,3 and D. J. Moss1
    1 Optical Sciences Centre, Swinburne University of Technology, Hawthorn, VIC 3122, Australia; 2 Department of Electrical and Computer Systems Engineering, Monash University, Clayton, 3800 VIC, Australia; 3 School of Engineering, RMIT University, Melbourne, VIC 3001, Australia; 4 Department of Physics, City University of Hong Kong, Tat Chee Avenue, Hong Kong, China; 5 Xi’an Institute of Optics and Precision Mechanics of CAS, Xi’an, China; 6 INRS-Énergie, Matériaux et Télécommunications, 1650 Boulevard Lionel-Boulet, Varennes, Québec, J3X 1S2, Canada; 7 Inst. of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 610054, PR China
    ABSTRACT: We report ultrahigh bandwidth applications of Kerr microcombs to optical neural networks and to optical data transmission, at data rates from 44 Terabits/s (Tb/s) to approaching 100 Tb/s. Convolutional neural networks (CNNs) are a powerful category of artificial neural networks that can extract the hierarchical features of raw data to greatly reduce the network complexity and enhance the accuracy for machine learning tasks such as computer vision, speech recognition, playing board games and medical diagnosis [1-7]. Optical neural networks can dramatically accelerate the computing speed to overcome the inherent bandwidth bottleneck of electronics. We use a new and powerful class of micro-comb called soliton crystals that exhibit robust operation and stable generation as well as a high intrinsic efficiency with an extremely low spacing of 48.9 GHz.