Analog Signal Processing Solutions and Design of Memristor-Cmos Analog Co-Processor for Acceleration of High-Performance Computing Applications

Total Page:16

File Type:pdf, Size:1020Kb

Analog Signal Processing Solutions and Design of Memristor-Cmos Analog Co-Processor for Acceleration of High-Performance Computing Applications University of Massachusetts Amherst ScholarWorks@UMass Amherst Doctoral Dissertations Dissertations and Theses July 2018 ANALOG SIGNAL PROCESSING SOLUTIONS AND DESIGN OF MEMRISTOR-CMOS ANALOG CO-PROCESSOR FOR ACCELERATION OF HIGH-PERFORMANCE COMPUTING APPLICATIONS Nihar Athreyas University of Massachusetts Amherst Follow this and additional works at: https://scholarworks.umass.edu/dissertations_2 Part of the Electronic Devices and Semiconductor Manufacturing Commons, Hardware Systems Commons, Signal Processing Commons, and the VLSI and Circuits, Embedded and Hardware Systems Commons Recommended Citation Athreyas, Nihar, "ANALOG SIGNAL PROCESSING SOLUTIONS AND DESIGN OF MEMRISTOR-CMOS ANALOG CO-PROCESSOR FOR ACCELERATION OF HIGH-PERFORMANCE COMPUTING APPLICATIONS" (2018). Doctoral Dissertations. 1215. https://doi.org/10.7275/11884361.0 https://scholarworks.umass.edu/dissertations_2/1215 This Open Access Dissertation is brought to you for free and open access by the Dissertations and Theses at ScholarWorks@UMass Amherst. It has been accepted for inclusion in Doctoral Dissertations by an authorized administrator of ScholarWorks@UMass Amherst. For more information, please contact [email protected]. ANALOG SIGNAL PROCESSING SOLUTIONS AND DESIGN OF MEMRISTOR-CMOS ANALOG CO- PROCESSOR FOR ACCELERATION OF HIGH-PERFORMANCE COMPUTING APPLICATIONS A Dissertation Presented by NIHAR ATHREYAS Submitted to the Graduate School of the University of Massachusetts Amherst in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2018 ELECTRICAL AND COMPUTER ENGINEERING © Copyright by Nihar Athreyas 2018 All Rights Reserved ANALOG SIGNAL PROCESSING SOLUTIONS AND DESIGN OF MEMRISTOR-CMOS ANALOG CO- PROCESSOR FOR ACCELERATION OF HIGH-PERFORMANCE COMPUTING APPLICATIONS A Dissertation Presented by NIHAR ATHREYAS Approved as to style and content by: ______________________________________________ Dev Gupta, Co-Chair ______________________________________________ J. Joshua Yang, Co-Chair ______________________________________________ Patrick Kelly, Member ______________________________________________ Blair Perot, Member ____________________________________________ Christopher V. Hollot, Department Head Electrical and Computer Engineering DEDICATION To Mom, Dad, Lakshmi, My family and My Dearest Friends (too many to mention by name but, they know who they are) who have been a constant source of motivation and support, at times when I needed it (or not) And To Tom Brady and the Patriots for the never give up attitude, and without whom Sundays would be very boring. ACKNOWLEDGMENTS It gives me immense pleasure to thank those who have made this work possible. I would like to start by thanking Dr. Dev Gupta for believing in me and hiring me as his research assistant 5 years ago. He has had a strong influence on my professional life and he has always pointed me in the right direction, every time I needed it. I’m lucky to have him as my advisor and mentor. I would like to thank Dr. Joshua Yang who co-advised me on this thesis. He always offered valuable suggestions and encouragement. I would like to extend my gratitude to Dr. Patrick Kelly who has always been a constant source of motivation and support. I would like to express my gratitude to Dr. Blair Perot for his valuable time, suggestions and advice and for getting me interested in partial differential equations. I want to thank Dr. Qiangfei Xia for all his support and guidance. I would also like to thank Jai Gupta and Abbie Mathew of Spero Devices for accommodating me and supporting me in every possible way. It’s always a pleasure working with you guys. I would like to thank the University of Massachusetts, Amherst for providing a wonderful environment and resources for the completion of this project. A special thank you to my family and all my friends for the support they provided me with when the going got tough and everything else that they have done. v ABSTRACT ANALOG SIGNAL PROCESSING SOLUTIONS AND DESIGN OF MEMRISTOR-CMOS ANALOG CO- PROCESSOR FOR ACCELERATION OF HIGH-PERFORMANCE COMPUTING APPLICATIONS MAY 2018 NIHAR ATHREYAS B.E., VISVESVARAYA TECHNOLOGICAL UNIVERSITY M.S., UNIVERSITY OF MASSACHUSETTS, AMHERST Ph.D., UNIVERSITY OF MASSACHUSETTS, AMHERST Directed by: Professor Dev Gupta and Professor Joshua Yang Emerging applications in the field of machine vision, deep learning and scientific simulation require high computational speed and are run on platforms that are size, weight and power constrained. With the transistor scaling coming to an end, existing digital hardware architectures will not be able to meet these ever-increasing demands. Analog computation with its rich set of primitives and inherent parallel architecture can be faster, more efficient and compact for some of these applications. The major contribution of this work is to show that analog processing can be a viable solution to this problem. This is demonstrated in the three parts of the dissertation. In the first part of the dissertation, we demonstrate that analog processing can be used to solve the problem of stereo correspondence. Novel modifications to the algorithms are proposed which improves the computational speed and makes them efficiently implementable in analog hardware. The analog domain implementation provides further speedup in computation and has lower power consumption than a digital implementation. In the second part of the dissertation, a prototype of an analog processor was developed using commercially available off-the-shelf components. The focus was on providing experimental vi results that demonstrate functionality and to show that the performance of the prototype for low-level and mid-level image processing tasks is equivalent to a digital implementation. To demonstrate improvement in speed and power consumption, an integrated circuit design of the analog processor was proposed, and it was shown that such an analog processor would be faster than state-of-the-art digital and other analog processors. In the third part of the dissertation, a memristor-CMOS analog co-processor that can perform floating point vector matrix multiplication (VMM) is proposed. VMM computation underlies some of the major applications. To demonstrate the working of the analog co-processor at a system level, a new tool called PSpice Systems Option is used. It is shown that the analog co- processor has a superior performance when compared to the projected performances of digital and analog processors. Using the new tool, various application simulations for image processing and solution to partial differential equations are performed on the co-processor model. vii TABLE OF CONTENTS Page ACKNOWLEDGMENTS ...................................................................................................................... v ABSTRACT ........................................................................................................................................ vi LIST OF TABLES ................................................................................................................................ xii LIST OF FIGURES ............................................................................................................................. xiii CHAPTER 1. INTRODUCTION ................................................................................................................... 1 1.1. Analog signal processing solution for image alignment ........................................ 3 1.2. Analog Signal Processing Solution for Machine Vision Applications ..................... 5 1.3. Memristor-CMOS Analog Co-Processor for Acceleration of High- Performance Computing Applications ................................................................... 6 2. ANALOG SIGNAL PROCESSING FOR IMAGE ALIGNMENT ................................................... 8 2.1. Patch Based Stereo Correspondence Algorithm.................................................... 8 2.2. Review of Related Work ........................................................................................ 9 2.3. Reasons for Choosing Normalized Cross-correlation .......................................... 10 2.4. The Normalized Cross-correlation Algorithm ...................................................... 10 2.4.1. Fast NCC Algorithm: Sum Table Method for Calculating Denominator ........................................................................................... 12 2.4.2. Fast NCC Algorithm: Calculating the numerator .................................... 13 2.4.3. Modifications to the NCC algorithm ....................................................... 14 2.4.4. Reducing the number of shifts ............................................................... 14 2.4.5. Using diagonal elements ......................................................................... 15 2.5. Per-Pixel Stereo Correspondence Algorithm ....................................................... 17 2.5.1. Review of related work ........................................................................... 17 2.5.2. Disparity Estimation using overlapping blocks ....................................... 17 2.6. Hardware Architecture ........................................................................................ 18 2.6.1. Implementing the modified NCC algorithm in hardware ....................... 20 2.6.2. Implementing the SAD algorithm in hardware ....................................... 21 viii 2.7. Stereo Correspondence Algorithm Results .........................................................
Recommended publications
  • High Frequency Integrated MOS Filters C
    N94-71118 2nd NASA SERC Symposium on VLSI Design 1990 5.4.1 High Frequency Integrated MOS Filters C. Peterson International Microelectronic Products San Jose, CA 95134 Abstract- Several techniques exist for implementing integrated MOS filters. These techniques fit into the general categories of sampled and tuned continuous- time niters. Advantages and limitations of each approach are discussed. This paper focuses primarily on the high frequency capabilities of MOS integrated filters. 1 Introduction The use of MOS in the design of analog integrated circuits continues to grow. Competitive pressures drive system designers towards system-level integration. System-level integration typically requires interface to the analog world. Often, this type of integration consists predominantly of digital circuitry, thus the efficiency of the digital in cost and power is key. MOS remains the most efficient integrated circuit technology for mixed-signal integration. The scaling of MOS process feature sizes combined with innovation in circuit design techniques have also opened new high bandwidth opportunities for MOS mixed- signal circuits. There is a trend to replace complex analog signal processing with digital signal pro- cessing (DSP). The use of over-sampled converters minimizes the requirements for pre- and post filtering. In spite of this trend, there are many applications where DSP is im- practical and where signal conditioning in the analog domain is necessary, particularly at high frequencies. Applications for high frequency filters are in data communications and the read channel of tape, disk and optical storage devices where filters bandlimit the data signal and perform amplitude and phase equalization. Other examples include filtering of radio and video signals.
    [Show full text]
  • Lecture 10: Recurrent Neural Networks
    Lecture 10: Recurrent Neural Networks Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 10 - 1 May 4, 2017 Administrative A1 grades will go out soon A2 is due today (11:59pm) Midterm is in-class on Tuesday! We will send out details on where to go soon Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 10 - 2 May 4, 2017 Extra Credit: Train Game More details on Piazza by early next week Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 10 - 3 May 4, 2017 Last Time: CNN Architectures AlexNet Figure copyright Kaiming He, 2016. Reproduced with permission. Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 10 - 4 May 4, 2017 Last Time: CNN Architectures Softmax FC 1000 Softmax FC 4096 FC 1000 FC 4096 FC 4096 Pool FC 4096 3x3 conv, 512 Pool 3x3 conv, 512 3x3 conv, 512 3x3 conv, 512 3x3 conv, 512 3x3 conv, 512 3x3 conv, 512 Pool Pool 3x3 conv, 512 3x3 conv, 512 3x3 conv, 512 3x3 conv, 512 3x3 conv, 512 3x3 conv, 512 3x3 conv, 512 Pool Pool 3x3 conv, 256 3x3 conv, 256 3x3 conv, 256 3x3 conv, 256 Pool Pool 3x3 conv, 128 3x3 conv, 128 3x3 conv, 128 3x3 conv, 128 Pool Pool 3x3 conv, 64 3x3 conv, 64 3x3 conv, 64 3x3 conv, 64 Input Input VGG16 VGG19 GoogLeNet Figure copyright Kaiming He, 2016. Reproduced with permission. Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 10 - 5 May 4, 2017 Last Time: CNN Architectures Softmax FC 1000 Pool 3x3 conv, 64 3x3 conv, 64 3x3 conv, 64 relu 3x3 conv, 64 3x3 conv, 64 F(x) + x 3x3 conv, 64 ..
    [Show full text]
  • The Equivalence of Digital and Analog Signal Processing *
    Reprinted from I ~ FORMATIO:-; AXDCOXTROL, Volume 8, No. 5, October 1965 Copyright @ by AcademicPress Inc. Printed in U.S.A. INFORMATION A~D CO:";TROL 8, 455- 4G7 (19G5) The Equivalence of Digital and Analog Signal Processing * K . STEIGLITZ Departmentof Electrical Engineering, Princeton University, Princeton, l\rew J er.~e!J A specific isomorphism is constructed via the trallsform domaiIls between the analog signal spaceL2 (- 00, 00) and the digital signal space [2. It is then shown that the class of linear time-invariaIlt realizable filters is illvariallt under this isomorphism, thus demoIl- stratillg that the theories of processingsignals with such filters are identical in the digital and analog cases. This means that optimi - zatioll problems illvolving linear time-illvariant realizable filters and quadratic cost functions are equivalent in the discrete-time and the continuous-time cases, for both deterministic and random signals. FilIally , applicatiolls to the approximation problem for digital filters are discussed. LIST OF SY~IBOLS f (t ), g(t) continuous -time signals F (jCAJ), G(jCAJ) Fourier transforms of continuous -time signals A continuous -time filters , bounded linear transformations of I.J2(- 00, 00) {in}, {gn} discrete -time signals F (z), G (z) z-transforms of discrete -time signals A discrete -time filters , bounded linear transfornlations of ~ JL isomorphic mapping from L 2 ( - 00, 00) to l2 ~L 2 ( - 00, 00) space of Fourier transforms of functions in L 2( - 00,00) 512 space of z-transforms of sequences in l2 An(t ) nth Laguerre function * This work is part of a thesis submitted in partial fulfillment of requirements for the degree of Doctor of Engineering Scienceat New York University , and was supported partly by the National ScienceFoundation and partly by the Air Force Officeof Scientific Researchunder Contract No.
    [Show full text]
  • Image Retrieval Algorithm Based on Convolutional Neural Network
    Advances in Intelligent Systems Research, volume 133 2nd International Conference on Artificial Intelligence and Industrial Engineering (AIIE2016) Image Retrieval Algorithm Based on Convolutional Neural Network Hailong Liu 1, 2, Baoan Li 1, 2, *, Xueqiang Lv 1 and Yue Huang 3 1Beijing Key Laboratory of Internet Culture and Digital Dissemination Research, Beijing Information Science & Technology University, Beijing 100101, China 2Computer School, Beijing Information Science and Technology University, Beijing 100101, China 3Xuanwu Hospital Capital Medical University, 100053, China *Corresponding author Abstract—With the rapid development of computer technology low, can’t meet the needs of people. In order to overcome this and the increasing of multimedia data on the Internet, how to difficulty, the researchers from the image itself, and put quickly find the desired information in the massive data becomes forward the method of image retrieval based on content. The a hot issue. Image retrieval can be used to retrieve similar images, method is to extract the visual features of the image content: and the effect of image retrieval depends on the selection of image color, texture, shape etc., the image database to be detected features to a certain extent. Based on deep learning, through self- samples for similarity matching, retrieval and sample images learning ability of a convolutional neural network to extract more are similar to the image. The main process of the method is the conducive to the high-level semantic feature of image retrieval selection and extraction of features, but there are "semantic using convolutional neural network, and then use the distance gap" [2] between low-level features and high-level semantic metric function similar image.
    [Show full text]
  • Analog Signal Processing Circuits in Organic Transistor Technology a Dissertation Submitted to the Department of Electrical Engi
    ANALOG SIGNAL PROCESSING CIRCUITS IN ORGANIC TRANSISTOR TECHNOLOGY A DISSERTATION SUBMITTED TO THE DEPARTMENT OF ELECTRICAL ENGINEERING AND THE COMMITTEE ON GRADUATE STUDIES OF STANFORD UNIVERSITY IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY Wei Xiong October 2010 © 2011 by Wei Xiong. All Rights Reserved. Re-distributed by Stanford University under license with the author. This work is licensed under a Creative Commons Attribution- Noncommercial 3.0 United States License. http://creativecommons.org/licenses/by-nc/3.0/us/ This dissertation is online at: http://purl.stanford.edu/fk938sr9136 ii I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Boris Murmann, Primary Adviser I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Zhenan Bao I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Robert Dutton Approved for the Stanford University Committee on Graduate Studies. Patricia J. Gumport, Vice Provost Graduate Education This signature page was generated electronically upon submission of this dissertation in electronic format. An original signed hard copy of the signature page is on file in University Archives. iii Abstract Low-voltage organic thin-film transistors offer potential for many novel applications. Because organic transistors can be fabricated near room temperature, they allow integrated circuits to be made on flexible plastic substrates.
    [Show full text]
  • LSTM-In-LSTM for Generating Long Descriptions of Images
    Computational Visual Media DOI 10.1007/s41095-016-0059-z Vol. 2, No. 4, December 2016, 379–388 Research Article LSTM-in-LSTM for generating long descriptions of images Jun Song1, Siliang Tang1, Jun Xiao1, Fei Wu1( ), and Zhongfei (Mark) Zhang2 c The Author(s) 2016. This article is published with open access at Springerlink.com Abstract In this paper, we propose an approach by means of text (description generation) is a for generating rich fine-grained textual descriptions of fundamental task in artificial intelligence, with many images. In particular, we use an LSTM-in-LSTM (long applications. For example, generating descriptions short-term memory) architecture, which consists of an of images may help visually impaired people better inner LSTM and an outer LSTM. The inner LSTM understand the content of images and retrieve images effectively encodes the long-range implicit contextual using descriptive texts. The challenge of description interaction between visual cues (i.e., the spatially- generation lies in appropriately developing a model concurrent visual objects), while the outer LSTM that can effectively represent the visual cues in generally captures the explicit multi-modal relationship images and describe them in the domain of natural between sentences and images (i.e., the correspondence language at the same time. of sentences and images). This architecture is capable There have been significant advances in of producing a long description by predicting one description generation recently. Some efforts word at every time step conditioned on the previously rely on manually-predefined visual concepts and generated word, a hidden vector (via the outer LSTM), and a context vector of fine-grained visual cues (via sentence templates [1–3].
    [Show full text]
  • The History Began from Alexnet: a Comprehensive Survey on Deep Learning Approaches
    > REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 1 The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches Md Zahangir Alom1, Tarek M. Taha1, Chris Yakopcic1, Stefan Westberg1, Paheding Sidike2, Mst Shamima Nasrin1, Brian C Van Essen3, Abdul A S. Awwal3, and Vijayan K. Asari1 Abstract—In recent years, deep learning has garnered I. INTRODUCTION tremendous success in a variety of application domains. This new ince the 1950s, a small subset of Artificial Intelligence (AI), field of machine learning has been growing rapidly, and has been applied to most traditional application domains, as well as some S often called Machine Learning (ML), has revolutionized new areas that present more opportunities. Different methods several fields in the last few decades. Neural Networks have been proposed based on different categories of learning, (NN) are a subfield of ML, and it was this subfield that spawned including supervised, semi-supervised, and un-supervised Deep Learning (DL). Since its inception DL has been creating learning. Experimental results show state-of-the-art performance ever larger disruptions, showing outstanding success in almost using deep learning when compared to traditional machine every application domain. Fig. 1 shows, the taxonomy of AI. learning approaches in the fields of image processing, computer DL (using either deep architecture of learning or hierarchical vision, speech recognition, machine translation, art, medical learning approaches) is a class of ML developed largely from imaging, medical information processing, robotics and control, 2006 onward. Learning is a procedure consisting of estimating bio-informatics, natural language processing (NLP), cybersecurity, and many others.
    [Show full text]
  • Active Authentication Using an Autoencoder Regularized CNN-Based One-Class Classifier
    Active Authentication Using an Autoencoder Regularized CNN-based One-Class Classifier Poojan Oza and Vishal M. Patel Department of Electrical and Computer Engineering, Johns Hopkins University, 3400 N. Charles St, Baltimore, MD 21218, USA email: fpoza2, [email protected] Access Abstract— Active authentication refers to the process in Enrolled User Denied which users are unobtrusively monitored and authenticated Training Images continuously throughout their interactions with mobile devices. AA Access Generally, an active authentication problem is modelled as a Module Granted one class classification problem due to the unavailability of Trained Access data from the impostor users. Normally, the enrolled user is Denied considered as the target class (genuine) and the unauthorized users are considered as unknown classes (impostor). We propose Training a convolutional neural network (CNN) based approach for one class classification in which a zero centered Gaussian noise AA and an autoencoder are used to model the pseudo-negative Module class and to regularize the network to learn meaningful feature Test Images representations for one class data, respectively. The overall network is trained using a combination of the cross-entropy and (a) (b) the reconstruction error losses. A key feature of the proposed approach is that any pre-trained CNN can be used as the Fig. 1: An overview of a typical AA system. (a) Data base network for one class classification. Effectiveness of the corresponding to the enrolled user are used to train an AA proposed framework is demonstrated using three publically system. (b) During testing, data corresponding to the enrolled available face-based active authentication datasets and it is user as well as unknown user may be presented to the system.
    [Show full text]
  • Adaptive Translinear Analog Signal Processing a Prospectus
    ADAPTIVE TRANSLINEAR ANALOG SIGNAL PROCESSING A PROSPECTUS Eric J. McDonald, Koji Mensa Odame, and Bradley A. Minch School of Electrical and Computer Engineering Cornell University Ithaca, NY 14853-5401 [email protected] ABSTRACT are directly implementable as networks of MITES. We have devised a systematic method of transforming high- One common implementation of a MITE unit is shown level time-domain descriptions of linear and nonlinear adap- in Fig. 1. In this case, we operate a floating-gate PMOS tive signal-processing algorithms into compact, continuous- transistor in weak-inversion in addition to using a cascode time analog circuitry using basic units called multiple-input transistor to reduce the effects of the gate-to-drain capac- translinear elements (MITES). In this paper, we describe itance and channel-length modulation. The input voltages, the current state of the an and illustrate the method with an VI and V,, capacitively couple into the floating gate through unit-sized capacitors. The drain current of this device is example of an analog phase-locked loop (PLL). given by I = Isew(I’,+1’d/UT, 1. CIRCUIT SYNTHESIS METHODOLOGY where I, is a pre-exponential scaling current, w is the weight- The class of dyiamic translinear circuits [ 1-31 makes possi- ing cwfficient of the inputs (accounting for the body-effect 2.’ _.ble a promising approach to the structured design of analog and the capacitive voltage divider, Cunjt/Ctota,),and UT is ”. .‘ signal and information processing systems [4]. Such cir- the thermal voltage, kT/q. ’ cuits are capable of accurately realizing a wide range of lin- ear and nonlinear systems whose behavior can be described 2.
    [Show full text]
  • Analog Signal Processing Christophe Caloz, Fellow, IEEE, Shulabh Gupta, Member, IEEE, Qingfeng Zhang, Member, IEEE, and Babak Nikfal, Student Member, IEEE
    1 Analog Signal Processing Christophe Caloz, Fellow, IEEE, Shulabh Gupta, Member, IEEE, Qingfeng Zhang, Member, IEEE, and Babak Nikfal, Student Member, IEEE Abstract—Analog signal processing (ASP) is presented as a discrimination effects, emphasizing the concept of group delay systematic approach to address future challenges in high speed engineering, and presenting the fundamental concept of real- and high frequency microwave applications. The general concept time Fourier transformation. Next, it addresses the topic of of ASP is explained with the help of examples emphasizing basic ASP effects, such as time spreading and compression, the “phaser”, which is the core of an ASP system; it reviews chirping and frequency discrimination. Phasers, which repre- phaser technologies, explains the characteristics of the most sent the core of ASP systems, are explained to be elements promising microwave phasers, and proposes corresponding exhibiting a frequency-dependent group delay response, and enhancement techniques for higher ASP performance. Based hence a nonlinear phase response versus frequency, and various on ASP requirements, found to be phaser resolution, absolute phaser technologies are discussed and compared. Real-time Fourier transformation (RTFT) is derived as one of the most bandwidth and magnitude balance, it then describes novel fundamental ASP operations. Upon this basis, the specifications synthesis techniques for the design of all-pass transmission of a phaser – resolution, absolute bandwidth and magnitude and reflection phasers
    [Show full text]
  • Key Ideas and Architectures in Deep Learning Applications That (Probably) Use DL
    Key Ideas and Architectures in Deep Learning Applications that (probably) use DL Autonomous Driving Scene understanding /Segmentation Applications that (probably) use DL WordLens Prisma Outline of today’s talk Image Recognition Fun application using CNNs ● LeNet - 1998 ● Image Style Transfer ● AlexNet - 2012 ● VGGNet - 2014 ● GoogLeNet - 2014 ● ResNet - 2015 Questions to ask about each architecture/ paper Special Layers Non-Linearity Loss function Weight-update rule Train faster? Reduce parameters Reduce Overfitting Help you visualize? LeNet5 - 1998 LeNet5 - Specs MNIST - 60,000 training, 10,000 testing Input is 32x32 image 8 layers 60,000 parameters Few hours to train on a laptop Modified LeNet Architecture - Assignment 3 Training Input Forward pass Maxp Max Conv ReLU Conv ReLU FC ReLU Softmax ool pool Backpropagation - Labels Loss update weights Modified LeNet Architecture - Assignment 3 Testing Input Forward pass Maxp Max Conv ReLU Conv ReLU FC ReLU Softmax ool pool Output Compare output with labels Modified LeNet - CONV Layer 1 Input - 28 x 28 Output - 6 feature maps - each 24 x 24 Convolution filter - 5 x 5 x 1 (convolution) + 1 (bias) How many parameters in this layer? Modified LeNet - CONV Layer 1 Input - 32 x 32 Output - 6 feature maps - each 28 x 28 Convolution filter - 5 x 5 x 1 (convolution) + 1 (bias) How many parameters in this layer? (5x5x1+1)*6 = 156 Modified LeNet - Max-pooling layer Decreases the spatial extent of the feature maps, makes it translation-invariant Input - 28 x 28 x 6 volume Maxpooling with filter size 2
    [Show full text]
  • Current Mode Computational Circuits for Analog Signal Processing
    ISSN (Print) : 2320 – 3765 ISSN (Online): 2278 – 8875 International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering (An ISO 3297: 2007 Certified Organization) Vol. 3, Issue 4, April 2014 Current Mode Computational Circuits for Analog Signal Processing Amanpreet Kaur1, Rishikesh Pandey2 PG Student (VLSI), Department of ECE, ThaparUniversity, Patiala,Punjab, India 1 Assistant Professor, Department of ECE, Thapar University, Patiala, Punjab, India2 ABSTRACT: The paper presents current adder and subtractor circuits based on cascode current mirror with improved linearity and wide linear range. The proposed circuits can be used for analog signal processing applications such as amplifiers, operational transconductance amplifiers (OTA), Gm-C filters, etc. In the proposed circuits, cascode current mirror topology is employed to improve current mirroring operation. The proposed circuits have been simulated using TSMC 0.18µm CMOS process technology with a supply voltage of 1.8 V. The proposed circuits operate efficiently within the current range of 0 to 80µA with the permissible error percentage of less than 2.5%. The SPICE simulation results have been presented to demonstrate the effectiveness of the proposed circuits. KEYWORDS: Adaptive biasing, cascode current mirror, current mode circuits, adder, subtractor.. I.INTRODUCTION In low-voltage/ low-power analog systems, current-mode signal processing has been usually considered an attractive strategy due to its potential for high-speed operation and low-voltage compatibility [1]-[4]. The behaviour of electrical circuits is always the result of interplay between voltage and current. In current mode circuits (CMCs), the currents determine the complete circuit response. The voltage signals are irrelevant in determining the circuit performance.
    [Show full text]