Learning Sparse Codes from Compressed Representations with Biologically Plausible Local Wiring Constraints
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Automatic Feature Learning Using Recurrent Neural Networks
Predicting purchasing intent: Automatic Feature Learning using Recurrent Neural Networks Humphrey Sheil Omer Rana Ronan Reilly Cardiff University Cardiff University Maynooth University Cardiff, Wales Cardiff, Wales Maynooth, Ireland [email protected] [email protected] [email protected] ABSTRACT influence three out of four major variables that affect profit. Inaddi- We present a neural network for predicting purchasing intent in an tion, merchants increasingly rely on (and pay advertising to) much Ecommerce setting. Our main contribution is to address the signifi- larger third-party portals (for example eBay, Google, Bing, Taobao, cant investment in feature engineering that is usually associated Amazon) to achieve their distribution, so any direct measures the with state-of-the-art methods such as Gradient Boosted Machines. merchant group can use to increase their profit is sorely needed. We use trainable vector spaces to model varied, semi-structured input data comprising categoricals, quantities and unique instances. McKinsey A.T. Kearney Affected by Multi-layer recurrent neural networks capture both session-local shopping intent and dataset-global event dependencies and relationships for user Price management 11.1% 8.2% Yes sessions of any length. An exploration of model design decisions Variable cost 7.8% 5.1% Yes including parameter sharing and skip connections further increase Sales volume 3.3% 3.0% Yes model accuracy. Results on benchmark datasets deliver classifica- Fixed cost 2.3% 2.0% No tion accuracy within 98% of state-of-the-art on one and exceed state-of-the-art on the second without the need for any domain / Table 1: Effect of improving different variables on operating dataset-specific feature engineering on both short and long event profit, from [22]. -
Exploring the Potential of Sparse Coding for Machine Learning
Portland State University PDXScholar Dissertations and Theses Dissertations and Theses 10-19-2020 Exploring the Potential of Sparse Coding for Machine Learning Sheng Yang Lundquist Portland State University Follow this and additional works at: https://pdxscholar.library.pdx.edu/open_access_etds Part of the Applied Mathematics Commons, and the Computer Sciences Commons Let us know how access to this document benefits ou.y Recommended Citation Lundquist, Sheng Yang, "Exploring the Potential of Sparse Coding for Machine Learning" (2020). Dissertations and Theses. Paper 5612. https://doi.org/10.15760/etd.7484 This Dissertation is brought to you for free and open access. It has been accepted for inclusion in Dissertations and Theses by an authorized administrator of PDXScholar. Please contact us if we can make this document more accessible: [email protected]. Exploring the Potential of Sparse Coding for Machine Learning by Sheng Y. Lundquist A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Computer Science Dissertation Committee: Melanie Mitchell, Chair Feng Liu Bart Massey Garrett Kenyon Bruno Jedynak Portland State University 2020 © 2020 Sheng Y. Lundquist Abstract While deep learning has proven to be successful for various tasks in the field of computer vision, there are several limitations of deep-learning models when com- pared to human performance. Specifically, human vision is largely robust to noise and distortions, whereas deep learning performance tends to be brittle to modifi- cations of test images, including being susceptible to adversarial examples. Addi- tionally, deep-learning methods typically require very large collections of training examples for good performance on a task, whereas humans can learn to perform the same task with a much smaller number of training examples. -
Text-To-Speech Synthesis
Generative Model-Based Text-to-Speech Synthesis Heiga Zen (Google London) February rd, @MIT Outline Generative TTS Generative acoustic models for parametric TTS Hidden Markov models (HMMs) Neural networks Beyond parametric TTS Learned features WaveNet End-to-end Conclusion & future topics Outline Generative TTS Generative acoustic models for parametric TTS Hidden Markov models (HMMs) Neural networks Beyond parametric TTS Learned features WaveNet End-to-end Conclusion & future topics Text-to-speech as sequence-to-sequence mapping Automatic speech recognition (ASR) “Hello my name is Heiga Zen” ! Machine translation (MT) “Hello my name is Heiga Zen” “Ich heiße Heiga Zen” ! Text-to-speech synthesis (TTS) “Hello my name is Heiga Zen” ! Heiga Zen Generative Model-Based Text-to-Speech Synthesis February rd, of Speech production process text (concept) fundamental freq voiced/unvoiced char freq transfer frequency speech transfer characteristics magnitude start--end Sound source fundamental voiced: pulse frequency unvoiced: noise modulation of carrier wave by speech information air flow Heiga Zen Generative Model-Based Text-to-Speech Synthesis February rd, of Typical ow of TTS system TEXT Sentence segmentation Word segmentation Text normalization Text analysis Part-of-speech tagging Pronunciation Speech synthesis Prosody prediction discrete discrete Waveform generation ) NLP Frontend discrete continuous SYNTHESIZED ) SEECH Speech Backend Heiga Zen Generative Model-Based Text-to-Speech Synthesis February rd, of Rule-based, formant synthesis [] -
Compressive Phase Retrieval Lei Tian
Compressive Phase Retrieval by Lei Tian B.S., Tsinghua University (2008) S.M., Massachusetts Institute of Technology (2010) Submitted to the Department of Mechanical Engineering in partial fulfillment of the requirements for the degree of Doctor of Philosophy at the MASSACHUSETTS INSTITUTE OF TECHNOLOGY June 2013 c Massachusetts Institute of Technology 2013. All rights reserved. Author.............................................................. Department of Mechanical Engineering May 18, 2013 Certified by. George Barbastathis Professor Thesis Supervisor Accepted by . David E. Hardt Chairman, Department Committee on Graduate Students 2 Compressive Phase Retrieval by Lei Tian Submitted to the Department of Mechanical Engineering on May 18, 2013, in partial fulfillment of the requirements for the degree of Doctor of Philosophy Abstract Recovering a full description of a wave from limited intensity measurements remains a central problem in optics. Optical waves oscillate too fast for detectors to measure anything but time{averaged intensities. This is unfortunate since the phase can reveal important information about the object. When the light is partially coherent, a complete description of the phase requires knowledge about the statistical correlations for each pair of points in space. Recovery of the correlation function is a much more challenging problem since the number of pairs grows much more rapidly than the number of points. In this thesis, quantitative phase imaging techniques that works for partially co- herent illuminations are investigated. In order to recover the phase information with few measurements, the sparsity in each underly problem and efficient inversion meth- ods are explored under the framework of compressed sensing. In each phase retrieval technique under study, diffraction during spatial propagation is exploited as an ef- fective and convenient mechanism to uniformly distribute the information about the unknown signal into the measurement space. -
Compressed Digital Holography: from Micro Towards Macro
Compressed digital holography: from micro towards macro Colas Schretter, Stijn Bettens, David Blinder, Beatrice Pesquet-Popescu, Marco Cagnazzo, Frederic Dufaux, Peter Schelkens To cite this version: Colas Schretter, Stijn Bettens, David Blinder, Beatrice Pesquet-Popescu, Marco Cagnazzo, et al.. Compressed digital holography: from micro towards macro. Applications of Digital Image Processing XXXIX, SPIE, Aug 2016, San Diego, United States. hal-01436102 HAL Id: hal-01436102 https://hal.archives-ouvertes.fr/hal-01436102 Submitted on 10 Jan 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Compressed digital holography: from micro towards macro Colas Schrettera,c, Stijn Bettensa,c, David Blindera,c, Béatrice Pesquet-Popescub, Marco Cagnazzob, Frédéric Dufauxb, and Peter Schelkensa,c aDept. of Electronics and Informatics (ETRO), Vrije Universiteit Brussel, Brussels, Belgium bLTCI, CNRS, Télécom ParisTech, Université Paris-Saclay, Paris, France ciMinds, Technologiepark 19, Zwijnaarde, Belgium ABSTRACT The age of computational imaging is merging the physical hardware-driven approach of photonics with advanced signal processing methods from software-driven computer engineering and applied mathematics. The compressed sensing theory in particular established a practical framework for reconstructing the scene content using few linear combinations of complex measurements and a sparse prior for regularizing the solution. -
Compressive Sensing Off the Grid
Compressive sensing off the grid Abstract— We consider the problem of estimating the fre- Indeed, one significant drawback of the discretization quency components of a mixture of s complex sinusoids approach is the performance degradation when the true signal from a random subset of n regularly spaced samples. Unlike is not exactly supported on the grid points, the so called basis previous work in compressive sensing, the frequencies are not assumed to lie on a grid, but can assume any values in the mismatch problem [13], [15], [16]. When basis mismatch normalized frequency domain [0; 1]. We propose an atomic occurs, the true signal cannot be sparsely represented by the norm minimization approach to exactly recover the unobserved assumed dictionary determined by the grid points. One might samples, which is then followed by any linear prediction method attempt to remedy this issue by using a finer discretization. to identify the frequency components. We reformulate the However, increasing the discretization level will also increase atomic norm minimization as an exact semidefinite program. By constructing a dual certificate polynomial using random the coherence of the dictionary. Common wisdom in com- kernels, we show that roughly s log s log n random samples pressive sensing suggests that high coherence would also are sufficient to guarantee the exact frequency estimation with degrade the performance. It remains unclear whether over- high probability, provided the frequencies are well separated. discretization is beneficial to solving the problems. Finer Extensive numerical experiments are performed to illustrate gridding also results in higher computational complexity and the effectiveness of the proposed method. -
Unsupervised Speech Representation Learning Using Wavenet Autoencoders Jan Chorowski, Ron J
1 Unsupervised speech representation learning using WaveNet autoencoders Jan Chorowski, Ron J. Weiss, Samy Bengio, Aaron¨ van den Oord Abstract—We consider the task of unsupervised extraction speaker gender and identity, from phonetic content, properties of meaningful latent representations of speech by applying which are consistent with internal representations learned autoencoding neural networks to speech waveforms. The goal by speech recognizers [13], [14]. Such representations are is to learn a representation able to capture high level semantic content from the signal, e.g. phoneme identities, while being desired in several tasks, such as low resource automatic speech invariant to confounding low level details in the signal such as recognition (ASR), where only a small amount of labeled the underlying pitch contour or background noise. Since the training data is available. In such scenario, limited amounts learned representation is tuned to contain only phonetic content, of data may be sufficient to learn an acoustic model on the we resort to using a high capacity WaveNet decoder to infer representation discovered without supervision, but insufficient information discarded by the encoder from previous samples. Moreover, the behavior of autoencoder models depends on the to learn the acoustic model and a data representation in a fully kind of constraint that is applied to the latent representation. supervised manner [15], [16]. We compare three variants: a simple dimensionality reduction We focus on representations learned with autoencoders bottleneck, a Gaussian Variational Autoencoder (VAE), and a applied to raw waveforms and spectrogram features and discrete Vector Quantized VAE (VQ-VAE). We analyze the quality investigate the quality of learned representations on LibriSpeech of learned representations in terms of speaker independence, the ability to predict phonetic content, and the ability to accurately re- [17]. -
Sparse Feature Learning for Deep Belief Networks
Sparse Feature Learning for Deep Belief Networks Marc'Aurelio Ranzato1 Y-Lan Boureau2,1 Yann LeCun1 1 Courant Institute of Mathematical Sciences, New York University 2 INRIA Rocquencourt {ranzato,ylan,[email protected]} Abstract Unsupervised learning algorithms aim to discover the structure hidden in the data, and to learn representations that are more suitable as input to a supervised machine than the raw input. Many unsupervised methods are based on reconstructing the input from the representation, while constraining the representation to have cer- tain desirable properties (e.g. low dimension, sparsity, etc). Others are based on approximating density by stochastically reconstructing the input from the repre- sentation. We describe a novel and efficient algorithm to learn sparse represen- tations, and compare it theoretically and experimentally with a similar machine trained probabilistically, namely a Restricted Boltzmann Machine. We propose a simple criterion to compare and select different unsupervised machines based on the trade-off between the reconstruction error and the information content of the representation. We demonstrate this method by extracting features from a dataset of handwritten numerals, and from a dataset of natural image patches. We show that by stacking multiple levels of such machines and by training sequentially, high-order dependencies between the input observed variables can be captured. 1 Introduction One of the main purposes of unsupervised learning is to produce good representations for data, that can be used for detection, recognition, prediction, or visualization. Good representations eliminate irrelevant variabilities of the input data, while preserving the information that is useful for the ul- timate task. One cause for the recent resurgence of interest in unsupervised learning is the ability to produce deep feature hierarchies by stacking unsupervised modules on top of each other, as pro- posed by Hinton et al. -
Sparse Penalties in Dynamical System Estimation
1 Sparsity Penalties in Dynamical System Estimation Adam Charles, M. Salman Asif, Justin Romberg, Christopher Rozell School of Electrical and Computer Engineering Georgia Institute of Technology, Atlanta, Georgia 30332-0250 Email: facharles6,sasif,jrom,[email protected] Abstract—In this work we address the problem of state by the following equations: estimation in dynamical systems using recent developments x = f (x ) + ν in compressive sensing and sparse approximation. We n n n−1 n (1) formulate the traditional Kalman filter as a one-step update yn = Gnxn + n optimization procedure which leads us to a more unified N framework, useful for incorporating sparsity constraints. where xn 2 R represents the signal of interest, N N We introduce three combinations of two sparsity conditions fn(·)jR ! R represents the (assumed known) evo- (sparsity in the state and sparsity in the innovations) M lution of the signal from time n − 1 to n, yn 2 R and write recursive optimization programs to estimate the M is a set of linear measurements of xn, n 2 R state for each model. This paper is meant as an overview N of different methods for incorporating sparsity into the is the associated measurement noise, and νn 2 R dynamic model, a presentation of algorithms that unify the is our modeling error for fn(·) (commonly known as support and coefficient estimation, and a demonstration the innovations). In the case where Gn is invertible that these suboptimal schemes can actually show some (N = M and the matrix has full rank), state estimation performance improvements (either in estimation error or at each iteration reduces to a least squares problem. -
Compressed Sensing Applied to Modeshapes Reconstruction Joseph Morlier, Dimitri Bettebghor
Compressed sensing applied to modeshapes reconstruction Joseph Morlier, Dimitri Bettebghor To cite this version: Joseph Morlier, Dimitri Bettebghor. Compressed sensing applied to modeshapes reconstruction. XXX Conference and exposition on structural dynamics (IMAC 2012), Jan 2012, Jacksonville, FL, United States. pp.1-8, 10.1007/978-1-4614-2425-3_1. hal-01852318 HAL Id: hal-01852318 https://hal.archives-ouvertes.fr/hal-01852318 Submitted on 1 Aug 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. OATAO is an open access repository that collects the work of Toulouse researchers and makes it freely available over the web where possible. This is an author-deposited version published in: http://oatao.univ-toulouse.fr/ Eprints ID: 5163 To cite this document: Morlier, Joseph and Bettebghor, Dimitri Compressed sensing applied to modeshapes reconstruction. (2011) In: IMAC XXX Conference and exposition on structural dynamics, 30 Jan – 02 Feb 2012, Jacksonville, USA. Any correspondence concerning this service should be sent to the repository administrator: [email protected] Compressed sensing applied to modeshapes reconstruction Joseph Morlier 1*, Dimitri Bettebghor 2 1 Université de Toulouse, ICA, ISAE DMSM ,10 avenue edouard Belin 31005 Toulouse cedex 4, France 2 Onera DTIM MS2M, 10 avenue Edouard Belin, Toulouse, France * Corresponding author, Email: [email protected], Phone no: + (33) 5 61 33 81 31, Fax no: + (33) 5 61 33 83 30 ABSTRACT Modal analysis classicaly used signals that respect the Shannon/Nyquist theory. -
Kalman Filtered Compressed Sensing
KALMAN FILTERED COMPRESSED SENSING Namrata Vaswani Dept. of ECE, Iowa State University, Ames, IA, [email protected] ABSTRACT that we address is: can we do better than performing CS at each time separately, if (a) the sparsity pattern (support set) of the transform We consider the problem of reconstructing time sequences of spa- coefficients’ vector changes slowly, i.e. every time, none or only a tially sparse signals (with unknown and time-varying sparsity pat- few elements of the support change, and (b) a prior model on the terns) from a limited number of linear “incoherent” measurements, temporal dynamics of its current non-zero elements is available. in real-time. The signals are sparse in some transform domain re- Our solution is motivated by reformulating the above problem ferred to as the sparsity basis. For a single spatial signal, the solu- as causal minimum mean squared error (MMSE) estimation with a tion is provided by Compressed Sensing (CS). The question that we slow time-varying set of dominant basis directions (or equivalently address is, for a sequence of sparse signals, can we do better than the support of the transform vector). If the support is known, the CS, if (a) the sparsity pattern of the signal’s transform coefficients’ MMSE solution is given by the Kalman filter (KF) [9] for this sup- vector changes slowly over time, and (b) a simple prior model on port. But what happens if the support is unknown and time-varying? the temporal dynamics of its current non-zero elements is available. The initial support can be estimated using CS [7]. -
Robust Compressed Sensing Using Generative Models
Robust Compressed Sensing using Generative Models Ajil Jalal ∗ Liu Liu ECE, UT Austin ECE, UT Austin [email protected] [email protected] Alexandros G. Dimakis Constantine Caramanis ECE, UT Austin ECE, UT Austin [email protected] [email protected] Abstract The goal of compressed sensing is to estimate a high dimensional vector from an underdetermined system of noisy linear equations. In analogy to classical compressed sensing, here we assume a generative model as a prior, that is, we assume the vector is represented by a deep generative model G : Rk ! Rn. Classical recovery approaches such as empirical risk minimization (ERM) are guaranteed to succeed when the measurement matrix is sub-Gaussian. However, when the measurement matrix and measurements are heavy-tailed or have outliers, recovery may fail dramatically. In this paper we propose an algorithm inspired by the Median-of-Means (MOM). Our algorithm guarantees recovery for heavy-tailed data, even in the presence of outliers. Theoretically, our results show our novel MOM-based algorithm enjoys the same sample complexity guarantees as ERM under sub-Gaussian assumptions. Our experiments validate both aspects of our claims: other algorithms are indeed fragile and fail under heavy-tailed and/or corrupted data, while our approach exhibits the predicted robustness. 1 Introduction Compressive or compressed sensing is the problem of reconstructing an unknown vector x∗ 2 Rn after observing m < n linear measurements of its entries, possibly with added noise: y = Ax∗ + η; where A 2 Rm×n is called the measurement matrix and η 2 Rm is noise. Even without noise, this is an underdetermined system of linear equations, so recovery is impossible without a structural assumption on the unknown vector x∗.