An Integer Programming Approach to Deep Neural Networks with Binary Activation Functions

Total Page:16

File Type:pdf, Size:1020Kb

An Integer Programming Approach to Deep Neural Networks with Binary Activation Functions An Integer Programming Approach to Deep Neural Networks with Binary Activation Functions Jannis Kurtz 1 Bubacarr Bah 2 3 Abstract to be more robust to adversarial perturbations than the con- tinuous activation networks (Qin et al., 2020). Furthermore We study deep neural networks with binary ac- low-powered computations may benefit from discrete ac- tivation functions (BDNN), i.e. the activation tivations as a form of coarse quantizations (Plagianakos function only has two states. We show that the et al., 2001; Bengio et al., 2013; Courbariaux et al., 2015; BDNN can be reformulated as a mixed-integer Rastegari et al., 2016). Nevertheless, gradient descent-based linear program which can be solved to global op- training behaves like a black box, raising a lot of questions timality by classical integer programming solvers. regarding the explainability and interpretability of inter- Additionally, a heuristic solution algorithm is pre- nal representations (Hampson & Volper, 1990; Plagianakos sented and we study the model under data uncer- et al., 2001; Bengio et al., 2013). tainty, applying a two-stage robust optimization approach. We implemented our methods on ran- On the other hand, integer programming (IP) is known as a dom and real datasets and show that the heuristic powerful tool to model a huge class of real-world optimiza- version of the BDNN outperforms classical deep tion problems (Wolsey, 1998). Recently it was successfully neural networks on the Breast Cancer Wisconsin applied to machine learning problems involving sparsity dataset while performing worse on random data. constraints and to evaluate trained neural networks (Bertsi- mas et al., 2017; 2019b; Fischetti & Jo, 2018). 1. Introduction Contributions: In Section 3, we reformulate the binary neural network as an IP formulation, referred to in the sequel Deep learning (DL) methods have of-late reinvigorated inter- as a binary deep neural network (BDNN), which can be est in artificial intelligence and data science, and they have solved to global optimality by classical IP solvers. All our had many successful applications in computer vision, nat- results can also be applied to BDNNs with binary weights. ural language processing, and data analytics (LeCun et al., Solving the latter class of networks to optimality was still an 2015). The training of deep neural networks relies mostly on open problem. We note that it is straight forward to extend (stochastic) gradient descent, hence the use of differentiable the IP formulation to more general settings and several activation functions like ReLU, sigmoid or the hyperbolic variations of the model. We present a heuristic algorithm to tangent is the state of the art (Rumelhart et al., 1986; Good- calculate local optimal solutions of the BDNN in Section fellow et al., 2016). On the contrary discrete activations, 3.1 and apply a two-stage robust optimization approach to which may be more analogous to biological activations, protect the model against adversarial attacks in Section 4. present a training challenge due to non-differentiability and In Section 5 we present computational results for random even discontinuity. If additionally the weights are consid- and real datasets and compare the BDNN to a deep neural ered to be binary, the use of discrete activation networks can network using ReLU activations (DNN). Despite scalability reduce the computation and storage complexities, provides issues and a slightly worse accuracy on random datasets, the for better interpretation of solutions, and has the potential results indicate that the heuristic version outperforms the DNN on the Breast Cancer Wisconsin dataset. *Equal contribution 1Chair for Mathematics of Information Pro- cessing, RWTH Aachen University, Aachen, Germany 2African Institute for Mathematical Sciences, Cape Town, South Africa 2. Related Literature 3Division of Applied Mathematics, Stellenbosch University, Stel- lenbosch, South Africa. Correspondence to: Jannis Kurtz The interest in BDNN goes back to (McCulloch & Pitts, <[email protected]>. 1943) where BDNNs were used to simulate Boolean func- Workshop on Beyond first-order methods in ML systems at the 37 th tions. However, until the beginning of this century con- International Conference on Machine Learning, Vienna, Austria, certed efforts were made to train these networks either by 2020. Copyright 2020 by the author(s). specific schemes (Gray & Michel, 1992; Kohut & Stein- Deep Neural Networks with Binary Activation Functions bach, 2004) or via back propagation by modifications of the Given a set of labeled training samples gradient descent method (Widrow & Winter, 1988; Toms, i i n 1990; Barlett & Downs, 1992; Goodman & Zeng, 1994; X × Y = (x ; y ) j i 2 [m] ⊂ R × f0; 1g Corwin et al., 1994; Plagianakos et al., 2001). More recent we consider loss functions work regarding the back propagation, mostly motivated by the low complexity of computation and storage, build-on dK ` : f0; 1g × R ! R (2) the pioneering works of (Bengio et al., 2013; Courbariaux et al., 2015; Hubara et al., 2016; Rastegari et al., 2016; Kim and the task is to find the optimal weight matrices which & Smaragdis, 2016); see (Qin et al., 2020) for a detailed minimize the empirical loss over the training samples, i.e. survey. Regarding the generalization error of BDNN, it we want to solve the problem was already proved that the VC-Dimension of deep neural m networks with binary activation functions is ρ log(ρ), where X min ` yi; zi ρ is the number of weights of the BDNN; see (Baum & i=1 Haussler, 1989; Maass, 1994; Sakurai, 1993). s:t: zi = σK W K σK−1 : : : σ1 W 1xi ::: 8i 2 [m] One the other hand IP methods were successfully applied W k 2 dk×dk−1 8k 2 [K] to sparse classification problems (Bertsimas et al., 2017), R sparse regression (Bertsimas et al., 2019b) and to evaluate λk 2 R 8k 2 [K] trained neural networks (Fischetti & Jo, 2018). In (Icarte (3) et al., 2019) BDNNs with weights restricted to {−1; 1g are for given dimensions d0; : : : dK where d0 = n. We set trained by a hybrid method based on constraint program- dK =: L. All results in this work even hold for regression ming and mixed-integer programming. In (Khalil et al., problems, more precisely for loss functions 2018) the authors calculate optimal adversarial examples for ` : L × L ! BDNNs using a MIP formulation and integer propagation. r R R R Furthermore, robust optimization approaches were used to Pm i i where we minimize the empirical loss i=1 `r x ; z in- protect against adversarial attacks for other machine learn- stead. We use labels f0; 1g instead of the classical labels ing methods (Xu et al., 2009b;a; Bertsimas et al., 2019a). {−1; 1g due to ease of notation in our model. Nevertheless the same approach can be adapted to labels y~i 2 {−1; 1g 3. Discrete Neural Networks by adding the additional constraints In this work we study binary deep neural net- y~i = −1 + 2yi: works (BDNN), i.e. classical deep neural net- works with binary activation functions. As in Several variants of the model can be considered: the classical framework, for a given input vector x 2 n we study classification functions f of the form R • The output of the neural network is 0 or 1, defining f(x) = σK W K σK−1 W K−1 : : : ; σ1 W 1x ::: for the class the input is assigned to. As a loss function, weight matrices W k 2 dk×dk−1 and activation functions R we count the number of misclassified training samples. σk, which are applied component-wise. The dimension d k This variant can be modeled by choosing the last layer is called the width of the k-th layer . In contrast to the to have width d = 1 together with the loss function recent developments of the field we consider the activation K `(y; z) = jy − zj. functions to be binary, more precisely each function is of the form • The output of the neural network is a continuous vector z 2 RL. This case can be modeled by choosing the ac- ( K 0 if α < λk tivation function of the last layer σ as the identity and σk(α) = (1) 1 otherwise dK = L. Any classical loss functions maybe consider, 2 e.g. the squared loss `(x; z) = ky − zk2 . Note that a special case of this model is the autoencoder where in contrast to the continuous framework the input is for α 2 R where the parameters λk 2 R can be learned by our model simultaneously with the weight matrices which encoded into a vector with 0-1 entries. is normally not the case in the classical neural network • All results can easily be generalized to multiclass clas- approaches. Note that it is also possible to fix the values λk sification; see Section 3. in advance. In the following we use the notation [p] := f1; : : : ; pg for In the following lemma we show how to reformulate Prob- p 2 N. lem (3) as an integer program. Deep Neural Networks with Binary Activation Functions Lemma 1. Assume the euclidean norm of each data point since otherwise Constraint (6) would be violated. Note that i;1 in X is bounded by r, then Problem (3) is equivalent to the if uj = 1, then Constraint (5) is always satisfied since mixed-integer non-linear program all entries of W 1 are in [−1; 1] and all entries of xi are in 1 i m [−r; r] and therefore jW x j ≤ nr < M1. Similarly we X i;1 1 > i min ` yi; ui;K s:t: (4) can show that uj = 0 if and only if (wj ) x < λ1. Hence i;1 i i=1 u is the output of the first layer for data point x which 1 i i;1 is applied to W 2 in Constraints (7) and (8).
Recommended publications
  • Anastasios Kyrillidis
    Anastasios Kyrillidis CONTACT 3119 Duncan Hall INFORMATION 6100 Main Street, 77005 Tel: (+1) 713-348-4741 Houston, United States E-mail: [email protected] Website: akyrillidis.github.io RESEARCH Optimization for machine learning, convex and non-convex analysis and optimization, structured low dimensional INTERESTS models, large-scale computing, quantum computing. CADEMIC Rice University, Houston, USAA APPOINTMENTS Noah Harding Assistant Professor at Computer Science Dept. July 2018 - now University of Texas at Austin, Austin, USA Simons Foundation Postdoctoral Researcher November 2014 - August 2017 Member of WNCG PROFESSIONAL IBM T.J. Watson Research Center, New York (USA) APPOINTMENTS Goldstine PostDoctoral Fellow September 2017 - July 2018 EDUCATION Ecole´ Polytechnique F´ed´eralede Lausanne (EPFL), Lausanne, Switzerland Ph.D., School of Computer and Communication Sciences, September 2010 - October 2014. TEACHING AND Instructor SUPERVISING Rice University EXPERIENCE • COMP 414/514 — Optimization: Algorithms, complexity & approximations — Fall ‘19 • Students enrolled: 30 (28 G/2 UG) • Class evaluation — Overall quality: 1.54 (Rice mean: 1.76); Organization: 1.57 (Rice mean: 1.75); Chal- lenge: 1.5 (Rice 1.73) • Instructor evaluation — Effectiveness: 1.43 (Rice mean: 1.67); Presentation: 1.39 (Rice mean: 1.73); Knowledge: 1.29 (Rice mean: 1.56). — Fall ‘20 • Students enrolled: 30 (8 G/22 UG) • Class evaluation — Overall quality: 1.29 (Rice mean: 1.69); Organization: 1.28 (Rice mean: 1.69); Chal- lenge: 1.31 (Rice 1.68) • Instructor evaluation — Effectiveness: 1.21 (Rice mean: 1.58); Presentation: 1.17 (Rice mean: 1.63); Knowledge: 1.14 (Rice mean: 1.49). • COMP 545 — Advanced topics in optimization: From simple to complex ML systems — Spring ‘19 • Students enrolled: 10 • Class evaluation — Overall quality: 1.31 (Rice mean: 1.78); Organization: 1.23 (Rice mean: 1.79); Chal- lenge: 1.46 (Rice mean: 1.75).
    [Show full text]
  • University of California San Diego
    UNIVERSITY OF CALIFORNIA SAN DIEGO Sparse Recovery and Representation Learning A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Mathematics by Jingwen Liang Committee in charge: Professor Rayan Saab, Chair Professor Jelena Bradic Professor Massimo Franceschetti Professor Philip E. Gill Professor Tara Javidi 2020 Copyright Jingwen Liang, 2020 All rights reserved. The dissertation of Jingwen Liang is approved, and it is ac- ceptable in quality and form for publication on microfilm and electronically: Chair University of California San Diego 2020 iii DEDICATION To my loving family, friends and my advisor. iv TABLE OF CONTENTS Signature Page . iii Dedication . iv Table of Contents . .v List of Figures . vii Acknowledgements . viii Vita .............................................x Abstract of the Dissertation . xi Chapter 1 Introduction and Background . .1 1.1 Compressed Sensing and low-rank matrix recovery with prior infor- mations . .1 1.2 Learning Dictionary with Fast Transforms . .3 1.3 Deep Generative Model Using Representation Learning Techniques5 1.4 Contributions . .6 Chapter 2 Signal Recovery with Prior Information . .8 2.1 Introduction . .8 2.1.1 Compressive Sensing . .9 2.1.2 Low-rank matrix recovery . 10 2.1.3 Prior Information for Compressive Sensing and Low-rank matrix recovery . 12 2.1.4 Related Work . 13 2.1.5 Contributions . 21 2.1.6 Overview . 22 2.2 Low-rank Matrices Recovery . 22 2.2.1 Problem Setting and Notation . 22 2.2.2 Null Space Property of Low-rank Matrix Recovery . 23 2.3 Low-rank Matrix Recovery with Prior Information . 31 2.3.1 Support of low rank matrices .
    [Show full text]
  • This Booklet
    Applied Mathematics Department of Mathematical Sciences Research Projects 2018-2019 2 Forecasting solar and wind power outputs using deep learning approaches Research Team: Dr Bubacarr Bah In collaboration with University of Pretoria Highlights Identification of suitable locations of wind and solar farms in South Africa. Prediction of wind (solar) power from historical wind (solar) data, and other environmental factors, of South Africa using recurrent neural networks. Kalle Pihlajasaari / Wikimedia Commons / CC-BY-SA-3.0 Figure. Wind turbines at Darling, Western Cape. Applications Informed decision making by power producer in terms of integra- tion of renewable energies into existing electricity grids. Forecasting of energy prices. 3 Machine learning outcome prediction for coma patients Research Team: Dr Bubacarr Bah In collaboration with University of Cape Town Highlights Outcome prediction with Serial Neuron-Specific Enolase (SNSE) measurements from coma patients. The use of k-nearest neighbors (k-NN) for imputation of missing data shows promising results. Outcome prediction with EEG signals from coma patients. Using machine learning to determine the prognostic power of SNSE and EEG Figure. EEG and the determination of level of consciousness in coma patients. Applications Potential application in medical and health systems to improve diag- nosis (and treatment) of coma. 4 Data-driven river flow routing using deep learning Research Team: Dr Willie Brink and Prof Francois Smit MSc Student: Jaco Briers Highlights Predicting flow along the lower Orange River in South Africa, for improved release scheduling at the Vanderkloof Dam, using recur- rent neural networks as well as feedforward convolutional neural networks trained on historical time series records.
    [Show full text]
  • Tuesday July 9, 1.45-8.00 Pm
    Tuesday July 9, 1.45-8.00 pm 1.45-2.45 Combinatorial compressed sensing with expanders Bubacarr Bah Wegener Chair: Ana Gilbert Deep learning (invited session) Chair: Misha Belkin & Mahdi Soltanolkotabi 2:55-3:20 Reconciling modern machine learning practice and the classical bias-variance trade-off Mikhail Belkin, Daniel Hsu, Siyuan Ma & Soumik Mandal 3.20-3.45 Overparameterized Nonlinear Optimization with Applications to Neural Nets Samet Oymak 3.45-4.10 General Bounds for 1-Layer ReLU approximation Bolton R. Bailey & Matus Telgarsky 4.10-4.20 Short Break A9 Amphi 1 4.20-4.45 Generalization in deep nets: an empirical perspective Tom Goldstein 4.45-5.10 Neuron birth-death dynamics accelerates gradient descent and converges asymptotically Joan Bruna 5.45-8.00 Poster simposio Domaine du Haut-Carr´e Tuesday July 9, 1.45-8.00 pm 1.45-2.45 Combinatorial compressed sensing with expanders Bubacarr Bah Wegener Chair: Anna Gilbert Frame Theory Chair: Ole Christensen 2.55-3.20 Banach frames and atomic decompositions in the space of bounded operators on Hilbert spaces Peter Balazs 3.20-3.45 Frames by Iterations in Shift-invariant Spaces Alejandra Aguilera, Carlos Cabrelli, Diana Carbajal & Victoria Paternostro 3.45-4.10 Frame representations via suborbits of bounded operators Ole Christensen & Marzieh Hasannasabjaldehbakhani 4.10-4.20 Short Break 4.20-4.45 Sum-of-Squares Optimization and the Sparsity Structure A29 Amphi 2 of Equiangular Tight Frames Dmitriy Kunisky & Afonso Bandeira 4.45-5.10 Frame Potentials and Orthogonal Vectors Josiah Park 5.10-5.35
    [Show full text]
  • DIFFUSION MAPS 43 5.1 Clustering
    Di®usion Maps: Analysis and Applications Bubacarr Bah Wolfson College University of Oxford A dissertation submitted for the degree of MSc in Mathematical Modelling and Scienti¯c Computing September 2008 This thesis is dedicated to my family, my friends and my country for the unflinching moral support and encouragement. Acknowledgements I wish to express my deepest gratitude to my supervisor Dr Radek Erban for his constant support and valuable supervision. I consider myself very lucky to be supervised by a person of such a wonderful personality and enthusiasm. Special thanks to Prof I. Kevrekidis for showing interest in my work and for motivating me to do Section 4.3 and also for sending me some material on PCA. I am grateful to the Commonwealth Scholarship Council for funding this MSc and the Scholarship Advisory Board (Gambia) for nominating me for the Commonwealth Scholarship award. Special thanks to the President Alh. Yahya A.J.J. Jammeh, the SOS and the Permanent Secretary of the DOSPSE for being very instrumental in my nomination for this award. My gratitude also goes to my employer and former university, University of The Gambia, for giving me study leave. I appreciate the understanding and support given to me by my wife during these trying days of my studies and research. Contents 1 INTRODUCTION 2 1.1 The dimensionality reduction problem . 2 1.1.1 Algorithm . 4 1.2 Illustrative Examples . 5 1.2.1 A linear example . 5 1.2.2 A nonlinear example . 8 1.3 Discussion . 9 2 THE CHOICE OF WEIGHTS 13 2.1 Optimal Embedding .
    [Show full text]
  • Structured Msc in Mathematical Sciences - January 2021 Intake
    ` Structured MSc in Mathematical Sciences - January 2021 intake Courses overview and abstracts Dates Course Lecturer Affiliation 22-26 February Introduction to Computing and 2021 LaTeX Jan Groenewald AIMS South Africa Paul Taylor National Institutes of Health Python Programming Martha Kamkuemah Stellenbosch University The Geometry of Maxwell’s Tevian Dray Equations Corinne Manogue Oregon State University 1-19 March Ronnie Becker Financial Mathematics Hans-Georg Zimmerman AIMS South Africa 2021 Bubacarr Bah AIMS South Africa Pete Grindrod Oxford University Data Science Hans-Georg Zimmerman Fraunhofer IBMT African Centre for Advanced Experimental Mathematics with Yae Gaba Studies Sage Evans Doe Ocansey Johannes Keppler University Introduction to Random Systems, Information Theory, and related topics Stéphane Ouvry Université Paris Saclay 22 March to 9 Networks Phil Knight University of Strathclyde April 2021 Introduction to Multiscale Models and their Analysis Jacek Banasiak University of Pretoria Analytical Techniques in Mathematical Biology Lyndsay Kerr Edinburgh University 12- 30 April Wolfram Decker and 2021 Computational Algebra Gerhard Pfister TU Kaiserslautern Grae Worster University of Cambridge Fluid Dynamics Richard Katz University of Oxford 3-7 May Reading week 10-28 May Probability and Statistics Daniel Nickelsen AIMS South Africa 2021 Biophysics at the Microscale Daisuke Takagi University of Hawaii at Manoa Symmetry Analysis of Masood Khalique North-West University Differential Equations Abdul Kara University of the Witwatersrand
    [Show full text]
  • Bubacarr Bah CV
    Curriculum Vitae of Bubacarr Bah Address African Institute for Mathematical Sciences (AIMS) South Africa 6 Melrose Road Muizenberg 7945 South Africa Recent Employment History Oct 2016 { present German Research Chair (Senior Researcher) in Mathematics of Data Science, AIMS South Africa, & Senior Lecturer (Asst. Prof.), Stellenbosch University Sep 2014 { Sep 2016 Postdoctoral Fellow, Mathematics Department, & Institute for Computational Engineering & Sciences, University of Texas in Austin Sep 2012 { Aug 2014 Research Scientist (postdoc), Laboratory for Information and Inference Systems, Ecole Polytechnique Federale de Lausanne (EPFL) Education Sep 2008 { Aug 2012 PhD in Applied and Computational Mathematics, University of Edinburgh, UK Sep 2007 { Sep 2008 MSc in Mathematical Modelling & Scientific Computing, University of Oxford, UK Jan 2000 { Aug 2004 BSc in Mathematics and Physics, (summa cum laude), University of The Gambia Honours 2020 Alexander von Humboldt Award of German Research Chair Mathematics at AIMS South Africa 2016 Alexander von Humboldt Award of German Research Chair Mathematics at AIMS South Africa 2015 NSF-funded SIAM ICIAM15 Travel Award 2013 Travel Grant for Matheon Workshop 2013 on Compressed Sensing & its Applications 2011 SIAM Certificate of Recognition 2010 SIAM Best Student Paper Prize 2008 Edinburgh University Principal's Scholarship for PhD studies 2007 Commonwealth Scholarship for MSc studies 2005 Overall Best Student (valedictorian) Award 2005 Best Student of the Faculty of Science & Agriculture Award 2005 Vice Chancellor's Award Journal Publications 1. Improved restricted isometry constant bounds for Gaussian matrices; SIAM Journal on Matrix Analysis, Vol. 31(5) (2010) 2882{2898 (with J. Tanner). 2. Bounds of restricted isometry constants in extreme asymptotics: formulae for Gaussian matrices; Linear Algebra and its Applications, Vol.
    [Show full text]
  • Full Report (Pdf)
    Recognising African Excellence in Machine Learning The Inaugural Kambule and Maathai Awards August 2018 The Deep Learning Indaba 7/31/2018 Awards-Report-2018-markup - Google Docs Recognising African Excellence in Machine Learning: The Inaugural Kambule and Maathai Awards 2018 Issued August 2018. 2018. The Deep Learning Indaba. Typeset in Maven Pro. Corresponding Authors: Daniela Massiceti and Shakir Mohamed Contact: [email protected] This report can be viewed online at deeplearningindaba.com/reports Indaba Abantu Shakir Mohamed, DeepMind Ulrich Paquet, DeepMind Vukosi Marivate, University of Pretoria Willie Brink, Stellenbosch University Nyalleng Moorosi, Google AI Ghana Stephan Gouws, DeepMind Benjamin Rosman, Council for Scientific and Industrial research (CSIR), and University of the Witwatersrand Richard Klein, University of the Witwatersrand Avishkar Bhoopchand, DeepMind Kathleen Siminyu, Africa’s Talking Muthoni Wanyoike, InstaDeep, Kenya. Daniela Massiceti, University of Oxford Herman Kamper, Stellenbosch University 1 https://docs.google.com/document/d/1cD_XexDztnNAAKBa91d5l1CT16NGF5BUynL98RoaUMM/edit#heading=h.j5ghgepwl09f 2/17 7/31/2018 Awards-Report-2018-markup - Google Docs A Personal Message Reflecting on the outcomes of each of our programmes gives us the welcome and needed opportunity to examine our work with fresh eyes, to review how our programmes work together, and to think about what our programmes mean to us personally. For us, the creation of the Kambule and Maathai awards have been an instrument with which
    [Show full text]
  • AIMS Annual Report TEXT.Indd
    ANNUAL REPORT 2019 ABOUT US The African Institute for Mathematical Sciences (AIMS) is a pan-African network of centres of excellence for postgraduate education, research and public engagement in mathematical sciences. Its mission is to enable Africa’s brightest students to flourish as independent thinkers, problem solvers and innovators capable of driving Africa’s future scientific, educational and economic self-sufficiency. AIMS was founded in Cape Town, South Africa, in 2003. Since then AIMS centres have opened in Senegal (2011), Ghana (2012), Cameroon (2013) and Rwanda (2016). The pan-African network of AIMS centres is coordinated by the AIMS Next Einstein Initiative (AIMS-NEI).* This is the annual report of AIMS South Africa for the period 1 August 2018 to 31 July 2019. It includes an overview of all activities of AIMS South Africa and its associated projects, as well as the financial statements for the 2018 calendar year. Since AIMS South Africa opened in 2003, 812 students, of which 34% are women, from 38 different African countries have graduated from its core academic programme. AIMS South Africa has local association with the Universities of Cape Town (UCT), Stellenbosch (SU) and the Western Cape (UWC) and international association with the Universities of Cambridge, Oxford and Paris-Sud. AIMS South Africa offers: • An intensive one-year structured Master’s in Mathematical Sciences with intakes in August and January. • Specialised courses as part of regular postgraduate programmes at South African universities. • A well-established research centre which hosts regular workshops and conferences. • Professional development programmes for teachers. • Public engagement activities. * AIMS Next Einstein Initiative, Kigali, Rwanda, Email: [email protected].
    [Show full text]
  • Restricted Isometry Constants in Compressed Sensing
    Restricted Isometry Constants in Compressed Sensing Bubacarr Bah Doctor of Philosophy University of Edinburgh 2012 Declaration I declare that this thesis was composed by myself and that the work contained therein is my own, except where explicitly stated otherwise in the text. (Bubacarr Bah) 3 To my grandmother, Mama Juma Sowe. 4 Acknowledgement I wish to express my deepest gratitude to my supervisor, Professor Jared Tanner, for his guid- ance and support during the course of my PhD. I would follow this with special thanks to my second supervisor, Dr. Coralia Cartis, for her regular monitoring of my progress and her encouragement. Further special thanks to my sponsor the University of Edinburgh through the MTEM Scholarships created in 2007 by the Principal, Professor Sir Timothy O’Shea. My sincere gratitude goes to Professor Endre Suli of Oxford University who informed me of, and recommended me for, this PhD opening with Professor Jared Tanner and also to my MSc dissertation supervisor, Dr. Radek Erban of Oxford University, for recommending me for this PhD programme. Many thanks to both the admin and support staff of the School of Mathematics, University of Edinburgh, that contributed to making my PhD programme proceed smoothly. Similar thanks go to members of the Mathematics Teaching Organisation of the school. I would like to thank the following people for contributing to my research either by responding to a question I posed, discussing an issue pertaining to my research or giving me feedback on my research: Dennis Amelunxen, Jeff Blanchard, Burak Burke, Prof. David Donoho, Martin Lotz, Andrew Thompson, Ke Wei, Pavel Zhlobich.
    [Show full text]
  • Artificial Intelligence and Data Science for Society Special Track
    Artificial Intelligence and Data Science for Society Special Track International Conference on Intelligent Systems Design and Applications (ISDA 2019), Dec 2019 Session Information The use of Artificial Intelligence (AI) and Data Science (DS) approaches to tackle social challenges has been receiving attention over the last few years. This entails using Machine Learning, Statistics, Mathematics, and other related tools to tackle challenges that lead to Social Impact. This is exemplified by organisations that work with government, NGOs, private entities with the sole purpose of affecting some process that is a public good or at least has externalities that are public good. Overall this encompasses more than just using Data Science in social good settings; it involves understanding society, factors that contribute to or impede progress, building skill for public servants, creating receptive environments, sharing experiences within the sector with other and more. Covering all these factors is in the pursuit of having lasting impact. There are different paradigms being used to tackle problems in different areas and there is value in learning the insights gained in all these areas. We aim to provide a session where practitioners, industrial representatives and researchers in this field can share their experiences, present current research results and discuss the modelling or analysis in various problem areas. Goal 1: Research papers This session invites full research papers covering AI/DS Research, Practice & Capacity building in: ● Food, water, and utilities ● Public safety and disaster relief ● Inclusiveness ● Ethical issues, fairness in Machine Learning or Artificial Intelligence ● Efficiency and sustainability ● Government transparency ● Economic, social, and personal development ● Transportation ● Energy ● Smart cities ● Education ● Social services, unemployment and homelessness ● Healthcare ● Policymaking ● Environment ● And more….
    [Show full text]
  • Sparse Matrices for Weighted Sparse Recovery
    Sparse matrices for weighted sparse recovery Bubacarr Bah ∗ August 13, 2018 Abstract We derived the first sparse recovery guarantees for weighted ℓ1 minimization with sparse random matrices and the class of weighted sparse signals, using a weighted versions of the null space property to derive these guarantees. These sparse matrices from expender graphs can be applied very fast and have other better computational complexities than their dense counterparts. In addition we show that, using such sparse matrices, weighted sparse recovery with weighted ℓ1 minimization leads to sample complexities that are linear in the weighted sparsity of the signal and these sampling rates can be smaller than those of standard sparse recovery. Moreover, these results reduce to known results in standard sparse recovery and sparse recovery with prior information and the results are supported by numerical experiments. Keywords: Compressed sensing, sparse recovery, weighted ℓ1-minimization, weighted sparsity, sparse matrix, expander graph, sample complexity, nullspace property 1 Introduction Weighted sparse recovery was introduced by [26] for application to function interpolation but there has been a steady growth in interest in the area, [1, 2, 5, 7, 15, 25], since then. In particular [5] showed its equivalence to standard sparse recovery using weighted ℓ1 minimization. The setting considered here is as follows: for a signal of interest x RN ∈ which is k-sparse and a given measurement matrix A Rn×N , we perform measurements arXiv:1604.01067v2 [math.NA] 9 May 2016 ∈ y = Ax + e for a noise vector e with e η (this being the main difference with the k k1 ≤ previously mentioned works).
    [Show full text]