Programming the K-Means Clustering Algorithm in SQL

Total Page:16

File Type:pdf, Size:1020Kb

Programming the K-Means Clustering Algorithm in SQL Programming the K-means Clustering Algorithm in SQL Carlos Ordonez Teradata, NCR San Diego, CA, USA ABSTRACT by the database application programmer. Moreover, nowa- Using SQL has not been considered an efficient and feasible days SQL is the standard way to interact with a relational way to implement data mining algorithms. Although this is DBMS. So can SQL, as it exists today, be used to get an effi- true for many data mining, machine learning and statistical cient implementation of a clustering algorithm? This article algorithms, this work shows it is feasible to get an efficient shows the answer is yes for the popular K-means algorithm. SQL implementation of the well-known K-means clustering It is worth mentioning programming data mining algorithms algorithm that can work on top of a relational DBMS. The in SQL has not received too much attention in the database article emphasizes both correctness and performance. From literature. This is because SQL, being a database language, a correctness point of view the article explains how to com- is constrained to work with tables and columns, and then pute Euclidean distance, nearest-cluster queries and updat- it does not provide the flexibility and speed of a high level ing clustering results in SQL. From a performance point of programming language like C++ or Java. Summarizing, this view it is explained how to cluster large data sets defining article presents an efficient SQL implementation of the K- and indexing tables to store and retrieve intermediate and means algorithm that can work on top of a relational DBMS final results, optimizing and avoiding joins, optimizing and to cluster large data sets. simplifying clustering aggregations, and taking advantage of The article is organized as follows. Section 2 introduces sufficient statistics. Experiments evaluate scalability with definitions and an overview of the K-means algorithm. Sec- synthetic data sets varying size and dimensionality. The tion 3 introduces several alternatives and optimizations to proposed K-means implementation can cluster large data implement the K-means algorithm in SQL. Section 4 con- sets and exhibits linear scalability. tains experiments to evaluate performance with synthetic data sets. Section 5 discusses related work. Section 6 pro- vides general conclusions and directions for future work. Categories and Subject Descriptors H.2.8 [Database Management]: Database Applications- Data Mining 2. DEFINITIONS The basic input for K-means [2, 6] is a data set Y contain- ing n points in d dimensions, Y = fy1; y2; : : : ; yng, and k, General Terms the desired number of clusters. The output are three matri- Algorithms, Languages ces W; C; R, containing k weights, k means and k variances respectively corresponding to each cluster and a partition of Keywords Y into k subsets. Matrices C and R are d×k and W is k×1. Throughout this work three subscripts are used to index ma- Clustering, SQL, relational DBMS, integration trices: i = 1; : : : ; n; j = 1; : : : ; k; l = 1; : : : ; d. To refer to one column of C or R we use the j subscript (e.g. Cj ; Rj ); Cj 1. INTRODUCTION can be understood as a d-dimensional vector containing the There exist many efficient clustering algorithms in the centroid of the jth cluster having the respective squared ra- data mining literature. Most of them follow the approach diuses per dimension given by Rj . For transposition we will proposed in [14], minimizing disk access and doing most of use the T superscript. For instance Cj refers to the jth cen- T the work in main memory. Unfortunately, many of those al- troid in column form and Cj is the jth centroid in row form. gorithms are hard to implement inside a real DBMS where Let X1; X2; : : : ; Xk be the k subsets of Y induced by clusters 0 the programmer needs to worry about storage management, s.t. Xj \ Xj0 = ;; j 6= j . K-means uses Euclidean distance concurrent access, memory leaks, fault tolerance, security to find the nearest centroid to each input point, defined as T d 2 and so on. On the other hand, SQL has been growing over d(yi; Cj ) = (yi − Cj ) (yi − Cj ) = Pl=1(yli − Clj ) . the years to become a fairly comprehensive and complex K-means can be described at a high level as follows. Cen- query language where the aspects mentioned above are au- troids Cj are generally initialized with k points randomly tomatically handled for the most part or they can be tuned selected from Y for an approximation when there is an idea about potential clusters. The algorithm iterates executing c ACM, 2004 . This is the author's version of the work. It is posted here the E and the M steps starting from some initial solution un- by permission of ACM for your personal use. Not for redistribution. The definitive version was published in KDD Conference 2004, til cluster centroids become stable. The E step determines http://doi.acm.org/10.1145/1014052.1016921 the nearest cluster for each point and adds the point to it. That is, the E step determines cluster membership and par- to implement K-means. In general we omit Data Defini- titions Y into k subsets. The M step updates all centroids Cj tion Language (DDL) statements and deletion statements to by averaging points belonging to the same cluster. Then the make exposition more concise. Thus most of the SQL code k cluster weights Wj and the k diagonal variance matrices presented involves Data Manipulation Language (DML) state- Rj are updated based on the new Cj centroids. The quality ments. The columns making up the primary key of a table of a clustering solution is measured by the average quantiza- are underlined. Tables are indexed on their primary key for tion error q(C) (also known as squared assignment distance efficient join access. Subscripts i; j; l (see Section 2) are de- [6]). The goal of K-means is minimizing q(C), defined as fined as integer columns and the d numerical dimensions of n q(C) = Pi=1 d(yi; Cj )=n. where yi 2 Xj . This quantity points of Y , distances, and matrix entries of W; C; R are de- measures the average squared distance from each point to fined as FLOAT columns in SQL. Before each INSERT state- the cluster where it was assigned according to the parti- ment it is assumed there is a "DELETE FROM ... ALL;" tion into k subsets. K-means stops when q(C) changes by a statement that leaves the table empty before insertion. marginal fraction () in consecutive iterations. K-means is As introduced in Section 2 the input data set has d di- theoretically guaranteed to converge decreasing q(C) at each mensions. In database terms this means there exists a table iteration [6], but it is common to set a maximum number of Y with several numerical columns out of which d columns iterations to avoid long runs. are picked for clustering analysis. In practice the input ta- ble may have many more than d columns but to simplify exposition we will assume its definition is Y (Y1; Y2; ::; Yd). 3. IMPLEMENTING K-MEANS IN SQL So the SQL implementation needs to build its own reduced This section presents our main contributions. We explain version projecting the desired d columns. This motivates how to implement K-means in a relational DBMS by auto- defining the following "horizontal" table with d + 1 columns: matically generating SQL code given an input table Y with Y H(i; Y1; Y2; :::; Yd) having i as primary key. The first col- d selected numerical columns and k, the desired number of umn is the i subscript for each point and then Y H has the clusters as input as defined in Section 2. The SQL code list of d dimensions. This table saves Input/Output access generator dynamically creates SQL statements monitoring (I/O) since it may have fewer columns than Y and it is the difference of quality of the solution in consecutive itera- scanned several times during the algorithm progress. In tions to stop. There are two main schemes presented in here. general it is not guaranteed i (point id) exists because the The first one presents a simple implementation of K-means primary key of Y may consist of more than one column, or it explaining how to program each computation in SQL. We re- may not exist at all because Y is the result of some aggrega- fer to this scheme as the Standard K-means implementation. tions. In an implementation in an imperative programming The second scheme presents a more complex K-means im- language like C++ or Java the point identifier is immaterial plementation incorporating several optimizations that dra- since Y is accessed sequentially, but in a relational database matically improve performance. We call this scheme the it is essential. Therefore it is necessary to automatically Optimized K-means implementation. create i guaranteeing a unique identifier for each point yi. There are important assumptions behind our proposal The following statement computes a cumulative sum on one from a performance point of view. Two tables with n rows scan over Y to get i 2 f1 : : : ng and projects the desired d having the same primary key can be joined in time O(n) columns. using a hash-based join. So if a different DBMS does not provide hash-based indexing, joining tables may take longer INSERT INTO YH than O(n). However, the proposed scheme should still pro- SELECT sum(1) over(rows unbounded preceding) AS i vide the most efficient implementation even in such cases.
Recommended publications
  • An Array-Oriented Language with Static Rank Polymorphism
    An array-oriented language with static rank polymorphism Justin Slepak, Olin Shivers, and Panagiotis Manolios Northeastern University fjrslepak,shivers,[email protected] Abstract. The array-computational model pioneered by Iverson's lan- guages APL and J offers a simple and expressive solution to the \von Neumann bottleneck." It includes a form of rank, or dimensional, poly- morphism, which renders much of a program's control structure im- plicit by lifting base operators to higher-dimensional array structures. We present the first formal semantics for this model, along with the first static type system that captures the full power of the core language. The formal dynamic semantics of our core language, Remora, illuminates several of the murkier corners of the model. This allows us to resolve some of the model's ad hoc elements in more general, regular ways. Among these, we can generalise the model from SIMD to MIMD computations, by extending the semantics to permit functions to be lifted to higher- dimensional arrays in the same way as their arguments. Our static semantics, a dependent type system of carefully restricted power, is capable of describing array computations whose dimensions cannot be determined statically. The type-checking problem is decidable and the type system is accompanied by the usual soundness theorems. Our type system's principal contribution is that it serves to extract the implicit control structure that provides so much of the language's expres- sive power, making this structure explicitly apparent at compile time. 1 The Promise of Rank Polymorphism Behind every interesting programming language is an interesting model of com- putation.
    [Show full text]
  • Compendium of Technical White Papers
    COMPENDIUM OF TECHNICAL WHITE PAPERS Compendium of Technical White Papers from Kx Technical Whitepaper Contents Machine Learning 1. Machine Learning in kdb+: kNN classification and pattern recognition with q ................................ 2 2. An Introduction to Neural Networks with kdb+ .......................................................................... 16 Development Insight 3. Compression in kdb+ ................................................................................................................. 36 4. Kdb+ and Websockets ............................................................................................................... 52 5. C API for kdb+ ............................................................................................................................ 76 6. Efficient Use of Adverbs ........................................................................................................... 112 Optimization Techniques 7. Multi-threading in kdb+: Performance Optimizations and Use Cases ......................................... 134 8. Kdb+ tick Profiling for Throughput Optimization ....................................................................... 152 9. Columnar Database and Query Optimization ............................................................................ 166 Solutions 10. Multi-Partitioned kdb+ Databases: An Equity Options Case Study ............................................. 192 11. Surveillance Technologies to Effectively Monitor Algo and High Frequency Trading ..................
    [Show full text]
  • Application and Interpretation
    Programming Languages: Application and Interpretation Shriram Krishnamurthi Brown University Copyright c 2003, Shriram Krishnamurthi This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States License. If you create a derivative work, please include the version information below in your attribution. This book is available free-of-cost from the author’s Web site. This version was generated on 2007-04-26. ii Preface The book is the textbook for the programming languages course at Brown University, which is taken pri- marily by third and fourth year undergraduates and beginning graduate (both MS and PhD) students. It seems very accessible to smart second year students too, and indeed those are some of my most successful students. The book has been used at over a dozen other universities as a primary or secondary text. The book’s material is worth one undergraduate course worth of credit. This book is the fruit of a vision for teaching programming languages by integrating the “two cultures” that have evolved in its pedagogy. One culture is based on interpreters, while the other emphasizes a survey of languages. Each approach has significant advantages but also huge drawbacks. The interpreter method writes programs to learn concepts, and has its heart the fundamental belief that by teaching the computer to execute a concept we more thoroughly learn it ourselves. While this reasoning is internally consistent, it fails to recognize that understanding definitions does not imply we understand consequences of those definitions. For instance, the difference between strict and lazy evaluation, or between static and dynamic scope, is only a few lines of interpreter code, but the consequences of these choices is enormous.
    [Show full text]
  • Handout 16: J Dictionary
    J Dictionary Roger K.W. Hui Kenneth E. Iverson Copyright © 1991-2002 Jsoftware Inc. All Rights Reserved. Last updated: 2002-09-10 www.jsoftware.com . Table of Contents 1 Introduction 2 Mnemonics 3 Ambivalence 4 Verbs and Adverbs 5 Punctuation 6 Forks 7 Programs 8 Bond Conjunction 9 Atop Conjunction 10 Vocabulary 11 Housekeeping 12 Power and Inverse 13 Reading and Writing 14 Format 15 Partitions 16 Defined Adverbs 17 Word Formation 18 Names and Displays 19 Explicit Definition 20 Tacit Equivalents 21 Rank 22 Gerund and Agenda 23 Recursion 24 Iteration 25 Trains 26 Permutations 27 Linear Functions 28 Obverse and Under 29 Identity Functions and Neutrals 30 Secondaries 31 Sample Topics 32 Spelling 33 Alphabet and Numbers 34 Grammar 35 Function Tables 36 Bordering a Table 37 Tables (Letter Frequency) 38 Tables 39 Classification 40 Disjoint Classification (Graphs) 41 Classification I 42 Classification II 43 Sorting 44 Compositions I 45 Compositions II 46 Junctions 47 Partitions I 48 Partitions II 49 Geometry 50 Symbolic Functions 51 Directed Graphs 52 Closure 53 Distance 54 Polynomials 55 Polynomials (Continued) 56 Polynomials in Terms of Roots 57 Polynomial Roots I 58 Polynomial Roots II 59 Polynomials: Stopes 60 Dictionary 61 I. Alphabet and Words 62 II. Grammar 63 A. Nouns 64 B. Verbs 65 C. Adverbs and Conjunctions 66 D. Comparatives 67 E. Parsing and Execution 68 F. Trains 69 G. Extended and Rational Arithmeti 70 H. Frets and Scripts 71 I. Locatives 72 J. Errors and Suspensions 73 III. Definitions 74 Vocabulary 75 = Self-Classify - Equal 76 =. Is (Local) 77 < Box - Less Than 78 <.
    [Show full text]
  • A Guide to PL/M Programming
    INTEL CORP. 3065 Bowers Avenue, Santa Clara, California 95051 • (408) 246-7501 mcs=a A Guide to PL/M programming PL/M is a new high level programming language designed specifically for Intel's 8 bit microcomputers. The new language gives the microcomputer systems program­ mer the same advantages of high level language programming currently available in the mini and large computer fields. Designed to meet the special needs of systems programming, the new language will drastically cut microcomputer programming time and costs without sacrifice of program efficiency. in addition, training, docu­ mentation, program maintenance and the inclusion of library subroutines will all be made correspondingly easier. PL/M is well suited for all microcomputer program­ ming applications, retaining the control and efficiency of assembly language, while greatly reducing programming effort The PL/M compiler is written in ANSI stand­ ard Fortran I V and thus will execute on most machines without alteration. SEPTEMBER 1973 REV.1 © Intel Corporation 1973 TABLE OF CONTENTS Section Page I. INTRODUCTION TO PL/M 1 II. A TUTORIAL APPROACH TO PL/M ............................ 2 1. The Organization of a PL/M Program ........................ 2 2. Basic Constituents of a PL/M Program ....................... 4 2.1. PL/M Character Set ................................ 4 2.2. Identifiers and Reserved Words ....................... 5 2.3. Comments . 7 3. PL/M Statement Organization . 7 4. PL/M Data Elements . 9 4.1. Variable Declarations ............................... 9 4.2. Byte and Double Byte Constants. .. 10 5. Well-Formed Expressions and Assignments . 11 6. A Simple Example. 15 7. DO-Groups. 16 7.1. The DO-WHILE Group ...........
    [Show full text]
  • Visual Programming Language for Tacit Subset of J Programming Language
    Visual Programming Language for Tacit Subset of J Programming Language Nouman Tariq Dissertation 2013 Erasmus Mundus MSc in Dependable Software Systems Department of Computer Science National University of Ireland, Maynooth Co. Kildare, Ireland A dissertation submitted in partial fulfilment of the requirements for the Erasmus Mundus MSc Dependable Software Systems Head of Department : Dr Adam Winstanley Supervisor : Professor Ronan Reilly June 30, 2013 Declaration I hereby certify that this material, which I now submit for assessment on the program of study leading to the award of Master of Science in Dependable Software Systems, is entirely my own work and has not been taken from the work of others save and to the extent that such work has been cited and acknowledged within the text of my work. Signed:___________________________ Date:___________________________ Abstract Visual programming is the idea of using graphical icons to create programs. I take a look at available solutions and challenges facing visual languages. Keeping these challenges in mind, I measure the suitability of Blockly and highlight the advantages of using Blockly for creating a visual programming environment for the J programming language. Blockly is an open source general purpose visual programming language designed by Google which provides a wide range of features and is meant to be customized to the user’s needs. I also discuss features of the J programming language that make it suitable for use in visual programming language. The result is a visual programming environment for the tacit subset of the J programming language. Table of Contents Introduction ............................................................................................................................................ 7 Problem Statement ............................................................................................................................. 7 Motivation..........................................................................................................................................
    [Show full text]
  • Frederick Phillips Brooks, Jr
    Frederick Phillips Brooks, Jr. Biography Frederick P. Brooks, Jr., was born in 1931 in Durham NC, and grew up in Greenville NC. He received an A.B. summa cum laude in physics from Duke and a Ph.D. in computer science from Harvard, under Howard Aiken, the inventor of the early Harvard computers. He joined IBM, working in Poughkeepsie and Yorktown, NY, 1956-1965. He was an architect of the Stretch and Harvest computers and then was the project manager for the development of IBM's System/360 family of computers and then of the Operating System/360 software. For this work he received a National Medal of Technology jointly with Bob O. Evans and Erich Bloch Brooks and Dura Sweeney in 1957 patented an interrupt system for the IBM Stretch computer that introduced most features of today's interrupt systems. He coined the term computer architecture. His System/360 team first achieved strict compatibility, upward and downward, in a computer family. His early concern for word processing led to his selection of the 8-bit byte and the lowercase alphabet for the System/360, engineering of many new 8-bit input/output devices, and introduction of a character-string datatype in the PL/I programming language. In 1964 he founded the Computer Science Department at the University of North Carolina at Chapel Hill and chaired it for 20 years. Currently, he is Kenan Professor of Computer Science. His principal research is in real-time, three-dimensional, computer graphics—“virtual reality.” His research has helped biochemists solve the structure of complex molecules and enabled architects to “walk through” structures still being designed.
    [Show full text]
  • Symantec™ Endpoint Protection 14.X for Linux Client Guide Contents
    Symantec™ Endpoint Protection 14.x for Linux Client Guide Contents Chapter 1 Protecting Linux computers with Symantec Endpoint Protection ....................................................... 3 About Symantec Endpoint Protection for Linux ..................................... 3 Symantec Endpoint Protection client for Linux system requirements ........................................................................... 4 Getting started on the Linux client ...................................................... 5 About the Linux client graphical user interface ...................................... 7 What happens when a virus is detected ........................................ 7 Importing client-server communication settings into the Linux client ..................................................................................... 8 Uninstalling the Symantec Endpoint Protection client for Linux ................ 9 Appendix A Symantec Endpoint Protection client for Linux command line reference .............................................. 11 rtvscand ..................................................................................... 12 sav ............................................................................................ 15 savtray ....................................................................................... 22 smcd ......................................................................................... 24 symcfg ....................................................................................... 27 symcfgd ....................................................................................
    [Show full text]
  • KDB+ Quick Guide
    KKDDBB++ -- QQUUIICCKK GGUUIIDDEE http://www.tutorialspoint.com/kdbplus/kdbplus_quick_guide.htm Copyright © tutorialspoint.com KKDDBB++ -- OOVVEERRVVIIEEWW This is a complete quide to kdb+ from kx systems, aimed primarily at those learning independently. kdb+, introduced in 2003, is the new generation of the kdb database which is designed to capture, analyze, compare, and store data. A kdb+ system contains the following two components − KDB+ − the database kdatabaseplus Q − the programming language for working with kdb+ Both kdb+ and q are written in k programming language (same as q but less readable). Background Kdb+/q originated as an obscure academic language but over the years, it has gradually improved its user friendliness. APL 1964, AProgrammingLanguage A+ 1988, modifiedAPLbyArthurWhitney K 1993, crispversionofA + , developedbyA. Whitney Kdb 1998, in − memorycolumn − baseddb Kdb+/q 2003, q language – more readable version of k Why and Where to Use KDB+ Why? − If you need a single solution for real-time data with analytics, then you should consider kdb+. Kdb+ stores database as ordinary native files, so it does not have any special needs regarding hardware and storage architecture. It is worth pointing out that the database is just a set of files, so your administrative work won’t be difficult. Where to use KDB+? − It’s easy to count which investment banks are NOT using kdb+ as most of them are using currently or planning to switch from conventional databases to kdb+. As the volume of data is increasing day by day, we need a system that can handle huge volumes of data. KDB+ fulfills this requirement. KDB+ not only stores an enormous amount of data but also analyzes it in real time.
    [Show full text]
  • Software Conceptual Integrity: Deconstruction, Then Reconstruction
    Software Conceptual Integrity: Deconst, Then Reconst Iaakov Exman Software Conceptual Integrity: Deconstruction, Then Reconstruction Iaakov Exman 1 Software Engineering Department The Jerusalem College of Engineering- Azrieli Jerusalem, Israel [email protected] Computer Classification System Software and its engineering → Software creation and management → Designing software {CCS 2012} Abstract Conceptual Integrity is the most important consideration for software system design, as stated by Frederick Brooks. Brooks also suggested that Conceptual Integrity can be attained by means of design principles, such as Propriety, and Orthogonality. However, Brooks’ principles have not been formalized, posing obstacles to their application in practice, and to a deeper comprehension of Conceptual Integrity. This paper has three goals: first, to achieve deeper comprehension of Conceptual Integrity by deconstructing it into two phases, viz. Conceptualization and Modularization , iteratively applied during software system design; second, to show that the algebraic Linear Software Models already provide the hitherto lacking formalization of Brooks’ design principles, which surprisingly belong mainly to the Modularization phase; third, to reconstruct Conceptualization and Modularization, preserving the desirable tension between: a- phases’ separation , each with its own specific formal manipulation techniques; b- precise transition between these phases, consisting of explicit mutual relationships. The tension stems from the Modularity Matrix linking two
    [Show full text]
  • Krust: a Formal Executable Semantics of Rust
    KRust: A Formal Executable Semantics of Rust Feng Wang∗, Fu Song∗, Min Zhang†, Xiaoran Zhu† and Jun Zhang∗ ∗School of Information Science and Technology, ShanghaiTech University, Shanghai, China †Shanghai Key Laboratory of Trustworthy Computing, East China Normal University, Shanghai, China Abstract—Rust is a new and promising high-level system object is a variable binding. When a variable goes out of its programming language. It provides both memory safety and scope, Rust will free the bound resources. The ownership of thread safety through its novel mechanisms such as ownership, an object can be transferred, after which the object cannot moves and borrows. Ownership system ensures that at any point there is only one owner of any given resource. The ownership of be accessed via the outdated owner. This ensures that there a resource can be moved or borrowed according to the lifetimes. is exactly one binding to any object. Instead of transferring The ownership system establishes a clear lifetime for each value ownership, Rust provides borrows which allows an object to and hence does not necessarily need garbage collection. These be shared by multiple references. There are two kinds of novel features bring Rust high performance, fine low-level control borrows in Rust: mutable references and immutable references. of C and C++, and unnecessity in garbage collection, which differ Rust from other existing prevalent languages. For formal A mutable reference can mutate the borrowed object if the analysis of Rust programs and helping programmers learn its object is mutable, while an immutable reference cannot mutate new mechanisms and features, a formal semantics of Rust is the borrowed object even the object itself is mutable.
    [Show full text]
  • A Declarative Metaprogramming Language for Digital Signal
    Vesa Norilo Kronos: A Declarative Centre for Music and Technology University of Arts Helsinki, Sibelius Academy Metaprogramming PO Box 30 FI-00097 Uniarts, Finland Language for Digital [email protected] Signal Processing Abstract: Kronos is a signal-processing programming language based on the principles of semifunctional reactive systems. It is aimed at efficient signal processing at the elementary level, and built to scale towards higher-level tasks by utilizing the powerful programming paradigms of “metaprogramming” and reactive multirate systems. The Kronos language features expressive source code as well as a streamlined, efficient runtime. The programming model presented is adaptable for both sample-stream and event processing, offering a cleanly functional programming paradigm for a wide range of musical signal-processing problems, exemplified herein by a selection and discussion of code examples. Signal processing is fundamental to most areas handful of simple concepts. This language is a good of creative music technology. It is deployed on fit for hardware—ranging from CPUs to GPUs and both commodity computers and specialized sound- even custom-made DSP chips—but unpleasant for processing hardware to accomplish transformation humans to work in. Human programmers are instead and synthesis of musical signals. Programming these presented with a very high-level metalanguage, processors has proven resistant to the advances in which is compiled into the lower-level data flow. general computer science. Most signal processors This programming method is called Kronos. are programmed in low-level languages, such as C, often thinly wrapped in rudimentary C++. Such a workflow involves a great deal of tedious detail, as Musical Programming Paradigms these languages do not feature language constructs and Environments that would enable a sufficiently efficient imple- mentation of abstractions that would adequately The most prominent programming paradigm for generalize signal processing.
    [Show full text]