Continuous-Time and Companding Digital Signal Processors Using Adaptivity and Asynchronous Techniques

Total Page:16

File Type:pdf, Size:1020Kb

Continuous-Time and Companding Digital Signal Processors Using Adaptivity and Asynchronous Techniques Continuous-Time and Companding Digital Signal Processors Using Adaptivity and Asynchronous Techniques Christos Vezyrtzis Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Graduate School of Arts and Sciences COLUMBIA UNIVERSITY 2013 ©2013 Christos Vezyrtzis All Rights Reserved Abstract Continuous-Time and Companding Digital Signal Processors Using Adaptivity and Asynchronous Techniques Christos Vezyrtzis The fully synchronous approach has been the norm for digital signal processors (DSPs) for many decades. Due to its simplicity, the classical DSP structure has been used in many appli- cations. However, due to its rigid discrete-time operation, a classical DSP has limited efficiency or inadequate resolution for some emerging applications, such as processing of multimedia and biological signals. This thesis proposes fundamentally new approaches to designing DSPs, which are different from the classical scheme. The defining characteristic of all new DSPs examined in this thesis is the notion of “adaptivity” or “adaptability.” Adaptive DSPs dynamically change their behavior to adjust to some property of their input stream, for example the rate of change of the input. This thesis presents both enhancements to existing adaptive DSPs, as well as new adaptive DSPs. The main class of DSPs that are examined throughout the thesis are continuous-time (CT) DSPs. CT DSPs are clock-less and event-driven; they naturally adapt their activity and power consumption to the rate of their inputs. The absence of a clock also provides a complete avoidance of aliasing in the frequency domain, hence improved signal fidelity. The core of this thesis deals with the complete and systematic design of a truly general-purpose CT DSP. A scalable design methodology for CT DSPs is presented. This leads to the main con- tribution of this thesis, namely a new CT DSP chip. This chip is the first general-purpose CT DSP chip, able to process many different classes of CT and synchronous signals. The chip has the property of handling various types of signals, i.e. various different digital modulations, both synchronous and asynchronous, without requiring any reconfiguration; such property is presented for the first time CT DSPs and is impossible for classical DSPs. As opposed to previous CT DSPs, which were limited to using only one type of digital format, and whose design was hard to scale for different bandwidths and bit-widths, this chip has a formal, robust and scalable design, due to the systematic usage of asynchronous design techniques. The second contribution of this thesis is a complete methodology to design adaptive delay lines. In particular, it is shown how to make the granularity, i.e. the number of stages, adaptive in a real- time delay line. Adaptive granularity brings about a significant improvement in the line’s power consumption, up to 70% as reported by simulations on two design examples. This enhancement can have a direct large power impact on any CT DSP, since a delay line consumes the majority of a CT DSP’s power. The robust methodology presented in this thesis allows safe dynamic reconfiguration of the line’s granularity, on-the-fly and according to the input traffic. As a final contribution, the thesis also examines two additional DSPs: one operating the CT domain and one using the companding technique. The former operates only on level-crossing samples; the proposed methodology shows a potential for high-quality outputs by using a complex interpolation function. Finally, a companding DSP is presented for MPEG audio. Companding DSPs adapt their dynamic range to the amplitude of their input; the resulting can offer high-quality outputs even for small inputs. By applying companding to MPEG DSPs, it is shown how the DSP distortion can be made almost inaudible, without requiring complex arithmetic hardware. Contents List of Figures vii List of Tables xvi 1 Introduction 1 1.1 Overview of classical DSP systems . .2 1.1.1 Structure and operation . .2 1.1.2 Signal encodings and formats . .4 1.2 Limitations of classical DSPs . .5 1.2.1 Effect of reduced signal amplitude . .6 1.2.2 Power waste during quiet inputs . .7 1.2.3 Dependence of the frequency response on the clock . .8 1.2.4 Limitation in the DSP resolution . .9 1.3 Research focus . 11 1.3.1 Research challenges . 12 1.4 Asynchronous design methodologies for continuous-time DSPs . 13 1.4.1 A CT DSP chip for multiple digital formats and sample rates . 15 i 1.4.2 Adaptive granularity management for delay lines . 17 1.5 Adaptive DSPs: level-crossing and companding DSPs . 19 1.5.1 A method for high-resolution processing of level-crossing encoded signals 20 1.5.2 Companding DSPs for MPEG-encoded audio . 21 1.6 Contribution of this thesis . 22 1.7 Structure of the thesis . 24 2 Background 26 2.1 Continuous time (CT) DSPs . 26 2.1.1 Introduction: CT DSP theory . 27 2.1.2 CT DSP implementations: previous work . 29 2.1.3 Limitations of previous CT DSP prototypes . 31 2.2 Asynchronous system design . 33 2.2.1 Asynchronous communication protocols . 35 2.2.2 Asynchronous data encoding . 36 2.2.3 Asynchronous controllers and burst-mode machines . 40 3 A Method for Processing Level-Crossing-Encoded Signals 42 3.1 Uniform sampling vs. level-crossing sampling . 43 3.2 Motivation for use of LCS DSPs . 46 3.3 Interpolation in LCS processing . 47 3.4 Method for processing LCS-encoded signals . 50 3.4.1 Basic principle . 51 3.4.2 Proper sampling and reconstruction . 52 ii 3.4.3 Implementation issues . 55 3.5 Simulation results . 60 3.6 Conclusions . 64 4 Designing a Modular and Scalable CT DSP: Initial Considerations 66 4.1 Overview of the CT digital FIR filter . 67 4.2 Data movement and storage . 69 4.2.1 Global storage . 70 4.2.2 Local (per-segment) storage . 72 4.3 Micro-architecture of the timing path of delay segments . 73 4.3.1 Serial vs. parallel structure . 74 4.3.2 Delay cell protocol: 2-phase vs. 4-phase . 76 4.4 Arithmetic blocks: multipliers and multi-way adder . 77 4.4.1 Asynchronous multiplier . 78 4.4.2 Multi-way adder . 78 4.5 Internal arithmetic considerations . 81 4.6 On-chip tuning . 82 4.7 Summary . 84 5 A CT DSP Chip with Open Input Format 86 5.1 Contribution of the designed chip . 87 5.2 Basic asynchronous cells . 88 5.3 CT digital FIR chip overview . 89 5.4 Delay line: implementing the timing path . 92 iii 5.4.1 Baseline delay cell . 93 5.4.2 Even and odd delay cells . 95 5.4.3 Operation modes and associated trade offs . 97 5.5 Delay segment: SRAM memory for local data storage . 99 5.6 FIR tap multiplier . 100 5.6.1 Improving on an earlier design . 102 5.7 Multi-way adder . 104 5.8 On-chip automatic tuning . 106 5.9 Tuning interface and FIR-length programming . 108 5.10 Design challenges . 110 5.11 Summary . 111 6 Measurement Results for the CT DSP Chip Prototype 112 6.1 Implementation details . 113 6.2 Test setup . 113 6.3 Frequency response measurements . 116 6.4 Power measurements . 118 6.5 Effect of automatic tuning . 122 6.6 Additional measurements . 122 6.6.1 Absence of aliasing . 123 6.6.2 Delay line operation modes . 124 6.6.3 Types and range of frequency responses . 125 6.7 Comparison to other DSP systems . 126 iv 6.8 Comparison to other reported results . 130 6.9 Summary . 131 7 A Methodology for Designing Real-Time Delay Lines with Dynamically-Adaptive Gran- ularity 132 7.1 Motivation for adaptive granularity . 134 7.1.1 Role of delay lines . 134 7.1.2 Prior designs and limitations . 135 7.2 Contribution: adaptive granularity delay lines . 136 7.3 Simplified view of an adaptive granularity delay line . 138 7.4 Bi-modal adaptive delay line: original approach . 141 7.4.1 Delay line . 142 7.4.2 Mode controller . 145 7.4.3 Asynchronous control line . 151 7.5 MTBF-type bug in the mode controller . 153 7.6 Bi-modal delay line with MTBF enhancement . 156 7.6.1 Mode controller . 158 7.7 Methodology extension for multiple granularity settings: a tri-modal delay line example . 163 7.7.1 Delay line . 164 7.7.2 Mode controller . 166 7.7.3 Asynchronous control line . 172 7.8 Simulation results . ..
Recommended publications
  • A Multidimensional Companding Scheme for Source Coding Witha Perceptually Relevant Distortion Measure
    A MULTIDIMENSIONAL COMPANDING SCHEME FOR SOURCE CODING WITHA PERCEPTUALLY RELEVANT DISTORTION MEASURE J. Crespo, P. Aguiar R. Heusdens Department of Electrical and Computer Engineering Department of Mediamatics Instituto Superior Técnico Technische Universiteit Delft Lisboa, Portugal Delft, The Netherlands ABSTRACT coding mechanisms based on quantization, where less percep- tually important information is quantized more roughly and In this paper, we develop a multidimensional companding vice-versa, and lossless coding of the symbols emitted by the scheme for the preceptual distortion measure by S. van de Par quantizer to reduce statistical redundancy. In the quantization [1]. The scheme is asymptotically optimal in the sense that process of the (en)coder, it is desirable to quantize the infor- it has a vanishing rate-loss with increasing vector dimension. mation extracted from the signal in a rate-distortion optimal The compressor operates in the frequency domain: in its sim- sense, i.e., in a way that minimizes the perceptual distortion plest form, it pointwise multiplies the Discrete Fourier Trans- experienced by the user subject to the constraint of a certain form (DFT) of the windowed input signal by the square-root available bitrate. of the inverse of the masking threshold, and then goes back into the time domain with the inverse DFT. The expander is Due to its mathematical tractability, the Mean Square based on numerical methods: we do one iteration in a fixed- Error (MSE) is the elected choice for the distortion measure point equation, and then we fine-tune the result using Broy- in many coding applications. In audio coding, however, the den’s method.
    [Show full text]
  • (A/V Codecs) REDCODE RAW (.R3D) ARRIRAW
    What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format.
    [Show full text]
  • MC14SM5567 PCM Codec-Filter
    Product Preview Freescale Semiconductor, Inc. MC14SM5567/D Rev. 0, 4/2002 MC14SM5567 PCM Codec-Filter The MC14SM5567 is a per channel PCM Codec-Filter, designed to operate in both synchronous and asynchronous applications. This device 20 performs the voice digitization and reconstruction as well as the band 1 limiting and smoothing required for (A-Law) PCM systems. DW SUFFIX This device has an input operational amplifier whose output is the input SOG PACKAGE CASE 751D to the encoder section. The encoder section immediately low-pass filters the analog signal with an RC filter to eliminate very-high-frequency noise from being modulated down to the pass band by the switched capacitor filter. From the active R-C filter, the analog signal is converted to a differential signal. From this point, all analog signal processing is done differentially. This allows processing of an analog signal that is twice the amplitude allowed by a single-ended design, which reduces the significance of noise to both the inverted and non-inverted signal paths. Another advantage of this differential design is that noise injected via the power supplies is a common mode signal that is cancelled when the inverted and non-inverted signals are recombined. This dramatically improves the power supply rejection ratio. After the differential converter, a differential switched capacitor filter band passes the analog signal from 200 Hz to 3400 Hz before the signal is digitized by the differential compressing A/D converter. The decoder accepts PCM data and expands it using a differential D/A converter. The output of the D/A is low-pass filtered at 3400 Hz and sinX/X compensated by a differential switched capacitor filter.
    [Show full text]
  • TMS320C6000 U-Law and A-Law Companding with Software Or The
    Application Report SPRA634 - April 2000 TMS320C6000t µ-Law and A-Law Companding with Software or the McBSP Mark A. Castellano Digital Signal Processing Solutions Todd Hiers Rebecca Ma ABSTRACT This document describes how to perform data companding with the TMS320C6000 digital signal processors (DSP). Companding refers to the compression and expansion of transfer data before and after transmission, respectively. The multichannel buffered serial port (McBSP) in the TMS320C6000 supports two companding formats: µ-Law and A-Law. Both companding formats are specified in the CCITT G.711 recommendation [1]. This application report discusses how to use the McBSP to perform data companding in both the µ-Law and A-Law formats. This document also discusses how to perform companding of data not coming into the device but instead located in a buffer in processor memory. Sample McBSP setup codes are provided for reference. In addition, this application report discusses µ-Law and A-Law companding in software. The appendix provides software companding codes and a brief discussion on the companding theory. Contents 1 Design Problem . 2 2 Overview . 2 3 Companding With the McBSP. 3 3.1 µ-Law Companding . 3 3.1.1 McBSP Register Configuration for µ-Law Companding. 4 3.2 A-Law Companding. 4 3.2.1 McBSP Register Configuration for A-Law Companding. 4 3.3 Companding Internal Data. 5 3.3.1 Non-DLB Mode. 5 3.3.2 DLB Mode . 6 3.4 Sample C Functions. 6 4 Companding With Software. 6 5 Conclusion . 7 6 References . 7 TMS320C6000 is a trademark of Texas Instruments Incorporated.
    [Show full text]
  • Codec Is a Portmanteau of Either
    What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format.
    [Show full text]
  • TMS320C6000 Mu-Law and A-Law Companding with Software Or The
    Application Report SPRA634 - April 2000 TMS320C6000t µ-Law and A-Law Companding with Software or the McBSP Mark A. Castellano Digital Signal Processing Solutions Todd Hiers Rebecca Ma ABSTRACT This document describes how to perform data companding with the TMS320C6000 digital signal processors (DSP). Companding refers to the compression and expansion of transfer data before and after transmission, respectively. The multichannel buffered serial port (McBSP) in the TMS320C6000 supports two companding formats: µ-Law and A-Law. Both companding formats are specified in the CCITT G.711 recommendation [1]. This application report discusses how to use the McBSP to perform data companding in both the µ-Law and A-Law formats. This document also discusses how to perform companding of data not coming into the device but instead located in a buffer in processor memory. Sample McBSP setup codes are provided for reference. In addition, this application report discusses µ-Law and A-Law companding in software. The appendix provides software companding codes and a brief discussion on the companding theory. Contents 1 Design Problem . 2 2 Overview . 2 3 Companding With the McBSP. 3 3.1 µ-Law Companding . 3 3.1.1 McBSP Register Configuration for µ-Law Companding. 4 3.2 A-Law Companding. 4 3.2.1 McBSP Register Configuration for A-Law Companding. 4 3.3 Companding Internal Data. 5 3.3.1 Non-DLB Mode. 5 3.3.2 DLB Mode . 6 3.4 Sample C Functions. 6 4 Companding With Software. 6 5 Conclusion . 7 6 References . 7 TMS320C6000 is a trademark of Texas Instruments Incorporated.
    [Show full text]
  • Efficient and Accuracy-Ensured Waveform Compression For
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 40, NO. 7, JULY 2021 1437 Efficient and Accuracy-Ensured Waveform Compression for Transient Circuit Simulation Lingjie Li and Wenjian Yu , Senior Member, IEEE Abstract—Efficient and accurate waveform compression is to large data storage for the waveforms. For accurate transient essentially important for the application of analog circuit tran- simulation, the number of time points ranges from several hun- sition simulation nowadays. In this article, an analog waveform dred thousands to even a billion, and voltage/current values compression scheme is proposed which includes two compressed formats for representing small and large signal values, respec- are stored as floating-point numbers. Therefore, the simulator tively. The compressed formats and the corresponding compres- may output a large hard-disk file occupying multiple GBs of sion/decompression approaches ensure that both the absolute and disk space. This further leads to considerable time for writ- relative errors of each signal value restored from the compres- ing the waveforms into a file and for loading the file into a sion are within specified criteria. The formats are integrated in viewer, and largely worsens the performance of circuit simu- a block-by-block compressing procedure which facilitates a sec- ondary lossless compression and a three-stage pipeline scheme lator and the customer experience of waveform viewer. Hence, for fast conversion from the simulator’s output to the com- it is greatly demanded to have an efficient waveform compres- pressed hard-disk file. Theoretical analysis is presented to prove sion approach which reduces the data storage of waveforms the accuracy-ensured property of our approach.
    [Show full text]
  • Video/Audio Compression
    5.9. Video Compression (1) Basics: video := time sequence of single images frequent point of view: video compression = image compression with a temporal component assumption: successive images of a video sequence are similar, e.g. directly adjacent images contain almost the same information that has to be carried only once wanted: strategies to exploit temporal redundancy and irrelevance! motion prediction/estimation, motion compensation, block matching intraframe and interframe coding video compression algorithms and standards are distinguished according to the peculiar conditions, e.g. videoconferencing, applications such as broadcast video Prof. Dr. Paul Müller, University of Kaiserslautern 1 Video Compression (2) Simple approach: M-JPEG compression of a time sequence of images based on the JPEG standard unfortunately, not standardized! makes use of the baseline system of JPEG, intraframe coding, color subsampling 4:1:1, 6 bit quantizer temporal redundancy is not used! applicable for compression ratios from 5:1 to 20:1, higher rates call for interframe coding possibility to synchronize audio data is not provided direct access to full images at every time position application in proprietary consumer video cutting software and hardware solutions Prof. Dr. Paul Müller, University of Kaiserslautern 2 Video Compression (3) Motion prediction and compensation: kinds of motion: • change of color values / change of position of picture elements • translation, rotation, scaling, deformation of objects • change of lights and shadows • translation, rotation, zoom of camera kinds of motion prediction techniques: • prediction of pixels or ranges of pixels neighbouring but no semantic relations • model based prediction grid model with model parameters describing the motion, e.g. head- shoulder-arrangements • object or region based prediction extraction of (video) objects, processing of geometric and texture information, e.g.
    [Show full text]
  • Quantization Error E[N] Exist: 1
    COMP ENG 4TL4: Digital Signal Processing Notes for Lecture #3 Wednesday, September 10, 2003 1.4 Quantization Digital systems can only represent sample amplitudes with a finite set of prescribed values, and thus it is necessary for A/D converters to quantize the values of the samples x[n]. A typical form of quantization uses uniform quantization steps, where the input voltage is either rounded or truncated. (Porat) A 3-bit rounding uniform quantizer with quantization steps of ∆ is shown below. (Oppenheim and Schafer) Two forms of quantization error e[n] exist: 1. Quantization noise: due to rounding or truncation over the range of quantizer outputs; and 2. Saturation (“peak clipping”): due to the input exceeding the maximum or minimum quantizer output. Both types of error are illustrated for the case of a sinusoidal signal in the figure on the next slide. Quantization noise can be minimized by choosing a sufficiently small quantization step. We will derive a method for quantifying how small is “sufficient”. Saturation can be avoided by carefully matching the full- scale range of an A/D converter to anticipated input signal amplitude ranges. Saturation or “peak clipping” (Oppenheim and Schafer) For rounding uniform quantizers, the amplitude of the quantization noise is in the range −∆/2 <e[n] · ∆/2. For small ∆ it is reasonable to assume that e[n] is a random variable uniformly distributed over (−∆/2, ∆/2]. For fairly complicated signals, it is reasonable to assume that successive quantization noise values are uncorrelated and that e[n] is uncorrelated with x[n]. Thus, e[n] is assumed to be a uniformly distributed white- noise sequence with a mean of zero and variance: For a (B+1)-bit quantizer with full-scale Xm, the noise variance (or power) is: The signal-to-noise ratio (SNR) of a (B+1)-bit quantizer is: 2 where σx is the signal variance (or power).
    [Show full text]
  • Companding Flash Analog-To-Digital Converters
    University of Central Florida STARS Retrospective Theses and Dissertations 1986 Companding Flash Analog-To-Digital Converters Michael P. Brig University of Central Florida Part of the Engineering Commons Find similar works at: https://stars.library.ucf.edu/rtd University of Central Florida Libraries http://library.ucf.edu This Masters Thesis (Open Access) is brought to you for free and open access by STARS. It has been accepted for inclusion in Retrospective Theses and Dissertations by an authorized administrator of STARS. For more information, please contact [email protected]. STARS Citation Brig, Michael P., "Companding Flash Analog-To-Digital Converters" (1986). Retrospective Theses and Dissertations. 4953. https://stars.library.ucf.edu/rtd/4953 COMPANDING FLASH- ANALOG-TO-DIGITAL CONVERTERS BY MICHAEL PATRICK BRIG B.S.E., University of Central Florida, 1984 THESIS Submitted in partial fulfillment of the requirements for the degree of Master of Science in Engineering in the Graduate Studies Program of the College of Engineering University of Central Florida Orlando, Florida Spring Term 1986 ABSTRACT This report will ~emonstrate how a companding flash analog-to­ digi tal converter can be used to satisfy both the dynamic rang~ and resolution requirements of an infrared imaging system. Iri the past, infrared imaging systems had to rely on analog electronics to process a thermal image. This was costly and, in many ways, in­ efficient but the only way to perform the function.. Digital process~ ing was impractical because ADC conversion speeds were slow with respect to video frequencies. Furthermore, it is impossible to gain the necessary dynamic range using linear conversion techniques without targe digital wordlengths.
    [Show full text]
  • Digital Communications GATE Online Coaching Classes
    GATE Online Coaching Classes Digital Communications Online Class-2 • By • Dr.B.Leela Kumari • Assistant Professor, • Department of Electronics and Communications Engineering • University college of Engineering Kakinada • Jawaharlal Nehru Technological University Kakinada 6/15/2020 Dr. B. Leela Kumari UCEK JNTUK Kakinada 1 6/15/2020 Dr. B. Leela Kumari UCEK JNTUK Kakinada 1 Session -2 Baseband Transmission • Introduction to Pulse analog Transmission • Quantization and coding • Classification of Uniform Quantization • Line Coding • Non Uniform Quantization • Objective type Questions & Illustrative Problems 6/15/2020 Dr. B. Leela Kumari UCEK JNTUK Kakinada 3 Pulse Analog Transmission Pulse modulation: One of the parameters of the carrier is varied in accordance with the instantaneous value of the modulating signal Classification: Pulse Amplitude Modulation(PAM) Pulse Time Modulation(PTM) Pulse Width Modulation(PWM) Pulse Position Modulation(PPM) Pulse Amplitude Modulation(PAM) Pulse Width Modulation(PWM) Pulse Position Modulation(PPM) Comparisions Quantization Pulse analog modulation systems are not completely digital Sampled signals are used to get a complete digital signal Sampled signals are first quantized and then coded to get complete digital signal Input Signal digital signal Sampler Quantizer Encoder Sampler samples the analog signal Quantizer: Does the Quantizing (approximation) Quantized signal is an approximation to the original signal The Quality of approximation improved by reducing the step Size Encoder translates the Quantized signal more appropriate format of signal Operation of Quantization Process of Quantization Consider message signal m(t) ,whose range is from VL to VH Divide this range in to M equal intervals, each size S, Step size S= (VL –VH )/M In the center of each of these steps ,locate Quantization levels,m0,m1,m2….
    [Show full text]
  • Inline Vector Compression for Computational Physics
    Inline Vector Compression for Computational Physics W. Trojak ∗ Department of Ocean Engineering, Texas A&M University, College Station F. D. Witherden Department of Ocean Engineering, Texas A&M University, College Station Abstract A novel inline data compression method is presented for single-precision vec- tors in three dimensions. The primary application of the method is for accel- erating computational physics calculations where the throughput is bound by memory bandwidth. The scheme employs spherical polar coordinates, angle quantisation, and a bespoke floating-point representation of the magnitude to achieve a fixed compression ratio of 1.5. The anisotropy of this method is considered, along with companding and fractional splitting techniques to improve the efficiency of the representation. We evaluate the scheme nu- merically within the context of high-order computational fluid dynamics. For both the isentropic convecting vortex and the Taylor{Green vortex test cases, the results are found to be comparable to those without compression. arXiv:2003.02633v2 [cs.CE] 24 Jun 2020 Performance is evaluated for a vector addition kernel on an NVIDIA Titan V GPU; it is demonstrated that a speedup of 1.5 can be achieved. Keywords: Vector Compression, GPU computing, Flux Reconstruction ∗Corresponding author Email addresses: [email protected] (W. Trojak ), [email protected] (F. D. Witherden ) Preprint submitted to Elsevier June 25, 2020 2010 MSC: 68U20, 68W40, 68P30, 65M60, 76F65 1. Introduction Over the past 30 years, improvements in computing capabilities, mea- sured in floating-point operations per second, have consistently outpaced improvements in memory bandwidth [1]. A consequence of this is that many numerical methods for partial differential equations are memory bandwidth bound algorithms.
    [Show full text]