Hybrid Compression Technique Using Linear Predictive Coding for Electrocardiogram Signals

Total Page:16

File Type:pdf, Size:1020Kb

Hybrid Compression Technique Using Linear Predictive Coding for Electrocardiogram Signals International Journal of Engineering Technology Science and Research IJETSR www.ijetsr.com ISSN 2394 – 3386 Volume 4, Issue 6 June 2017 Hybrid Compression Technique Using Linear Predictive Coding for Electrocardiogram Signals K. S Surekha B. P. Patil Research Scholar, Sinhgad College of Engg. Pune Principal (Assoc. Professor, AIT, Pune) AIT,Pune ABSTRACT Linear Predictive Coding (LPC) is used for analysis and compression of speech signals. Whereas Huffman coding is used forElectrocardiogram (ECG) signal compression. This paper presents a hybrid compression technique for ECG signal using modifiedHuffman encoding andLPC.The aim of this paper is to apply the linear prediction coding and modified Huffman coding for analysis, compression and prediction of ECG signals. The ECG signal is transformed through discrete wavelet transform. MIT-BIH database is used for testing the compression algorithm. The performance measure used to validate the results are the Compression ratio (CR) and Percent Root mean square difference (PRD).The improved CR and PRD obtained in this research prove the quality of the reconstructed signal. Key words Huffman encoding, Linear Predictive Coding, Compression 1. INTRODUCTION ECG gives clear indication about heart diseases and arrhythmias. It is necessary to monitor and store the ECG signal of subjects continuously for diagnosis purpose. Due to this, the data storage size keeps on increasing. In such situations, it is necessary to compress the ECG signal for storage and transmission. The important factor for consideration is to obtain maximum data compression. That must also ensure that the signal retain the clinically important features. Fig. 1 shows the ECG signal with different peaks. Fig. 1 The ECG signal with different peaks As shown in the fig. the ECG signal has different peaks. In this paper, a lossless hybrid technique of ECG signal is presented. The technique used is a hybrid technique consisting of LPC and Huffman encoding. In this paper, compression technique is explained in section 2. The Performance measure is briefed in Section 3. Section 4 gives an outline of the methodology used, Results and implementation. The last sections are of conclusion and references. 497 K. S Surekha, B. P. Patil International Journal of Engineering Technology Science and Research IJETSR www.ijetsr.com ISSN 2394 – 3386 Volume 4, Issue 6 June 2017 2. COMPRESSION TECHNIQUE The major classification of compression techniques is lossy and lossless. The lossy compression technique [1] can lose some part of the data. This paper gives importance to a lossless method of compression using Huffman and LPC.The lossless compression is preferred because of preserving the important features of ECGsignal which are required by doctors for diagnosis. Even the slightest loss of important part of the signal may lead to the wrong diagnosis. 2.1 Huffman encoding and Linear Predictive Coding Huffman coding is used as a compression technique with variable-length codes.The Huffman encoding algorithm initially arranges the symbols in descending order. This is done based on the probabalities.After this step, a binary tree is constructed. Every leaf of the tree includes a symbol. This procedure is carried out in steps. The codewords are obtained from the tree.In Huffman procedure, shorter code words are used for the symbols occurring frequently. In LPC, each expression sample is approximated as a combination of past samples. 2.2 The Wavelet Transform The wavelet transform is most suitable to display both the time and the frequency information together.The wavelet transform has two broad classifications. They are continuous wavelet transform (CWT) and Discrete Wavelet Transform (DWT)[2, 3]. In DWT, the wavelets are discretely sampled. The signal x is passed through low pass and high pass filter stages to obtain DWT of the signal.The convolution operation takes place through these filters simultaneously[4,5]. The corresponding output signal through low pass filter is given as below. y() n x k g n k (1) k A similar operation takes place with the high pass filter.[4,5]. In the next step, both the filter outputs go through the process of sub sampling by 2.The filtering and sub-sampling process are shown in fig. 2. Approximation Coefficients(output) g(n 2 ) x(n)(input) 2 Detail Coefficients(output) h(n) Fig. 2 Filtering and sub- sampling process in DWT The discrete wavelet transform is expressed as f() t Cm,, n m n t (2) 3 THE STANDARD PERFORMANCE MEASURE To check the validity of the results, different performance measures are used. In this research, Compression Ratio (CR) [6] and Percent Root mean square Difference (PRD) are used as standard measures of performance. The CR is defined as the ratio of a number of bits in the original signal to the number of bits in the compressed signal[7]. 498 K. S Surekha, B. P. Patil International Journal of Engineering Technology Science and Research IJETSR www.ijetsr.com ISSN 2394 – 3386 Volume 4, Issue 6 June 2017 4 METHODOLOGY USED , RESULTS AND IMPLEMENTATION MIT-BIH data base [8] is used for the testing purpose. The simulation is carried out using The MATLAB tool. The sampling frequency selected is 330 Hz. Initially, the ECG signal is passed through pre processing stage[9]. Base line wander noise and power line interference[4] are eliminated during this process by filtering process. The transformation of the signal is done through DWT. A threshold value is selected during this process. The coefficients below the threshold value are removed.The predictive coding algorithm is applied after the DWT stage. Once the coefficients are extracted through this step, the modified Huffman encoding step is applied for encoding of the samples. The results are tabulated for the different ECG signals as shown in Table 1. The results are also compared with the other researchers [ 9, 10] to prove the improvement in the CR and PRD. The low value of the PRD also indicates the good reconstruction quality of the signal. That helps in retaining the clinically significant properties of the ECG signal for diagnosis purpose. The waveforms showing both the input and output is shown in fig. 3.As shown in Table 1 the CR achieved by this method has shown a significant improvement by using the hybrid technique. Table 1: Result computation Method CR CR (%) PRD [10]Ranjeet Kumar et. al 6 1.11 [11],ShubhadeepBhaniket.al 50 Proposed 6.7 85 0.5 Input 500 0 amplitude/mv -500 0 20 40 60 80 100 120 140 Output after compression 500 0 amplitude/mv -500 0 20 40 60 80 100 120 140 sample no. Fig. 3 The Input signal and the output signal 5 CONCLUSION The methodology used in our research to make use of the hybrid technique shows the usefulness of the technique. The results also demonstrated improvement in the CR value and at the same time low PRD. The high value of CR is useful for better storage and transmission purpose. The low value of PRD achieved indicates good quality of the ECG signal. This will also help doctors for better diagnosis. The methodology used here is also based on the threshold value. In this research, the optimum value of the threshold value is selected to achieve high CR. 499 K. S Surekha, B. P. Patil International Journal of Engineering Technology Science and Research IJETSR www.ijetsr.com ISSN 2394 – 3386 Volume 4, Issue 6 June 2017 Acknowledgement We are grateful to the Principal SCOE(Sinhgad college of Engineering) HoD SCOE, PhD Coordinator SCOE who provided expertise that greatly assisted research to publish this paper. We are grateful the Director and Joint Director, Army Institute of Technology for providing support in publishing this paper. References [1] Mark Nelson, Jean Loup Gailly.”The Data Compression Book”,BPB, 2nd Edition,2009 [2] Mohammad Pooyan, Ali Taheri, MortezaMoazami-Goudarzi, ImanSaboori, “Wavelet Compression of ECG Signals using SPIHT Algorithm”, International Journal of Information & Communication Engineering, pp.219-225, 2005 [3] Mohammed Abo-Zahhad, Sabah M Ahmed, Ahmed Zakaria, “An Efficient Technique for Compressing ECG Signals Using QRS Detection, Estimation, and 2D DWT Coefficients Thresholding”, Modelling and Simulation in Engineering, Hindawi Publishing Corporation ), pp 1-10, 2012. [4] Surekha K S, B P Patil, “Compression of ECG Signal Using Hybrid Technique”, Intelligent Systems in Science and Information 2014, Springer International Publishing, pp.385-396,2014 [5] Surekha KS, BP Patil, “ECG Signal Compression Using the High Frequency Components of Wavelet Transform”, International Journal of Advanced Computer Science and Applications,Vol. 7, No. 3, 2016 [6] Yao Zau, Jun han, SizhongXuan, Shan Huang XinqianWeng, Debin Fang, XiaoyangZeng, “An energy efficient design for ECG recording and R peak detection based on wavelet Transform”, IEEE Transactions on circuits and systems, Vol. 62, No. 2, pp. 119-123, 2015 [7] Ktata, Sana Ouni, Kai's Ellouze, Nouredd. "A novel compression algorithm for electrocardiogram signals based on wavelet transform and SPIHT.”, International Journal of Signal Processing, 2009 [8] http://ecg.mit.edu/ [9] Sarita Mishra, Debasmit Das, Roshan Kumar, ParasuramanSumathi, “ A power line interference canceler based on sliding DFT phase locking scheme for ECG signals”, IEEE Ttransactions on Instrumentation and Measurement, Vol. 64, No. 1, pp 132-142, 2015 10] Ranjeet Kumar, A. Kumar, Rajesh K. Pandey, “Beta wavelet based ECG signal compression using lossless encoding with modified thresholding”, Journal of Computers in Biology and Medicine, Vol. 39, pp. 130-140, 2013 [11] ShubhadeepBanik, Roshan J. Martis2, Dayananda Nayak3,”Code Excited Linear Prediction Codec For Electrocardiogram”, Proceedings of the 26th Annual International Conference of the IEEE EMBS San Francisco, 2004 500 K. S Surekha, B. P. Patil.
Recommended publications
  • A Survey Paper on Different Speech Compression Techniques
    Vol-2 Issue-5 2016 IJARIIE-ISSN (O)-2395-4396 A Survey Paper on Different Speech Compression Techniques Kanawade Pramila.R1, Prof. Gundal Shital.S2 1 M.E. Electronics, Department of Electronics Engineering, Amrutvahini College of Engineering, Sangamner, Maharashtra, India. 2 HOD in Electronics Department, Department of Electronics Engineering , Amrutvahini College of Engineering, Sangamner, Maharashtra, India. ABSTRACT This paper describes the different types of speech compression techniques. Speech compression can be divided into two main types such as lossless and lossy compression. This survey paper has been written with the help of different types of Waveform-based speech compression, Parametric-based speech compression, Hybrid based speech compression etc. Compression is nothing but reducing size of data with considering memory size. Speech compression means voiced signal compress for different application such as high quality database of speech signals, multimedia applications, music database and internet applications. Today speech compression is very useful in our life. The main purpose or aim of speech compression is to compress any type of audio that is transfer over the communication channel, because of the limited channel bandwidth and data storage capacity and low bit rate. The use of lossless and lossy techniques for speech compression means that reduced the numbers of bits in the original information. By the use of lossless data compression there is no loss in the original information but while using lossy data compression technique some numbers of bits are loss. Keyword: - Bit rate, Compression, Waveform-based speech compression, Parametric-based speech compression, Hybrid based speech compression. 1. INTRODUCTION -1 Speech compression is use in the encoding system.
    [Show full text]
  • Digital Speech Processing— Lecture 17
    Digital Speech Processing— Lecture 17 Speech Coding Methods Based on Speech Models 1 Waveform Coding versus Block Processing • Waveform coding – sample-by-sample matching of waveforms – coding quality measured using SNR • Source modeling (block processing) – block processing of signal => vector of outputs every block – overlapped blocks Block 1 Block 2 Block 3 2 Model-Based Speech Coding • we’ve carried waveform coding based on optimizing and maximizing SNR about as far as possible – achieved bit rate reductions on the order of 4:1 (i.e., from 128 Kbps PCM to 32 Kbps ADPCM) at the same time achieving toll quality SNR for telephone-bandwidth speech • to lower bit rate further without reducing speech quality, we need to exploit features of the speech production model, including: – source modeling – spectrum modeling – use of codebook methods for coding efficiency • we also need a new way of comparing performance of different waveform and model-based coding methods – an objective measure, like SNR, isn’t an appropriate measure for model- based coders since they operate on blocks of speech and don’t follow the waveform on a sample-by-sample basis – new subjective measures need to be used that measure user-perceived quality, intelligibility, and robustness to multiple factors 3 Topics Covered in this Lecture • Enhancements for ADPCM Coders – pitch prediction – noise shaping • Analysis-by-Synthesis Speech Coders – multipulse linear prediction coder (MPLPC) – code-excited linear prediction (CELP) • Open-Loop Speech Coders – two-state excitation
    [Show full text]
  • Arxiv:2004.10531V1 [Cs.OH] 8 Apr 2020
    ROOT I/O compression improvements for HEP analysis Oksana Shadura1;∗ Brian Paul Bockelman2;∗∗ Philippe Canal3;∗∗∗ Danilo Piparo4;∗∗∗∗ and Zhe Zhang1;y 1University of Nebraska-Lincoln, 1400 R St, Lincoln, NE 68588, United States 2Morgridge Institute for Research, 330 N Orchard St, Madison, WI 53715, United States 3Fermilab, Kirk Road and Pine St, Batavia, IL 60510, United States 4CERN, Meyrin 1211, Geneve, Switzerland Abstract. We overview recent changes in the ROOT I/O system, increasing per- formance and enhancing it and improving its interaction with other data analy- sis ecosystems. Both the newly introduced compression algorithms, the much faster bulk I/O data path, and a few additional techniques have the potential to significantly to improve experiment’s software performance. The need for efficient lossless data compression has grown significantly as the amount of HEP data collected, transmitted, and stored has dramatically in- creased during the LHC era. While compression reduces storage space and, potentially, I/O bandwidth usage, it should not be applied blindly: there are sig- nificant trade-offs between the increased CPU cost for reading and writing files and the reduce storage space. 1 Introduction In the past years LHC experiments are commissioned and now manages about an exabyte of storage for analysis purposes, approximately half of which is used for archival purposes, and half is used for traditional disk storage. Meanwhile for HL-LHC storage requirements per year are expected to be increased by factor 10 [1]. arXiv:2004.10531v1 [cs.OH] 8 Apr 2020 Looking at these predictions, we would like to state that storage will remain one of the major cost drivers and at the same time the bottlenecks for HEP computing.
    [Show full text]
  • Mpeg Vbr Slice Layer Model Using Linear Predictive Coding and Generalized Periodic Markov Chains
    MPEG VBR SLICE LAYER MODEL USING LINEAR PREDICTIVE CODING AND GENERALIZED PERIODIC MARKOV CHAINS Michael R. Izquierdo* and Douglas S. Reeves** *Network Hardware Division IBM Corporation Research Triangle Park, NC 27709 [email protected] **Electrical and Computer Engineering North Carolina State University Raleigh, North Carolina 27695 [email protected] ABSTRACT The ATM Network has gained much attention as an effective means to transfer voice, video and data information We present an MPEG slice layer model for VBR over computer networks. ATM provides an excellent vehicle encoded video using Linear Predictive Coding (LPC) and for video transport since it provides low latency with mini- Generalized Periodic Markov Chains. Each slice position mal delay jitter when compared to traditional packet net- within an MPEG frame is modeled using an LPC autoregres- works [11]. As a consequence, there has been much research sive function. The selection of the particular LPC function is in the area of the transmission and multiplexing of com- governed by a Generalized Periodic Markov Chain; one pressed video data streams over ATM. chain is defined for each I, P, and B frame type. The model is Compressed video differs greatly from classical packet sufficiently modular in that sequences which exclude B data sources in that it is inherently quite bursty. This is due to frames can eliminate the corresponding Markov Chain. We both temporal and spatial content variations, bounded by a show that the model matches the pseudo-periodic autocorre- fixed picture display rate. Rate control techniques, such as lation function quite well. We present simulation results of CBR (Constant Bit Rate), were developed in order to reduce an Asynchronous Transfer Mode (ATM) video transmitter the burstiness of a video stream.
    [Show full text]
  • A Practical Approach to Spatiotemporal Data Compression
    A Practical Approach to Spatiotemporal Data Compres- sion Niall H. Robinson1, Rachel Prudden1 & Alberto Arribas1 1Informatics Lab, Met Office, Exeter, UK. Datasets representing the world around us are becoming ever more unwieldy as data vol- umes grow. This is largely due to increased measurement and modelling resolution, but the problem is often exacerbated when data are stored at spuriously high precisions. In an effort to facilitate analysis of these datasets, computationally intensive calculations are increasingly being performed on specialised remote servers before the reduced data are transferred to the consumer. Due to bandwidth limitations, this often means data are displayed as simple 2D data visualisations, such as scatter plots or images. We present here a novel way to efficiently encode and transmit 4D data fields on-demand so that they can be locally visualised and interrogated. This nascent “4D video” format allows us to more flexibly move the bound- ary between data server and consumer client. However, it has applications beyond purely scientific visualisation, in the transmission of data to virtual and augmented reality. arXiv:1604.03688v2 [cs.MM] 27 Apr 2016 With the rise of high resolution environmental measurements and simulation, extremely large scientific datasets are becoming increasingly ubiquitous. The scientific community is in the pro- cess of learning how to efficiently make use of these unwieldy datasets. Increasingly, people are interacting with this data via relatively thin clients, with data analysis and storage being managed by a remote server. The web browser is emerging as a useful interface which allows intensive 1 operations to be performed on a remote bespoke analysis server, but with the resultant information visualised and interrogated locally on the client1, 2.
    [Show full text]
  • (A/V Codecs) REDCODE RAW (.R3D) ARRIRAW
    What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format.
    [Show full text]
  • Multimedia Compression Techniques for Streaming
    International Journal of Innovative Technology and Exploring Engineering (IJITEE) ISSN: 2278-3075, Volume-8 Issue-12, October 2019 Multimedia Compression Techniques for Streaming Preethal Rao, Krishna Prakasha K, Vasundhara Acharya most of the audio codes like MP3, AAC etc., are lossy as Abstract: With the growing popularity of streaming content, audio files are originally small in size and thus need not have streaming platforms have emerged that offer content in more compression. In lossless technique, the file size will be resolutions of 4k, 2k, HD etc. Some regions of the world face a reduced to the maximum possibility and thus quality might be terrible network reception. Delivering content and a pleasant compromised more when compared to lossless technique. viewing experience to the users of such locations becomes a The popular codecs like MPEG-2, H.264, H.265 etc., make challenge. audio/video streaming at available network speeds is just not feasible for people at those locations. The only way is to use of this. FLAC, ALAC are some audio codecs which use reduce the data footprint of the concerned audio/video without lossy technique for compression of large audio files. The goal compromising the quality. For this purpose, there exists of this paper is to identify existing techniques in audio-video algorithms and techniques that attempt to realize the same. compression for transmission and carry out a comparative Fortunately, the field of compression is an active one when it analysis of the techniques based on certain parameters. The comes to content delivering. With a lot of algorithms in the play, side outcome would be a program that would stream the which one actually delivers content while putting less strain on the audio/video file of our choice while the main outcome is users' network bandwidth? This paper carries out an extensive finding out the compression technique that performs the best analysis of present popular algorithms to come to the conclusion of the best algorithm for streaming data.
    [Show full text]
  • Lossless Compression of Audio Data
    CHAPTER 12 Lossless Compression of Audio Data ROBERT C. MAHER OVERVIEW Lossless data compression of digital audio signals is useful when it is necessary to minimize the storage space or transmission bandwidth of audio data while still maintaining archival quality. Available techniques for lossless audio compression, or lossless audio packing, generally employ an adaptive waveform predictor with a variable-rate entropy coding of the residual, such as Huffman or Golomb-Rice coding. The amount of data compression can vary considerably from one audio waveform to another, but ratios of less than 3 are typical. Several freeware, shareware, and proprietary commercial lossless audio packing programs are available. 12.1 INTRODUCTION The Internet is increasingly being used as a means to deliver audio content to end-users for en­ tertainment, education, and commerce. It is clearly advantageous to minimize the time required to download an audio data file and the storage capacity required to hold it. Moreover, the expec­ tations of end-users with regard to signal quality, number of audio channels, meta-data such as song lyrics, and similar additional features provide incentives to compress the audio data. 12.1.1 Background In the past decade there have been significant breakthroughs in audio data compression using lossy perceptual coding [1]. These techniques lower the bit rate required to represent the signal by establishing perceptual error criteria, meaning that a model of human hearing perception is Copyright 2003. Elsevier Science (USA). 255 AU rights reserved. 256 PART III / APPLICATIONS used to guide the elimination of excess bits that can be either reconstructed (redundancy in the signal) orignored (inaudible components in the signal).
    [Show full text]
  • The H.264 Advanced Video Coding (AVC) Standard
    Whitepaper: The H.264 Advanced Video Coding (AVC) Standard What It Means to Web Camera Performance Introduction A new generation of webcams is hitting the market that makes video conferencing a more lifelike experience for users, thanks to adoption of the breakthrough H.264 standard. This white paper explains some of the key benefits of H.264 encoding and why cameras with this technology should be on the shopping list of every business. The Need for Compression Today, Internet connection rates average in the range of a few megabits per second. While VGA video requires 147 megabits per second (Mbps) of data, full high definition (HD) 1080p video requires almost one gigabit per second of data, as illustrated in Table 1. Table 1. Display Resolution Format Comparison Format Horizontal Pixels Vertical Lines Pixels Megabits per second (Mbps) QVGA 320 240 76,800 37 VGA 640 480 307,200 147 720p 1280 720 921,600 442 1080p 1920 1080 2,073,600 995 Video Compression Techniques Digital video streams, especially at high definition (HD) resolution, represent huge amounts of data. In order to achieve real-time HD resolution over typical Internet connection bandwidths, video compression is required. The amount of compression required to transmit 1080p video over a three megabits per second link is 332:1! Video compression techniques use mathematical algorithms to reduce the amount of data needed to transmit or store video. Lossless Compression Lossless compression changes how data is stored without resulting in any loss of information. Zip files are losslessly compressed so that when they are unzipped, the original files are recovered.
    [Show full text]
  • Understanding Compression of Geospatial Raster Imagery
    Understanding Compression of Geospatial Raster Imagery Document Overview This document was created for the North Carolina Geographic Information and Coordinating Council (GICC), http://ncgicc.com, by the GIS Technical Advisory Committee (TAC). Its purpose is to serve as a best practice or guidance document for GIS professionals that are compressing raster images. This document only addresses compressing geospatial raster data and specifically aerial or orthorectified imagery. It does not address compressing LiDAR data. Compression Overview Compression is the process of making data more compact so it occupies less disk storage space. The primary benefit of compressing raster data is reduction in file size. An added benefit is greatly improved performance over a network, because the user is transferring less data from a server to an application; however, compressed data must be decompressed to display in GIS software. The result may be slower raster display in GIS software than data that is not compressed. Compressed data can also increase CPU requirements on the server or desktop. Glossary of Common Terms Raster is a spatial data model made of rows and columns of cells. Each cell contains an attribute value identifying its color and location coordinate. Geospatial raster data like satellite images and aerial photographs are typically larger on average than vector data (predominately points, lines, or polygons). Compression is the process of making a (raster) file smaller while preserving all or most of the data it contains. Imagery compression enables storage of more data (image files) on a disk than if they were uncompressed. Compression ratio is the amount or degree of reduction of an image's file size.
    [Show full text]
  • CALIFORNIA STATE UNIVERSITY, NORTHRIDGE Optimized AV1 Inter
    CALIFORNIA STATE UNIVERSITY, NORTHRIDGE Optimized AV1 Inter Prediction using Binary classification techniques A graduate project submitted in partial fulfillment of the requirements for the degree of Master of Science in Software Engineering by Alex Kit Romero May 2020 The graduate project of Alex Kit Romero is approved: ____________________________________ ____________ Dr. Katya Mkrtchyan Date ____________________________________ ____________ Dr. Kyle Dewey Date ____________________________________ ____________ Dr. John J. Noga, Chair Date California State University, Northridge ii Dedication This project is dedicated to all of the Computer Science professors that I have come in contact with other the years who have inspired and encouraged me to pursue a career in computer science. The words and wisdom of these professors are what pushed me to try harder and accomplish more than I ever thought possible. I would like to give a big thanks to the open source community and my fellow cohort of computer science co-workers for always being there with answers to my numerous questions and inquiries. Without their guidance and expertise, I could not have been successful. Lastly, I would like to thank my friends and family who have supported and uplifted me throughout the years. Thank you for believing in me and always telling me to never give up. iii Table of Contents Signature Page ................................................................................................................................ ii Dedication .....................................................................................................................................
    [Show full text]
  • Data Compression Using Pre-Generated Dictionaries
    Technical Disclosure Commons Defensive Publications Series January 2020 Data compression using pre-generated dictionaries Simon Cooke Follow this and additional works at: https://www.tdcommons.org/dpubs_series Recommended Citation Cooke, Simon, "Data compression using pre-generated dictionaries", Technical Disclosure Commons, (January 16, 2020) https://www.tdcommons.org/dpubs_series/2876 This work is licensed under a Creative Commons Attribution 4.0 License. This Article is brought to you for free and open access by Technical Disclosure Commons. It has been accepted for inclusion in Defensive Publications Series by an authorized administrator of Technical Disclosure Commons. Cooke: Data compression using pre-generated dictionaries Data compression using pre-generated dictionaries ABSTRACT A file is compressed by replacing its characters by codes that are dependent on the statistics of the characters. The character-to-code table, known as the dictionary, is typically incorporated into the compressed file. Popular compression schemes reach theoretical compression limit only asymptotically. Small files or files without much intra-file redundancy, either compress poorly or not at all. This disclosure describes techniques that achieve superior compression, even for small files or files without much intra-file redundancy, by independently maintaining the dictionary at the transmitting and receiving ends of a file transmission, such that the dictionary does not need to be incorporated into the compressed file. KEYWORDS ● Data compression ● Codec dictionary ● Compression dictionary ● Compression ratio ● Page load speed ● Webpage download ● Shannon limit ● Webpage compression ● JavaScript compression ● Web browser Published by Technical Disclosure Commons, 2020 2 Defensive Publications Series, Art. 2876 [2020] BACKGROUND A file is compressed by replacing its characters by codes that are dependent on the statistics of the characters.
    [Show full text]