How I Came up with the Discrete Cosine Transform Nasir Ahmed Electrical and Computer Engineering Department, University of New Mexico, Albuquerque, New Mexico 87131

Total Page:16

File Type:pdf, Size:1020Kb

How I Came up with the Discrete Cosine Transform Nasir Ahmed Electrical and Computer Engineering Department, University of New Mexico, Albuquerque, New Mexico 87131 mxT*L. BImL4L. PRocEsSlNG 1,4-5 (1991) How I Came Up with the Discrete Cosine Transform Nasir Ahmed Electrical and Computer Engineering Department, University of New Mexico, Albuquerque, New Mexico 87131 During the late sixties and early seventies, there to study a “cosine transform” using Chebyshev poly- was a great deal of research activity related to digital nomials of the form orthogonal transforms and their use for image data compression. As such, there were a large number of T,(m) = (l/N)‘/“, m = 1, 2, . , N transforms being introduced with claims of better per- formance relative to others transforms. Such compari- em- lh) h = 1 2 N T,(m) = (2/N)‘%os 2N t ,...) . sons were typically made on a qualitative basis, by viewing a set of “standard” images that had been sub- jected to data compression using transform coding The motivation for looking into such “cosine func- techniques. At the same time, a number of researchers tions” was that they closely resembled KLT basis were doing some excellent work on making compari- functions for a range of values of the correlation coef- sons on a quantitative basis. In particular, researchers ficient p (in the covariance matrix). Further, this at the University of Southern California’s Image Pro- range of values for p was relevant to image data per- cessing Institute (Bill Pratt, Harry Andrews, Ali Ha- taining to a variety of applications. bibi, and others) and the University of California at Much to my disappointment, NSF did not fund the Los Angeles (Judea Pearl) played a key role. In this proposal; I recall one reviewer’s comment to the effect regard, the variance criterion and the rate distortion that the whole idea seemed “too simple.” Hence I de- criterion were developed and used extensively as per- cided to work on this problem with my Ph.D. student formance measures for evaluating image data com- Mr. T. Natarajan and my friend Dr. Ram Mohan Rao pression. In addition, the Karhunen-Loeve trans- at the University of Texas at Arlington. In fact, I re- form (KLT) evolved as the optimal transform for com- member dedicating the whole summer of 1973 to work parison purposes. With this as background, I can now on this problem. The results that we got appeared too address the DCT issue. good to be true, and I therefore decided to consult What intrigued me was that the KLT was indeed Harry Andrews later that year at a conference in New the optimal transform on the basis of the mean- Orleans. We were both invited speakers at a session square-error criterion and the first-order Markov pro- on Walsh Functions. Harry suggested that I check out cess model, and yet there was no efficient algorithm the performance of this “cosine transform” using the available to compute it. As such, the focus of my re- rate distortion criterion. He then sent me the com- search was to determine whether it would be possible puter program to do so. The results again showed that to come up with a good approximation to the KLT this transform performed better than all the others, that could be computed efficiently. An approach that and its performance compared very closely with that I thought might be worth looking into was Chebyshev of the KLT. When I sent the results back to Harry interpolation, a neat discussion of which was available Andrews, he suggested that I publish them. As such, I in a text book (Computer Evaluation of Mathematical sent them to the IEEE Computer Transactions, and Functions, by C. T. Fike, Prentice-Hall, 1968, Sec- the paper was then published in the January 1974 tions 7.4 and 7.5). This was in early 1972, and I wrote issue. I recall that we decided to send it in as a corre- a proposal to the National Science Foundation (NSF) spondence item in order to get it published with mini- 1051.2004/91$1.50 Copyright 0 1991 by Academic Press, Inc. All rights of reproduction in any form reserved. mum delay. Little did we realize at that time that the Honeywell, Inc., St. Paul, Minnesota. He was with Kansas State resulting “DCT” would be widely used in the future! University, Manhattan, from 1968 to 1983. Since 1983 he has been a Professor in the Electrical and Computer Engineering Depart- It is indeed gratifying to see that the DCT is now ment at the University of New Mexico, Albuquerque. He became essentially a “standard” in the area of image data the Chairman of this department in July 1989. In August 1985 he compression via transform coding techniques. was awarded one of the twelve Presidential Professorships at the University of New Mexico. He is the leading author of Orthogonal Transforms for Digital Signal Processing (Springer-Verlag, 1975), and Discrete-Time Signals and Systems (Reston, 1983), and coau- thor of Computer Science Fundamentals (Merrill, 1979). He is also NASIR AHMED was born in Bangalore, India, in 1940. He re- the author of a number of technical papers in the area of digital ceived the B.S. degree in electrical engineering from the University signal processing. Dr. Ahmed was an Associate Editor for the of Mysore, India, in 1961, and the MS. and Ph.D. degrees from the IEEE Transactions on Acoustics, Speech, and Signal Processing University of New Mexico in 1963 and 1966, respectively. From (1982-1984) and is currently an Associate Editor for the IEEE 1966 to 1968 he worked as Principal Research Engineer in the area Transactions on Electromagnetic Compatibility (Walsh Functions of information processing at the Systems and Research Center, Applications). .
Recommended publications
  • Discrete Cosine Transform Based Image Fusion Techniques VPS Naidu MSDF Lab, FMCD, National Aerospace Laboratories, Bangalore, INDIA E.Mail: [email protected]
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by NAL-IR Journal of Communication, Navigation and Signal Processing (January 2012) Vol. 1, No. 1, pp. 35-45 Discrete Cosine Transform based Image Fusion Techniques VPS Naidu MSDF Lab, FMCD, National Aerospace Laboratories, Bangalore, INDIA E.mail: [email protected] Abstract: Six different types of image fusion algorithms based on 1 discrete cosine transform (DCT) were developed and their , k 1 0 performance was evaluated. Fusion performance is not good while N Where (k ) 1 and using the algorithms with block size less than 8x8 and also the block 1 2 size equivalent to the image size itself. DCTe and DCTmx based , 1 k 1 N 1 1 image fusion algorithms performed well. These algorithms are very N 1 simple and might be suitable for real time applications. 1 , k 0 Keywords: DCT, Contrast measure, Image fusion 2 N 2 (k 1 ) I. INTRODUCTION 2 , 1 k 2 N 2 1 Off late, different image fusion algorithms have been developed N 2 to merge the multiple images into a single image that contain all useful information. Pixel averaging of the source images k 1 & k 2 discrete frequency variables (n1 , n 2 ) pixel index (the images to be fused) is the simplest image fusion technique and it often produces undesirable side effects in the fused image Similarly, the 2D inverse discrete cosine transform is defined including reduced contrast. To overcome this side effects many as: researchers have developed multi resolution [1-3], multi scale [4,5] and statistical signal processing [6,7] based image fusion x(n1 , n 2 ) (k 1 ) (k 2 ) N 1 N 1 techniques.
    [Show full text]
  • Digital Signals
    Technical Information Digital Signals 1 1 bit t Part 1 Fundamentals Technical Information Part 1: Fundamentals Part 2: Self-operated Regulators Part 3: Control Valves Part 4: Communication Part 5: Building Automation Part 6: Process Automation Should you have any further questions or suggestions, please do not hesitate to contact us: SAMSON AG Phone (+49 69) 4 00 94 67 V74 / Schulung Telefax (+49 69) 4 00 97 16 Weismüllerstraße 3 E-Mail: [email protected] D-60314 Frankfurt Internet: http://www.samson.de Part 1 ⋅ L150EN Digital Signals Range of values and discretization . 5 Bits and bytes in hexadecimal notation. 7 Digital encoding of information. 8 Advantages of digital signal processing . 10 High interference immunity. 10 Short-time and permanent storage . 11 Flexible processing . 11 Various transmission options . 11 Transmission of digital signals . 12 Bit-parallel transmission. 12 Bit-serial transmission . 12 Appendix A1: Additional Literature. 14 99/12 ⋅ SAMSON AG CONTENTS 3 Fundamentals ⋅ Digital Signals V74/ DKE ⋅ SAMSON AG 4 Part 1 ⋅ L150EN Digital Signals In electronic signal and information processing and transmission, digital technology is increasingly being used because, in various applications, digi- tal signal transmission has many advantages over analog signal transmis- sion. Numerous and very successful applications of digital technology include the continuously growing number of PCs, the communication net- work ISDN as well as the increasing use of digital control stations (Direct Di- gital Control: DDC). Unlike analog technology which uses continuous signals, digital technology continuous or encodes the information into discrete signal states (Fig. 1). When only two discrete signals states are assigned per digital signal, these signals are termed binary si- gnals.
    [Show full text]
  • Digital Video in Multimedia Pdf
    Digital video in multimedia pdf Continue Digital Electronic Representation of Moving Visual Images This article is about the digital methods applied to video. The standard digital video storage format can be viewed on DV. For other purposes, see Digital Video (disambiguation). Digital video is an electronic representation of moving visual images (video) in the form of coded digital data. This contrasts with analog video, which is a moving visual image with analog signals. Digital video includes a series of digital images displayed in quick succession. Digital video was first commercially introduced in 1986 in Sony D1 format, which recorded a non-repressive standard digital video definition component. In addition to uncompressed formats, today's popular compressed digital video formats include H.264 and MPEG-4. Modern interconnect standards for digital video include HDMI, DisplayPort, Digital Visual Interface (DVI) and Serial Digital Interface (SDI). Digital video can be copied without compromising quality. In contrast, when analog sources are copied, they experience loss of generation. Digital video can be stored in digital media, such as Blu-ray Disc, in computer data storage, or transmitted over the Internet to end users who watch content on a desktop or digital smart TV screen. In everyday practice, digital video content, such as TV shows and movies, also includes a digital audio soundtrack. History Digital Video Cameras Additional Information: Digital Cinematography, Image Sensor, and Video Camera Base for Digital Video Cameras are metallic oxide-semiconductor (MOS) image sensors. The first practical semiconductor image sensor was a charging device (CCD) invented in 1969 using MOS capacitor technology.
    [Show full text]
  • Watermarked Image Compression and Transmission Through Digital Communication System
    International Journal of Electronics, Communication & Instrumentation Engineering Research and Development (IJECIERD) ISSN 2249-684X Vol. 3, Issue 1, Mar 2013, 97-104 © TJPRC Pvt. Ltd. WATERMARKED IMAGE COMPRESSION AND TRANSMISSION THROUGH DIGITAL COMMUNICATION SYSTEM M. VENU GOPALA RAO 1, N. NAGA SWETHA 2, B. KARTHIK 3, D. JAGADEESH PRASAD 4 & K. ABHILASH 5 1Professor, Department of ECE, K L University, Vaddeswaram, Andhra Pradesh, India 2,3,4,5 Student, Department of ECE, K L University, Vaddeswaram, Andhra Pradesh, India ABSTRACT This paper presents a process able to mark digital images with an invisible and undetectable secrete information, called the watermark, followed by compression of the watermarked image and transmitting through a digital communication system. This process can be the basis of a complete copyright protection system. Digital water marking is a feasible method for the protection of ownership rights of digital media such as audio, image, video and other data types. The application includes digital signatures, fingerprinting, broadcast and publication monitoring, copy control, authentication, and secret communication. For the efficient transmission of an image across a channel, source coding in the form of image compression at the transmitter side & the image recovery at the receiver side are the integral process involved in any digital communication system. Other processes like channel encoding, signal modulation at the transmitter side & their corresponding inverse processes at the receiver side along with the channel equalization help greatly in minimizing the bit error rate due to the effect of noise & bandwidth limitations (or the channel capacity) of the channel. The results shows that the effectiveness of the proposed system for retrieving the secret data without any distortion.
    [Show full text]
  • Reception Performance Improvement of AM/FM Tuner by Digital Signal Processing Technology
    Reception performance improvement of AM/FM tuner by digital signal processing technology Akira Hatakeyama Osamu Keishima Kiyotaka Nakagawa Yoshiaki Inoue Takehiro Sakai Hirokazu Matsunaga Abstract With developments in digital technology, CDs, MDs, DVDs, HDDs and digital media have become the mainstream of car AV products. In terms of broadcasting media, various types of digital broadcasting have begun in countries all over the world. Thus, there is a demand for smaller and thinner products, in order to enhance radio performance and to achieve consolidation with the above-mentioned digital media in limited space. Due to these circumstances, we are attaining such performance enhancement through digital signal processing for AM/FM IF and beyond, and both tuner miniaturization and lighter products have been realized. The digital signal processing tuner which we will introduce was developed with Freescale Semiconductor, Inc. for the 2005 line model. In this paper, we explain regarding the function outline, characteristics, and main tech- nology involved. 22 Reception performance improvement of AM/FM tuner by digital signal processing technology Introduction1. Introduction from IF signals, interference and noise prevention perfor- 1 mance have surpassed those of analog systems. In recent years, CDs, MDs, DVDs, and digital media have become the mainstream in the car AV market. 2.2 Goals of digitalization In terms of broadcast media, with terrestrial digital The following items were the goals in the develop- TV and audio broadcasting, and satellite broadcasting ment of this digital processing platform for radio: having begun in Japan, while overseas DAB (digital audio ①Improvements in performance (differentiation with broadcasting) is used mainly in Europe and SDARS (satel- other companies through software algorithms) lite digital audio radio service) and IBOC (in band on ・Reduction in noise (improvements in AM/FM noise channel) are used in the United States, digital broadcast- reduction performance, and FM multi-pass perfor- ing is expected to increase in the future.
    [Show full text]
  • Next-Generation Photonic Transport Network Using Digital Signal Processing
    Next-Generation Photonic Transport Network Using Digital Signal Processing Yasuhiko Aoki Hisao Nakashima Shoichiro Oda Paparao Palacharla Coherent optical-fiber transmission technology using digital signal processing is being actively researched and developed for use in transmitting high-speed signals of the 100-Gb/s class over long distances. It is expected that network capacity can be further expanded by operating a flexible photonic network having high spectral efficiency achieved by applying an optimal modulation format and signal processing algorithm depending on the transmission distance and required bit rate between transmit and receive nodes. A system that uses such adaptive modulation technology should be able to assign in real time a transmission path between a transmitter and receiver. Furthermore, in the research and development stage, optical transmission characteristics for each modulation format and signal processing algorithm to be used by the system should be evaluated under emulated quasi-field conditions. We first discuss how the use of transmitters and receivers supporting multiple modulation formats can affect network capacity. We then introduce an evaluation platform consisting of a coherent receiver based on a field programmable gate array (FPGA) and polarization mode dispersion/polarization dependent loss (PMD/PDL) emulators with a recirculating-loop experimental system. Finally, we report the results of using this platform to evaluate the transmission characteristics of 112-Gb/s dual-polarization, quadrature phase shift keying (DP-QPSK) signals by emulating the factors that degrade signal transmission in a real environment. 1. Introduction amplifiers—which is becoming a limiting factor In the field of optical communications in system capacity—can be efficiently used.
    [Show full text]
  • Digital Signal Processing: Theory and Practice, Hardware and Software
    AC 2009-959: DIGITAL SIGNAL PROCESSING: THEORY AND PRACTICE, HARDWARE AND SOFTWARE Wei PAN, Idaho State University Wei Pan is Assistant Professor and Director of VLSI Laboratory, Electrical Engineering Department, Idaho State University. She has several years of industrial experience including Siemens (project engineering/management.) Dr. Pan is an active member of ASEE and IEEE and serves on the membership committee of the IEEE Education Society. S. Hossein Mousavinezhad, Idaho State University S. Hossein Mousavinezhad is Professor and Chair, Electrical Engineering Department, Idaho State University. Dr. Mousavinezhad is active in ASEE and IEEE and is an ABET program evaluator. Hossein is the founding general chair of the IEEE International Conferences on Electro Information Technology. Kenyon Hart, Idaho State University Kenyon Hart is Specialist Engineer and Associate Lecturer, Electrical Engineering Department, Idaho State University, Pocatello, Idaho. Page 14.491.1 Page © American Society for Engineering Education, 2009 Digital Signal Processing, Theory/Practice, HW/SW Abstract Digital Signal Processing (DSP) is a course offered by many Electrical and Computer Engineering (ECE) programs. In our school we offer a senior-level, first-year graduate course with both lecture and laboratory sections. Our experience has shown that some students consider the subject matter to be too theoretical, relying heavily on mathematical concepts and abstraction. There are several visible applications of DSP including: cellular communication systems, digital image processing and biomedical signal processing. Authors have incorporated many examples utilizing software packages including MATLAB/MATHCAD in the course and also used classroom demonstrations to help students visualize some difficult (but important) concepts such as digital filters and their design, various signal transformations, convolution, difference equations modeling, signals/systems classifications and power spectral estimation as well as optimal filters.
    [Show full text]
  • Multiplexing and Sampling Theory
    Multiplexing and Sampling Theory THE ECONOMY OF MULTIPLEXING contact type determine its current carrying capacity. For instance, laboratory instrument relays typically switch Sampled-Data Systems up to 3A, while industrial applications use larger relays An ideal data acquisition system uses a single ADC for to switch higher currents, often 5 to 10A. each measurement channel. In this way, all data are captured in parallel and events in each channel can be Solid-state switches, on the other hand, are much faster compared in real time. But using a multiplexer, Figure than relays and can reach sampling rates of several MHz. 3.01, that switches among the inputs of multiple chan- However, these devices can’t handle inputs higher than nels and drives a single ADC can substantially reduce 25V, and they are not well suited for isolated applica- the cost of a system. This approach is used in so-called tions. Moreover, solid-state devices are typically limited sampled-data systems. The higher the sample rate, the to handling currents of only one mA or less. closer the system mimics the ideal data acquisition system. But only a few specialized data acquisition Another characteristic that varies between mechani- systems require sample rates of extraordinary speed. cal relays and solid-state switches is called ON resis- Most applications can cope with the more modest tance. An ideal mechanical switch or relay contact sample rates typically offered by mainstream data pair has zero ON resistance. But real devices such acquisition systems. as common reed-relay contacts are 0.010 W or less, a quality analog switch can be 10 to 100 W, and an Solid State Switches vs.
    [Show full text]
  • Carriers and Modulation DIGITAL TRANSMISSION of DIGITAL DATA
    DIGITAL TRANSMISSION OF Carriers and Modulation DIGITAL DATA CS442 Review… Baseband Transmission Digital transmission is the transmission of electrical Baseband Transmission pulses. Digital information is binary in nature in that it has only two possible states 1 or 0. With unipolar signaling techniques, the voltage is Sequences of bits encode data (e.g., text always positive or negative (like a dc current). characters). In bipolar signaling, the 1’s and 0’s vary from a plus Digital signals are commonly referred to as baseband signals. voltage to a minus voltage (like an ac current). In order to successfully send and receive a message, both the sender and receiver have to agree how In general, bipolar signaling experiences fewer errors often the sender can transmit data (data rate). than unipolar signaling because the signals are Data rate often called bandwidth – but there is a more distinct. different definition of bandwidth referring to the frequency range of a signal! 1 Baseband Transmission Baseband Transmission Manchester encoding is a special type of unipolar signaling in which the signal is changed from a high to low (0) or low to high (1) in the middle of the signal. • More reliable detection of transition rather than level – consider perhaps some constant amount of dc noise, transitions still detectable but dc component could throw off NRZ-L scheme – Transitions still detectable even if polarity reversed Manchester encoding is commonly used in local area networks (ethernet, token ring). ANALOG TRANSMISSION OF Manchester Encoding DIGITAL DATA Analog Transmission occurs when the signal sent over the transmission media continuously varies from one state to another in a wave-like pattern.
    [Show full text]
  • Analog and Digital Signals Digital Electronics TM 1.2 Introduction to Analog
    Analog and Digital Signals Digital Electronics TM 1.2 Introduction to Analog Analog & Digital Signals This presentation will • Review the definitions of analog and digital signals. • Detail the components of an analog signal. • Define logic levels. Analog & Digital Signals • Detail the components of a digital signal. • Review the function of the virtual oscilloscope. Digital Electronics 2 Analog and Digital Signals Example of Analog Signals • An analog signal can be any time-varying signal. Analog Signals Digital Signals • Minimum and maximum values can be either positive or negative. • Continuous • Discrete • They can be periodic (repeating) or non-periodic. • Infinite range of values • Finite range of values (2) • Sine waves and square waves are two common analog signals. • Note that this square wave is not a digital signal because its • More exact values, but • Not as exact as analog, minimum value is negative. more difficult to work with but easier to work with Example: A digital thermostat in a room displays a temperature of 72. An analog thermometer measures the room 0 volts temperature at 72.482. The analog value is continuous and more accurate, but the digital value is more than adequate for the application and Sine Wave Square Wave Random-Periodic significantly easier to process electronically. 3 (not digital) 4 Project Lead The Way, Inc. Copyright 2009 1 Analog and Digital Signals Digital Electronics TM 1.2 Introduction to Analog Parts of an Analog Signal Logic Levels Before examining digital signals, we must define logic levels. A logic level is a voltage level that represents a defined Period digital state.
    [Show full text]
  • INTERIM Bual Repo#R
    .* L, '. .. .. ." r . INTERIM BuAL REpo#r I B.W. J319Es (NASA-CE- 166225) S'IOCY OF CCCL~Jb?CBIICNS N83- !777!, 1AI.P CCHFRFSSICI EE11C3S Interin Einal ReFCrt (COY-COCE, Ioc,, EIountain View, Calif.) 267 p HC A I;/!IF AOI CSCL I7E !llnclas i G3/32 32908 Prepared for MAAMfE RESEARCH CEIWEIi MOF'FETT FEU), CALIFOFNU, 94035 NASA TECHNICAL MONITOR Larry B. Hofman COM-CODE, INC. 305 EASY ST. NO.9 IQN. VlEW I CALIFORNIA 94043 - .. , .. SECTION A SUMMARY REPQRT SECTION B VIDEO CCMPFESSION SECTION C LANDSAT IMAGE PRCCESSING SECTION D SATELLITE CaMEINNICATIONS f L SEXTION A SUMMARY REPORT This section provides a brief suaunary report of the work accolnplished under the "Study of Communications .Data Compression Methods", under NASA contract NAS 2-9703. The results are fully explained in subsequent sections on video canpression, Landsat image processipa, and satellite commnications . The first task of contract NAS 2-9703 was to extend a simple monochrome conditional replenishment system to higher compression and to higher motion levels, by incorporating spatially adaptive quantizers and field repeating. Conditional replenishment combines intraframe and interframe compression, and bath areas were to be investigated. The gain of conditional replenishment depends on the fraction of the image changing, since only changed parts of the image need to be transmitted. If the transmission rate is set so that only one-fourth of the image can be transmitted in each field, greater change fractions will overload the system. To accomplish task I, a computer simulation was prepared which incorporated 1) field repeat of changes, 2) a variable change threshold, 3) frame repeat for high change, and 4) two mcde, variable rate Hadamard intraframe quantizers.
    [Show full text]
  • Analytical Links in the Tasks of Digital Content Compression
    Analytical Links in the Tasks of Digital Content Compression Olena Kolganova 1[0000-0002-1301-9611], Viktoriia Kravchenko 1[0000-0003-0399-7013], Lidiia Tereshchenko 1[0000-0001-8183-9016], Volodymyr Shutko 1[0000-0002-9761-5583] and Mykola Shutko 1[0000-0002-3531-7724], Yevhen Vasiliu 2[0000-0002-8582-285X] 1 National Aviation University, Kyiv, Ukraine 2 O.S.Popov Odessa National Academy of Telecommunication, Odessa, Ukraine [email protected], [email protected] Abstract. The article is devoted to the development of a digital image compres- sion algorithm. Image compression is a type of data compression applied to dig- ital images, to reduce their cost for storage or transmission. Algorithms may take advantage of visual perception and the statistical properties of image data. We will consider a lossy compression algorithm. The new algorithm is based on multiscale decomposition with a spline as a basis function. In the process of multiscale analysis, when constructing a spline, we should take into account analytical links. The application of this approach give an increase in the com- pression ratio with the same quality of compressed images. Keywords: Digital Image Compression, Analytical Links, Hermitian Spline, Multiscale Analysis. 1 Introduction Multimedia standards for video compression for personalized television, high defini- tion digital television (HDTV), and image / video database maintenance use close motion and encoding methods. Three basic standards - MPEG-1, MPEG-2 and MPEG-4 were developed by the Moving Picture Experts Group (MPEG), under the auspices of the ISO and the International Telegraph and Telephone Consultative Committee (was renamed the International Telegraph Union (ITU).
    [Show full text]